Apache Iceberg

Snowflake Unveils Polaris Catalog and Emphasizes Commitment to Interoperability with AWS, Google Cloud, Microsoft Azure, Salesforce, and More

Retrieved on: 
星期一, 六月 3, 2024

With Polaris Catalog, users now gain a single, centralized place for any engine to find and access an organization’s Iceberg tables with full, open interoperability.

Key Points: 
  • With Polaris Catalog, users now gain a single, centralized place for any engine to find and access an organization’s Iceberg tables with full, open interoperability.
  • Since Polaris Catalog’s backend implementation will be open source, organizations can freely swap the hosting infrastructure while eliminating vendor lock-in.
  • This comes on the heels of Snowflake and Microsoft’s recent partnership expansion , which creates more seamless interoperability between Snowflake and Fabric.
  • Snowflake also recently announced Snowflake Arctic , one of the most open, enterprise-grade large language models (LLM) on the market.

Dremio Reinforces Ongoing Commitment to Open Lakehouses with New Support for Apache Iceberg REST Catalog Specification

Retrieved on: 
星期四, 五月 30, 2024

The Iceberg REST Catalog Specification is the agreed upon foundation for metadata accessibility across Iceberg catalogs.

Key Points: 
  • The Iceberg REST Catalog Specification is the agreed upon foundation for metadata accessibility across Iceberg catalogs.
  • With this new capability, Dremio is able to seamlessly read from, and write to, any REST-compatible Iceberg catalog, and provide customers with the open, flexible ecosystem needed for enterprise interoperability at scale.
  • Lakehouses create a new open architecture for analytics by completely separating metadata from compute and storage, and externalizing transactional semantics.
  • This news emphasizes Dremio's focus on fostering an open community around Apache Iceberg, the open standard for data lakehouse tables.

Informatica Unveils Blueprint for Enterprise Generative AI Applications for Snowflake Cortex AI

Retrieved on: 
星期二, 五月 21, 2024

Informatica (NYSE: INFA), an enterprise cloud data management leader, unveiled new innovations for the Snowflake Data Cloud: Native SQL ELT to deliver better performance for data pipeline workloads and provide access to 250+ native Snowflake functions and a Blueprint for enterprise-grade generative AI application development for Snowflake Cortex AI based on a foundation of rich metadata, trusted data and no-code orchestration.

Key Points: 
  • Informatica (NYSE: INFA), an enterprise cloud data management leader, unveiled new innovations for the Snowflake Data Cloud: Native SQL ELT to deliver better performance for data pipeline workloads and provide access to 250+ native Snowflake functions and a Blueprint for enterprise-grade generative AI application development for Snowflake Cortex AI based on a foundation of rich metadata, trusted data and no-code orchestration.
  • Blueprint for Enterprise-Grade Generative AI Applications with Cortex AI provides customers with a template architecture to develop generative AI applications that are contextualized with enterprise metadata, grounded with high-quality, trusted data and scaled through no-code development and orchestration.
  • The blueprint combines Snowflake’s Cortex AI generative AI service with key IDMC services including Cloud Data Integration, Cloud Data Quality, Cloud Data Cataloging and Governance, Cloud Data Access Management, Master Data Management and Cloud Application Integration orchestration delivering a retrieval augmented generation (RAG) solution that grounds generative AI applications with trusted data and metadata while ensuring appropriate data access controls.
  • These are the latest Snowflake integrations since Informatica launched four new product capabilities at Snowflake Summit 2023, including Informatica Superpipe, Enterprise Data Integrator Private Preview, Cloud Data Integration-Free and support for Apache Iceberg on Snowflake.

Teradata Embraces Open Table Formats, Iceberg and Delta Lake, to Deliver the Most Open and Connected Ecosystem for Trusted AI

Retrieved on: 
星期二, 四月 30, 2024

Teradata’s fully open and connected approach is designed to be future-ready and allow enterprises to employ a modern data strategy with unmatched agility and flexibility for executing Trusted AI at scale.

Key Points: 
  • Teradata’s fully open and connected approach is designed to be future-ready and allow enterprises to employ a modern data strategy with unmatched agility and flexibility for executing Trusted AI at scale.
  • Teradata’s agnostic OTF support and open catalog integration is designed to enable the platform to read various catalogs with predictable execution.
  • “In today’s data landscape, we’re seeing wide adoption of open table formats with 51% of organizations actively adopting Delta tables and 27% adopting Apache Iceberg.
  • Forward-looking statements in this release include the availability, capabilities, and benefits provided by the integration of open table formats with Teradata’s VantageCloud platform and Teradata AI Unlimited.

Salesforce Unveils Zero Copy Partner Network, an Ecosystem Committed to Secure, Bidirectional Zero Copy Integration with Salesforce Data Cloud

Retrieved on: 
星期四, 四月 25, 2024

Salesforce (NYSE: CRM) today announced the Salesforce Zero Copy Partner Network, a global ecosystem of technology and solution providers building secure, bidirectional zero copy integrations with Salesforce Data Cloud so that data can be actioned across the Salesforce Einstein 1 Platform .

Key Points: 
  • Salesforce (NYSE: CRM) today announced the Salesforce Zero Copy Partner Network, a global ecosystem of technology and solution providers building secure, bidirectional zero copy integrations with Salesforce Data Cloud so that data can be actioned across the Salesforce Einstein 1 Platform .
  • Salesforce previously introduced the concept of zero copy bidirectional integrations with Data Cloud via partnerships with Amazon Redshift , Databricks , Google Cloud’s BigQuery , and Snowflake .
  • Salesforce and Microsoft are jointly working to give customers the ability to access their critical business data in Azure Synapse and bidirectional zero copy data access with Microsoft Fabric and Salesforce Data Cloud.
  • Data Kits enable ISVs to distribute their high-value datasets to Data Cloud customers and land that data in Data Cloud pre-mapped into the Data Cloud customer data model, with no transformations required.

Iceberg Summit Unveils Speaker Lineup for May 14-15 Free, Virtual Event

Retrieved on: 
星期三, 四月 24, 2024

Today the Iceberg Summit Selection Committee announced the lineup of speakers for the first Iceberg Summit: a free, virtual event being held May 14-15, 2024.

Key Points: 
  • Today the Iceberg Summit Selection Committee announced the lineup of speakers for the first Iceberg Summit: a free, virtual event being held May 14-15, 2024.
  • The summit will be the first event dedicated to Apache Iceberg, spanning two days with more than 30 sessions.
  • Iceberg Summit explores the state and evolution of technology in the Iceberg project and ecosystem, as well as the real-world experiences of data practitioners and developers working with Iceberg.
  • “The growth and vibrancy of the Iceberg community has been tremendous, and truly exemplifies the Apache Way.”
    To register for Iceberg Summit, visit https://iceberg-summit.org .

Starburst Advances 'Icehouse' for Near Real-Time Analytics on the Open Data Lakehouse

Retrieved on: 
星期三, 四月 10, 2024

NEW YORK, April 10, 2024 /PRNewswire/ -- Starburst, the open data lakehouse company, today announced at Data Universe its fully managed Icehouse implementation on Starburst's multi-cloud data lakehouse service, Galaxy. With the Galaxy Icehouse, customers can benefit from the scalability, performance, and cost-effectiveness of a combined Trino and Iceberg architecture (Icehouse) without the burden and cost of building and maintaining a custom solution themselves. This announcement builds on the strong momentum of Starburst Galaxy including 3x year-over-year growth in both active customers and usage volume. Starburst is setting new benchmarks in the industry, proven by the rapid adoption of its Galaxy platform, addressing customers' need for an open data lakehouse architecture. Customers can sign up for early access to Galaxy Icehouse here starting today.

Key Points: 
  • NEW YORK, April 10, 2024 /PRNewswire/ -- Starburst , the open data lakehouse company, today announced at Data Universe its fully managed Icehouse implementation on Starburst's multi-cloud data lakehouse service, Galaxy.
  • Starburst is setting new benchmarks in the industry, proven by the rapid adoption of its Galaxy platform, addressing customers' need for an open data lakehouse architecture.
  • Organizations are increasingly turning to an open data lakehouse architecture to power interactive applications and run their business.
  • Effectively operationalizing an Icehouse requires handling data ingestion, data governance, Iceberg data management, and capacity management at scale, especially in multi-cloud environments.

Starburst Advances 'Icehouse' for Near Real-Time Analytics on the Open Data Lakehouse

Retrieved on: 
星期三, 四月 10, 2024

NEW YORK, April 10, 2024 /PRNewswire/ -- Starburst, the open data lakehouse company, today announced at Data Universe its fully managed Icehouse implementation on Starburst's multi-cloud data lakehouse service, Galaxy. With the Galaxy Icehouse, customers can benefit from the scalability, performance, and cost-effectiveness of a combined Trino and Iceberg architecture (Icehouse) without the burden and cost of building and maintaining a custom solution themselves. This announcement builds on the strong momentum of Starburst Galaxy including 3x year-over-year growth in both active customers and usage volume. Starburst is setting new benchmarks in the industry, proven by the rapid adoption of its Galaxy platform, addressing customers' need for an open data lakehouse architecture. Customers can sign up for early access to Galaxy Icehouse here starting today.

Key Points: 
  • NEW YORK, April 10, 2024 /PRNewswire/ -- Starburst , the open data lakehouse company, today announced at Data Universe its fully managed Icehouse implementation on Starburst's multi-cloud data lakehouse service, Galaxy.
  • Starburst is setting new benchmarks in the industry, proven by the rapid adoption of its Galaxy platform, addressing customers' need for an open data lakehouse architecture.
  • Organizations are increasingly turning to an open data lakehouse architecture to power interactive applications and run their business.
  • Effectively operationalizing an Icehouse requires handling data ingestion, data governance, Iceberg data management, and capacity management at scale, especially in multi-cloud environments.

Dremio Solidifies its Position as Premier Apache Iceberg Data Lakehouse Platform with New Ingestion Automation and Optimization Capabilities

Retrieved on: 
星期二, 四月 9, 2024

Santa Clara, CA, April 09, 2024 (GLOBE NEWSWIRE) -- Dremio , the unified lakehouse platform for self-service analytics and AI, has unveiled new capabilities that simplify the process of building and managing an Apache Iceberg data lakehouse.

Key Points: 
  • Santa Clara, CA, April 09, 2024 (GLOBE NEWSWIRE) -- Dremio , the unified lakehouse platform for self-service analytics and AI, has unveiled new capabilities that simplify the process of building and managing an Apache Iceberg data lakehouse.
  • By automating Iceberg management processes, Dremio not only reduces total cost of ownership (TCO), but also enhances data team productivity and improves overall time-to-insight.
  • "Dremio has been a key partner in helping us build our modern data stack solution that powers our Project BI Data Lakehouse.
  • "As  the premier data lakehouse platform for Apache Iceberg, we are excited to further extend our data ingestion, processing, and optimization capabilities for Apache’s leading open source, high performance format,” said Tomer Shiran, founder of Dremio.

Dremio Announces General Availability on Microsoft Azure

Retrieved on: 
星期二, 三月 26, 2024

Dremio , the unified lakehouse platform for self-service analytics, today announced the general availability of Dremio Cloud on Microsoft Azure.

Key Points: 
  • Dremio , the unified lakehouse platform for self-service analytics, today announced the general availability of Dremio Cloud on Microsoft Azure.
  • “With Dremio Cloud on Azure, we have the ability to shift from just standard reporting to leveraging a data set as a product.
  • "Dremio Cloud on Microsoft Azure is a game-changer for organizations seeking self-service analytics with flexibility in a SaaS environment.
  • We're thrilled to bring Dremio Cloud to Azure, enabling our customers to experience the future of data management,” said Tomer Shiran, founder and CPO of Dremio.