Virtualization Technology News and Information
Article
RSS
Airbyte Makes Hundreds of Data Sources Available for Artificial Intelligence Applications

Airbyte made available connectors for the Pinecone and Chroma vector databases as the destination for moving data from hundreds of data sources, which then can be accessed by artificial intelligence (AI) models.

"We are the first general-purpose data movement platform to add support for vector databases - the first to build a bridge between data movement platforms and AI," said Michel Tricot, CEO, Airbyte. "Now, Pinecone and Chroma users don't have to struggle with creating custom code to bring in data; they can use the new Airbyte connector to select the data sources they want."

Because vector databases have the ability to interpret data to create relationships, their usage is increasingly popular as users seek to gain more meaning from data. Vector databases are ideal for applications like recommendation systems, anomaly detection and natural language processing, and as sources for AI applications - specifically Large Language Models (LLM).

The vector database destination in Airbyte now enables users to configure the full ELT pipeline, starting from extracting records from a wide variety of sources to separating unstructured and structured data, preparing and embedding text contents of records, and finally loading them into vector databases - all through a single, user-friendly interface. These vector databases can then be accessed by LLMs. All existing advantages of the Airbyte platform are now extended to vector databases, including the following.

  • The largest catalog of data sources that can be connected within minutes, and optimized for performance.
  • Availability of the no-code connector builder that makes it possible to easily and quickly create new connectors for data integrations that addresses the "long-tail" of data sources.
  • Ability to do incremental syncs to only extract changes in the data from a previous sync.
  • Built-in resiliency in the event of a disrupted session moving data, so the connection will resume from the point of the disruption.
  • Secure authentication for data access.
  • Ability to schedule and monitor status of all syncs.

Airbyte continues to innovate and support cutting-edge technologies to empower organizations in their data integration journey. The addition of vector database support marks another significant milestone in Airbyte's commitment to providing powerful and efficient solutions for data integration and analysis.

The vector database destination is currently in alpha status and available supporting: Pinecone on both Airbyte Cloud and the Open Source Software (OSS) version; Chroma and the embedded DocArray database on Airbyte OSS; plus more options in the future.

Published Tuesday, August 08, 2023 1:10 PM by David Marshall
Filed under:
Comments
There are no comments for this post.
To post a comment, you must be a registered user. Registration is free and easy! Sign up now!
Calendar
<August 2023>
SuMoTuWeThFrSa
303112345
6789101112
13141516171819
20212223242526
272829303112
3456789