Create a Vertex Data Store
- Updated: 2024/11/06
Create a Vertex Data Store
This is a prerequisite step before you create a Grounded by data store Model connection for using the Google Vertex AI RAG capability.
You would first create a bucket in Google Cloud, and then create a Vertex Data Store in the Agent Builder. We recommend these steps as an example.
Create a bucket in Google Cloud
Before creating a Data Store you would create a bucket in Google Cloud and upload your data to it.
- Log in to your Google Cloud account and navigate to Cloud Storage and click Create Bucket.
- Provide a name for your bucket, keep the remaining parameters as per default settings and click Continue.
- Next, upload your documents into this new bucket. This could comprise of any document format the Vertex Data Store supports, such as: PDF, HTML, TXT, and others.
Create a Vertex Data Store
Once you have completed creating a bucket, you would navigate to the
tab to create a data store.- In the Cloud Storage.Note: You have already created a bucket in Cloud Storage in the earlier steps.
screen, select
- Select .
- Keep all other parameters as per default selection, but specify the folder.
- Click Folder and select the bucket you created earlier, and click Continue.
- In the Configure your data store screen, keep the default options for Location of your data store field.
- Provide a name for the Your data store name
field.Note: The data store comprises of vectorized data that is chunked and encoded, and used for RAG solutions. If you want to chunk your content then you would configure it at this point.
- Expand the Document Processing Options section.
- In the Document parsing section, we recommend
selecting Layout Parser as it works well for most
document types. Note: See Parse and chunk documents.
- Selecting the Layout Parser option enables Document chunking.
- Next, check the Include ancestor headings in chunks option as this helps return a more comprehensive response picked from multiple chunks with overlapping data. Checking this option is optional, as per your requirement.
- Click Create to complete configuring the data store.
Google Data Store summary
In the Agent Builder, navigate to and click the data store you created. You would see a summary screen of your data store with information that you would reference when creating a Grounded by data store Model connection in AI Agent Studio.
- Project ID
- This is the ID of the project as it appears in the Google Data Store. You would require this value when creating a Grounded by data store Model connection in AI Agent Studio. You would find this ID when you select your data store.
- Data store ID
- You would use this ID for defining the authentication details when creating a Grounded by data store Model connection in AI Agent Studio.
- Region
- Displays the region the data store is deployed in.
Select a region from the drop-down list to connect for authenticating the Model connection. You can also add your own region that you configured when you created a data source in Agent Builder. See Create a data store.
- Connected apps
- Shows the Apps the data store is connected to.Note: You can create an App in the Google Data Store and connect it to the data store you created, if you want to test the data source directly in the Google Data Store.
After creating a data store, uploading documents to it takes time. Click on
to view the status of the upload.