Floris Schoenmakers

Create a Custom AI RAG from your Google Sheet

Imagine you want to build AI applications using your own data but lack expertise in databases, APIs, or large language models (LLMs). A straightforward solution is to create a copy of your data in a ‘shadow’ database using tools like Google Sheets, Notion, or Airtable.

Then, you can connect this database to an LLM without writing code. In this example, we are going to connect a (live) Google Sheet to an LLM. The relations are as follows: Google Sheet → API (Sheety) → workflow (Langflow) → LLM.

Introduction

When experimenting with task-specific LLMs and agentic applications, having relevant source data is essential. Bearing in mind that:

  1. Privacy: privacy-sensitive data must remain within your control. When you are unsure where or how the data is stored and used, anonymize it first.
  2. Dynamic updates: the data most relevant to a company is usually updated frequently, which justifies setting up a live shadow database instead of relying on data exports. The workflow in this demo can handle both live data and data exports.

For innovation teams experimenting with AI applications, setting up no-code ‘shadow’ databases can be valuable. These databases allow teams to query and build upon their own data. By developing small proofs of concept or proofs of technology, organizations can better justify future investments in enterprise applications and agents.

In this blog, I’ll show you how to create a simple ‘layman’s’ database that connects to an LLM via a no-code solution. The steps are as follows:

  1. Choose a platform like Google Sheets, Airtable, or Notion. (This example uses Google Sheets.)
  2. Create an API (JSON) using an external service (Sheety).
  3. Set up a workflow with DataStax Langflow (low code/no code).
  4. Start querying and interacting with your own data directly.

Background

Large Language Models (LLMs) are trained on fixed datasets and may not include the most up-to-date information or specific details about niche topics. To make LLMs useful in a business context, they need access to detailed, specific, and frequently updated data.

This is where Retrieval Augmented Generation (RAG) comes into play. RAG enables LLMs to retrieve relevant information from external sources—such as documents, images, or databases—and use it as context in prompts. This approach improves the model’s responses by making them more accurate and contextually relevant.

To retrieve this information efficiently, RAG often relies on a vector store or vector database. These databases store data (text, images, etc.) as numerical embeddings (vectors) that represent their meaning. When a query is made, the vector database finds the most relevant items by comparing embeddings, ensuring the LLM has the right context to generate better answers.
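The ranking a vector database performs can be sketched in a few lines of plain Python. The three-dimensional ‘embeddings’ below are invented for illustration; real embedding models produce vectors with hundreds or thousands of dimensions, and a real vector store adds indexing to make the search fast.

```python
import math

def cosine_similarity(a, b):
    """Score how similar two embedding vectors are (1.0 = same direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Toy "embeddings" -- made-up track names and vectors, for illustration only.
store = {
    "Edelweiss":             [0.9, 0.1, 0.0],
    "Highway to Hell":       [0.0, 0.2, 0.9],
    "Tulips from Amsterdam": [0.8, 0.3, 0.1],
}
query = [0.85, 0.2, 0.05]  # pretend embedding of "songs about flowers"

# Rank stored items by similarity to the query -- this is the core of vector search.
ranked = sorted(store, key=lambda name: cosine_similarity(query, store[name]), reverse=True)
print(ranked)
```

The flower-themed tracks end up at the top of the ranking, which is exactly the behavior the RAG workflow relies on when it fetches context for the LLM.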


Step 1: create an API

First, we need to turn our Google Sheet into a database that can ‘talk’. Using a platform such as Sheety, you can generate an API for your sheet, optionally protected with authentication.
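To get a feel for what such an API returns, here is a sketch that parses a payload in the shape Sheety produces: rows nested under the camelCased sheet name, each with an `id` field. The endpoint URL in the comment and the column names are illustrative, not the real project’s.

```python
import json

# In Langflow's API Request component you would call something like:
#   GET https://api.sheety.co/<project-id>/<project>/tracks
# with an "Authorization: Bearer <token>" header if authentication is enabled.
# The payload below mimics the response shape for a sheet named "tracks".
payload = json.loads("""
{
  "tracks": [
    {"id": 2, "trackName": "Edelweiss", "artist": "Julie Andrews", "popularity": 61},
    {"id": 3, "trackName": "La Vie en Rose", "artist": "Edith Piaf", "popularity": 74}
  ]
}
""")

rows = payload["tracks"]  # rows are nested under the sheet name
print(len(rows), "rows; first track:", rows[0]["trackName"])
```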

Step 2: set up a workflow

By using a platform like DataStax, you can work from a template in which the most essential steps are already pre-programmed. Since we want to load data and ‘add’ it to the knowledge of the LLM, we pick the Vector Store RAG template.

The template contains two workflows:

  1. The Load Data workflow: this workflow loads the data and enables a Vector Search. In the template, the data source is a file upload, but we are going to change that to an API Request.
  2. The Retriever workflow: this workflow integrates the front-end chat interface with our own database. Note that it must search the same Database and Collection that was populated by the Load Data workflow.
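Before a sheet row lands in the collection, the Load Data workflow has to turn it into a text chunk the embedding model can ingest. A minimal sketch of that flattening step, with made-up column names for illustration:

```python
def row_to_document(row):
    """Flatten one sheet row into a text chunk for embedding.

    Skips the row id, which carries no semantic meaning worth embedding.
    """
    return ", ".join(f"{key}: {value}" for key, value in row.items() if key != "id")

# Illustrative rows -- column names are assumptions, not the real sheet's.
rows = [
    {"id": 2, "trackName": "Edelweiss", "artist": "Julie Andrews", "genre": "show tunes"},
    {"id": 3, "trackName": "La Vie en Rose", "artist": "Edith Piaf", "genre": "chanson"},
]

documents = [row_to_document(r) for r in rows]
print(documents[0])
```

Keeping the column names in the text (rather than embedding bare values) gives the model context about what each value means, which tends to improve retrieval quality.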

Step 3: adjust workflow and parameters

The whole workflow looks like this:

By going to the Playground in the upper right corner, you can now chat with your own database. 

My Google Sheet database contains over 150,000 records of Spotify tracks, complete with detailed variables such as track name, artist, popularity, genre, and more. To demonstrate how this setup works, I asked the system: “What are the most tracks that have something to do with a flower? Give me 10 examples.” 

The LLM interpreted the context of the question—searching for associations with flowers—and queried the database accordingly. This example shows how the system combines natural language understanding with precise data retrieval, making it a powerful tool for exploring large datasets without complex queries or technical expertise.

Concluding

Google Sheets offers a simple and accessible way to start creating shadow databases for AI experiments. Tools like Notion and Airtable expand this functionality further by allowing the inclusion of PDFs and other documents, making them powerful alternatives for more complex datasets. With platforms like DataStax, you can publish your own applications, streamlining the process of turning experimental workflows into tangible results.

In an upcoming blog, I’ll show how you can use similar workflows to build AI agents that perform research and analysis on competitors.

The same principles of LLMs and RAG outlined in this guide can be applied to more complex enterprise solutions. Eli5 leverages these methods to create tailored AI applications and agents that interact seamlessly with their own data, opening up new opportunities for innovation and operational efficiency.

Floris Schoenmakers
Chief Venture and Growth Officer