Data management and security platform Cohesity is redefining how enterprises interact with their data through the launch of its new generative AI assistant, Gaia. In this episode of DEMO, Keith Shaw is joined by Greg Statton, Vice President of AI Solutions at Cohesity, for a hands-on walkthrough of how Gaia helps organizations unlock valuable insights from their existing data assetsβwithout needing to be data scientists or AI experts.Greg demonstrates how Gaia uses retrieval-augmented generation (RAG) and semantic search to sift through massive volumes of unstructured dataβlike archived PDFs, SharePoint files, and NAS drivesβand deliver natural language answers, complete with citations and links to source documents. With tight role-based access controls, a SaaS-based deployment model, and integrations with top cloud AI models from Microsoft, Google, and AWS, Gaia is built for enterprise scale and security.Whether youβre a CIO, CTO, CISO, or data analyst, this demo shows how Cohesity is turning data backup from a static insurance policy into a dynamic knowledge engine. Watch the full episode and see how Gaia can help your team extract real value from your informationβfaster, smarter, and securely.Learn more at cohesity.com/solutions/ai-conversational-search
Register Now
Keith Shaw: Hi, everybody. Welcome to DEMO, the show where companies come in and show us their latest products and platforms. Today, I'm joined by Greg Statton. He is the Vice President of AI Solutions at Cohesity. Welcome to the show, Greg.
Tell me a little bit about Cohesity, and then what you're going to show us today.
Greg Statton: So, Cohesity came into being by revolutionizing how enterprises think about data management, data protection, and data security.
We offer four key applications that run on top of our platform: backupβbeing able to back up all that data from enterprise applications; securing it through anomaly detection, threat hunting, and data classification; and then enabling on-demand access to that data, no matter where it resides globally.
We took a step back and said, "Hey, we want to enable our customers to re-leverage that data." It's often just sitting there as an insurance policy. Our goal is to drive operational efficiency.
So, Iβm excited to talk about our Data Insights pillarβre-leveraging that data through a process called retrieval-augmented generation.
Keith: And again, youβve got a generative AI toolβwhich I think is table stakes for any company that comes on DEMO now. So weβre going to see a little bit of that too, right? Greg: Absolutely. Keith: And who in the company typically uses this? Is it C-level execs?
IT leaders? Data analysts? Can everyone in the enterprise access it? Who's the primary user?
Greg: I love to say everyone on the planet should be able to interact with Cohesityβbut thatβs not quite the case. Primarily, we sell into the technology stackβtypically the CTOβs office. Itβs the core infrastructure operatorsβbackup teams, security teams. So itβs really the CTO and CISO suite.
With these new capabilities, though, we're expanding into the CDO and CIOβs office as well, to help them gain additional value. Keith: And what's the main big problem you're trying to solve? They already have all this dataβyou're just trying to help them extract more value from it, right?
Why should anybody be watching this video?
Greg: Exactlyβweβre trying to help them re-leverage the data they already have, to democratize access to information. The goal is to turn data into information, and then into knowledge.
Keith: Iβm sure youβve got a lot of customersβbut for those who donβt have Cohesity, what are they doing today?
Greg: They're likely using legacy solutions to back up and protect their dataβmethods that aren't efficient or easily accessible.
Keith: So all of that data would just be sitting there and not being used? Greg: Exactly.
It would be backed up using legacy systems, often stored in arcane, proprietary formats that are hard to access or re-leverage.
Keith: Like a big stack of paper somewhere? Greg: Hopefully not! Keith: All right, letβs get into the demo. Show me some cool stuff. Greg: Awesome.
When you access Cohesity through the Cohesity Data Cloud, this is the main splash screen for Heliosβour SaaS control plane that manages everything. Weβll dive into our Data Insights application and show what you can do.
This demo assumes the company is already a Cohesity customer and is backing up data. With our new Cohesity Gaia features, they can now create an on-demand index of data embeddings to drive conversations with their data.
Say an admin identifies SharePoint sites, OneDrives, and NAS shares that contain valuable knowledgeβthey can create a dataset and grant me access. In this case, weβre looking at over 10,000 PDFs historically archived. My boss gave me some very specific tasksβsecurity-related questions for a board meeting.
But first, I want to explore whatβs in this data. As we index the data, we not only create the embeddings indexβwe also do theme and topic analysis. I havenβt read all 10,000 documents; Iβm sure you havenβt either. So this helps me understand the core themes.
I can click on a theme, and start a conversation with the dataβexploratory data analysis on unstructured content.
Now, back to my bossβs requestβheβs heading into a board meeting and wants a summary of unauthorized data breaches from 2018 and 2021. I just joined the company and donβt have the historical context; itβs all locked in peopleβs heads or archives.
So, I ingested the archives Cohesity had already backed up, indexed it, and asked: βCan you summarize the differences between the unauthorized data breaches in 2018 and 2021?β For this demo, Iβm anonymizing company names because it's real data.
The system takes my text question, semantically compares it against 10,000 PDFs, extracts relevant snippets, packages them in a prompt, and sends them to a large language model. The LLM uses that context to generate an answerβwith good detail on both events and general observations.
Because itβs using internal data, I also get resource links and citationsβso I can download the source and add it to the board meeting materials. Keith: Thatβs great for explainabilityβso the AI isnβt just guessing. Greg: Exactly.
You get a much more robust and trustworthy result because you're using your internal data as the source of truth. This isnβt something you'd want to do with a public ChatGPT-type toolβthis is proprietary data. Keith: Right. Greg: What weβre doing here sits between full model fine-tuning and simple querying.
Instead of training a model on all your data, we pull relevant content at the time of inferenceβlike handing an artificial researcher a stack of topic-specific papers and asking a question.
Keith: How do you prevent people from accessing information they shouldn't see? For example, someone in sales doesn't need access to sensitive security files. Is there access control? Greg: Absolutely.
When we create datasets or indexes, admins can include or exclude specific folders or file shares. They can then set role-based access controlsβso only designated individuals or groups can access specific data.
Keith: How long does it take to set up? Is this SaaS-based, and can it be set up in days, hours, weeks? Greg: Great question.
It depends on how much data you have and where it resides. Everything is delivered through our SaaS portal. Right now, it leverages full SaaS infrastructure, but weβre working on bringing it to on-prem customers as well.
Once the feature is enabled, you could be up and asking questions before lunch. The data ingestion and indexing are handled by Cohesityβs infrastructureβyou just bring the questions.
Keith: Is the LLM behind this customized to company-specific data? Or could I ask it random things like directions to a McDonald's?
Greg: You could, but itβll say, βI canβt find that information.β Weβve put specific guardrails in place. We built this modularly to be an enterprise-grade information retrieval platform. Weβve partnered with Microsoft, AWS, and Google to integrate their language models.
Right now, weβre using Cohesityβs Azure OpenAI subscription, but customers can bring their own LLMsβor train smaller internal models if needed.
Keith: Is there anything else you want to show? Greg: I think your viewers get the gist: ask a question, get an answer.
Keith: So how can customers try this out? Do they need to already be Cohesity customers? Greg: The answer is always yesβI used to be in sales! Current customers can reach out to their account reps to request a free trial of Cohesity Gaia.
If you're not a current customer, go to cohesity.com and sign up for free trials of our cloud or on-prem backup offerings. From there, we can easily enable Cohesity Gaia so you can start exploring.
Keith Shaw: Greg Statton from Cohesityβthanks for the demo. Greg: Thank you so much, Keith. Keith: Thatβs all the time we have for todayβs show. Be sure to like the video, subscribe to the channel, and leave your thoughts in the comments.
Join us every week for new episodes of DEMO. Iβm Keith Shaw. Thanks for watching.
Sponsored Links