Skip to main content

YouTube Data

Video transcripts, topics, and metadata for content trend analysis, influencer tracking, and topic extraction. The YouTube data share provides access to video transcripts, AI-extracted topics, and comprehensive metadata. Perfect for analyzing content trends, tracking influencer narratives, and extracting insights from video content at scale.

Update Frequency: On-demand

What's Included

This data is collected by Heisenberg nodes running the data agent and organized into the following structure.

Video Transcripts

Full transcripts with precise timestamps:

Field NameTypeNullableDescription
idstringNoUnique transcript segment identifier
startfloatNoTimestamp when segment begins (seconds)
durationfloatNoLength of the segment (seconds)
texttextNoTranscript text for this segment
created_datetimestampNoWhen transcript was processed
video_idstringNoYouTube video identifier
metadata_idstringYesLink to video metadata

Video Topics

AI-extracted topics and themes:

Field NameTypeNullableDescription
run_numberintegerNoAnalysis batch identifier
video_idstringNoYouTube video identifier
channelstringNoChannel name
startfloatNoTopic start timestamp (seconds)
durationfloatNoTopic duration (seconds)
topicstringYesIdentified topic or theme
topic_descriptiontextYesDetailed topic explanation
insert_attimestampNoProcessing timestamp

Video Metadata

Complete video information:

Field NameTypeNullableDescription
idstringNoUnique metadata identifier
video_idstringNoYouTube video identifier
titlestringNoVideo title
descriptiontextYesVideo description
published_attimestampNoUpload date
insert_attimestampNoProcessing timestamp
channelstringNoChannel name
processedbooleanNoProcessing status

Additional Features

FeatureBenefit
Real-time UpdatesData refreshes multiple times per day, ensuring you always have the latest video content and insights
Full TranscriptsComplete video transcripts with precise timestamps for detailed analysis
AI Topic ExtractionAutomatically identified topics and themes with detailed descriptions
Channel TrackingChannel-level data for tracking creator activity and narratives
Temporal AnalysisTimestamp data for understanding topic evolution and content trends
Structured FormatClean, normalized data ready for immediate use in AI applications

On-Demand Context Generation

Want to create custom context pipelines from this YouTube data? You can generate on-demand context tailored to your specific needs using our COOK platform. COOK allows you to build personalized Data Agents that process, filter, and transform this YouTube data into custom insights perfect for your AI applications. Create context pipelines that combine YouTube data with other sources, apply custom filters, and generate structured outputs that match your exact requirements.

📖 Learn more about COOK →

Integration

Access Methods

REST API - Query transcripts, topics, and metadata programmatically

MCP Integration - Plug directly into AI agents and multi-cloud workflows

Direct Database Access - PostgreSQL connection for custom analytics

Example Queries

Filter by:

  • Channel - Specific creators
  • Time range - Recent uploads or date ranges
  • Topics - Specific themes or keywords
  • Video metadata - Title, description searches
  • Transcript content - Full-text search in transcripts

Analyze by:

  • Trending topics - Most discussed themes
  • Channel activity - Upload frequency and topics
  • Topic evolution - How discussions change over time
  • Transcript search - Find specific quotes or concepts

Next Steps

Ready to integrate YouTube data into your application?