YouTube Data
The YouTube data share provides access to video transcripts, AI-extracted topics, and video metadata. It is suited to analyzing content trends, tracking influencer narratives, and extracting insights from video content at scale.
Update Frequency: On-demand
What's Included
This data is collected by Heisenberg nodes running the data agent and organized into the following structure.
Video Transcripts
Full transcripts, stored as one row per segment with its start time and duration:
| Field Name | Type | Nullable | Description |
|---|---|---|---|
| id | string | No | Unique transcript segment identifier |
| start | float | No | Timestamp when segment begins (seconds) |
| duration | float | No | Length of the segment (seconds) |
| text | text | No | Transcript text for this segment |
| created_date | timestamp | No | When transcript was processed |
| video_id | string | No | YouTube video identifier |
| metadata_id | string | Yes | Link to video metadata |
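Because every segment carries both `start` and `duration`, its end time is `start + duration`, and finding what was said at a given moment is an interval lookup. The following is a minimal Python sketch of that idea. The `TranscriptSegment` class and the `segment_at` helper are hypothetical names used for illustration; only the field names come from the table above, and the sample rows are made up.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class TranscriptSegment:
    """Illustrative in-memory mirror of one transcript row (subset of fields)."""
    id: str
    video_id: str
    start: float      # seconds from the beginning of the video
    duration: float   # length of the segment in seconds
    text: str

    @property
    def end(self) -> float:
        # End time is derived, not stored in the table.
        return self.start + self.duration


def segment_at(segments: Sequence[TranscriptSegment], t: float) -> Optional[TranscriptSegment]:
    """Return the first segment whose [start, end) interval contains t, if any."""
    for seg in sorted(segments, key=lambda s: s.start):
        if seg.start <= t < seg.end:
            return seg
    return None


# Made-up sample rows for a single video.
segments = [
    TranscriptSegment("a", "vid1", 0.0, 4.5, "welcome back to the channel"),
    TranscriptSegment("b", "vid1", 4.5, 3.0, "today we look at rate cuts"),
]
hit = segment_at(segments, 5.0)
```

Treating the interval as half-open (`start <= t < end`) avoids matching two adjacent segments at the exact boundary between them.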
Video Topics
AI-extracted topics and themes:
| Field Name | Type | Nullable | Description |
|---|---|---|---|
| run_number | integer | No | Analysis batch identifier |
| video_id | string | No | YouTube video identifier |
| channel | string | No | Channel name |
| start | float | No | Topic start timestamp (seconds) |
| duration | float | No | Topic duration (seconds) |
| topic | string | Yes | Identified topic or theme |
| topic_description | text | Yes | Detailed topic explanation |
| insert_at | timestamp | No | Processing timestamp |
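Topic rows share `video_id` with transcript segments and also carry `start` and `duration`, so one way to recover the text behind a topic is to collect the segments that overlap its time window. The sketch below assumes both tables measure seconds on the same video timeline, which the schema suggests but does not state. The helper names `overlaps` and `text_for_topic` are hypothetical, and the rows are made-up samples.

```python
from typing import Dict, List


def overlaps(a_start: float, a_dur: float, b_start: float, b_dur: float) -> bool:
    """True when two [start, start + duration) intervals share any time."""
    return a_start < b_start + b_dur and b_start < a_start + a_dur


def text_for_topic(topic: Dict, segments: List[Dict]) -> str:
    """Join the text of same-video segments that overlap the topic window."""
    matching = [
        s for s in segments
        if s["video_id"] == topic["video_id"]
        and overlaps(s["start"], s["duration"], topic["start"], topic["duration"])
    ]
    matching.sort(key=lambda s: s["start"])
    return " ".join(s["text"] for s in matching)


# Made-up sample rows using the field names from the tables above.
topic = {"video_id": "vid1", "start": 4.0, "duration": 6.0, "topic": "rate cuts"}
segments = [
    {"video_id": "vid1", "start": 0.0, "duration": 4.0, "text": "intro"},
    {"video_id": "vid1", "start": 4.0, "duration": 3.0, "text": "rate cuts are coming"},
    {"video_id": "vid1", "start": 7.0, "duration": 5.0, "text": "here is why"},
    {"video_id": "vid2", "start": 5.0, "duration": 2.0, "text": "unrelated video"},
]
excerpt = text_for_topic(topic, segments)
```

Note that `topic` and `topic_description` are nullable, so code that reads them should handle missing values.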
Video Metadata
Complete video information:
| Field Name | Type | Nullable | Description |
|---|---|---|---|
| id | string | No | Unique metadata identifier |
| video_id | string | No | YouTube video identifier |
| title | string | No | Video title |
| description | text | Yes | Video description |
| published_at | timestamp | No | Upload date |
| insert_at | timestamp | No | Processing timestamp |
| channel | string | No | Channel name |
| processed | boolean | No | Whether the video has been processed |
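Since `metadata_id` on transcript rows is nullable, a lookup from a segment to its metadata may need a fallback. The sketch below assumes that `metadata_id`, when present, matches the metadata table's `id`, and that `video_id` can be used otherwise; neither assumption is confirmed by the schema. The `find_metadata` helper is a hypothetical name and the rows are made-up samples.

```python
from typing import Dict, List, Optional


def find_metadata(segment: Dict, metadata_rows: List[Dict]) -> Optional[Dict]:
    """Resolve a transcript segment to a metadata row.

    Assumption: metadata_id, when present, equals the metadata row's id.
    If it is missing or unmatched, fall back to matching on video_id.
    """
    by_id = {m["id"]: m for m in metadata_rows}
    by_video = {m["video_id"]: m for m in metadata_rows}
    if segment.get("metadata_id") is not None:
        found = by_id.get(segment["metadata_id"])
        if found is not None:
            return found
    return by_video.get(segment["video_id"])


# Made-up sample rows.
metadata_rows = [
    {"id": "m1", "video_id": "vid1", "title": "Rate cuts explained", "channel": "ExampleChannel"},
    {"id": "m2", "video_id": "vid2", "title": "Weekly recap", "channel": "ExampleChannel"},
]
linked = find_metadata({"video_id": "vid1", "metadata_id": "m1"}, metadata_rows)
unlinked = find_metadata({"video_id": "vid2", "metadata_id": None}, metadata_rows)
missing = find_metadata({"video_id": "vid9", "metadata_id": None}, metadata_rows)
```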
Additional Features
| Feature | Benefit |
|---|---|
| Frequent Updates | Data refreshes multiple times per day, so the latest video content and insights are available |
| Full Transcripts | Complete video transcripts with precise timestamps for detailed analysis |
| AI Topic Extraction | Automatically identified topics and themes with detailed descriptions |
| Channel Tracking | Channel-level data for tracking creator activity and narratives |
| Temporal Analysis | Timestamp data for understanding topic evolution and content trends |
| Structured Format | Clean, normalized data ready for immediate use in AI applications |
On-Demand Context Generation
You can build custom context pipelines from this YouTube data using our COOK platform. COOK lets you build personalized Data Agents that process, filter, and transform the data into custom insights for your AI applications. Pipelines can combine YouTube data with other sources, apply custom filters, and generate structured outputs that match your requirements.
Integration
Access Methods
- REST API - Query transcripts, topics, and metadata programmatically
- MCP Integration - Plug directly into AI agents and multi-cloud workflows
- Direct Database Access - PostgreSQL connection for custom analytics
Example Queries
Filter by:
- Channel - Specific creators
- Time range - Recent uploads or date ranges
- Topics - Specific themes or keywords
- Video metadata - Title, description searches
- Transcript content - Full-text search in transcripts
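The filters above can be combined in a single parameterized query. The sketch below shows the shape of such a query in Python, using the standard-library `sqlite3` module with an in-memory database as a stand-in so it runs anywhere. The real share is PostgreSQL, the table name `video_metadata` is hypothetical, and the rows are made-up samples; with a PostgreSQL driver the placeholder syntax and text-search functions would differ.

```python
import sqlite3

# Stand-in database. The table name is an assumption; columns follow the
# Video Metadata table, with timestamps stored as ISO 8601 strings.
conn = sqlite3.connect(":memory:")
conn.execute(
    """CREATE TABLE video_metadata (
           id TEXT PRIMARY KEY, video_id TEXT NOT NULL, title TEXT NOT NULL,
           description TEXT, published_at TEXT NOT NULL, channel TEXT NOT NULL
       )"""
)
conn.executemany(
    "INSERT INTO video_metadata VALUES (?, ?, ?, ?, ?, ?)",
    [
        ("m1", "vid1", "Rate cuts explained", None, "2024-03-01T12:00:00", "MacroDaily"),
        ("m2", "vid2", "Weekly market recap", "Rate cuts and more", "2024-01-15T09:00:00", "MacroDaily"),
        ("m3", "vid3", "Rate cuts: a skeptic's view", None, "2024-03-10T18:30:00", "OtherChannel"),
    ],
)


def recent_videos(conn, channel, since, title_keyword):
    """Filter by channel, publish time, and a case-insensitive title keyword."""
    rows = conn.execute(
        """SELECT video_id, title FROM video_metadata
           WHERE channel = ? AND published_at >= ? AND lower(title) LIKE ?
           ORDER BY published_at DESC""",
        (channel, since, f"%{title_keyword.lower()}%"),
    )
    return rows.fetchall()


results = recent_videos(conn, "MacroDaily", "2024-02-01T00:00:00", "rate cuts")
```

Passing values as parameters rather than formatting them into the SQL string keeps user-supplied channel names and keywords from being interpreted as SQL.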
Analyze by:
- Trending topics - Most discussed themes
- Channel activity - Upload frequency and topics
- Topic evolution - How discussions change over time
- Transcript search - Find specific quotes or concepts
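Two of the analyses above, trending topics and topic evolution, reduce to counting topic rows overall and per time bucket. The following is a minimal Python sketch over rows already fetched into memory. The helper names `trending` and `by_month` are hypothetical, the rows are made-up samples, and rows with a null `topic` are skipped because that column is nullable.

```python
from collections import Counter, defaultdict
from datetime import datetime

# Made-up sample rows using field names from the Video Topics table.
topic_rows = [
    {"channel": "A", "topic": "rate cuts", "insert_at": datetime(2024, 1, 5)},
    {"channel": "B", "topic": "rate cuts", "insert_at": datetime(2024, 2, 9)},
    {"channel": "A", "topic": "ai chips", "insert_at": datetime(2024, 2, 11)},
    {"channel": "A", "topic": None, "insert_at": datetime(2024, 2, 12)},
    {"channel": "B", "topic": "rate cuts", "insert_at": datetime(2024, 2, 20)},
]


def trending(rows, n=3):
    """Return the n most frequent non-null topics as (topic, count) pairs."""
    counts = Counter(r["topic"] for r in rows if r["topic"] is not None)
    return counts.most_common(n)


def by_month(rows):
    """Count non-null topics per calendar month, keyed as 'YYYY-MM'."""
    buckets = defaultdict(Counter)
    for r in rows:
        if r["topic"] is not None:
            buckets[r["insert_at"].strftime("%Y-%m")][r["topic"]] += 1
    return {month: dict(counts) for month, counts in sorted(buckets.items())}


top = trending(topic_rows)
monthly = by_month(topic_rows)
```

This sketch buckets by `insert_at`, which is the processing timestamp. To track when topics were actually discussed, join topic rows to the metadata table on `video_id` and bucket by `published_at` instead.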
Next Steps
Ready to integrate YouTube data into your application?