Features
- Multi-format Input: Audio files, Fathom/Zoom URLs, or plain text transcripts
- Automated Transcription: Speech-to-text conversion using Gemini Files API
- Context Enhancement: Optional document and website content integration
- Sales-focused Analysis: Specialized prompts for objection handling, sentiment, and action items
- Real-time Processing: Streaming analysis results with cost tracking
Architecture Overview
The analyzer supports three distinct processing modes based on input type:
Processing Modes
Audio Service
Processes uploaded audio files through transcription and analysis.
- Supported Formats: MP3, WAV, M4A, FLAC
- File Size Limit: 25MB
- Duration Limit: 2 hours
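The Audio Service limits can be checked on the client before anything is uploaded. The following is a minimal sketch, not the service's own validation: `check_audio` is a hypothetical helper, 25MB is assumed to mean 25 × 1024 × 1024 bytes, and `DURATION_TOO_LONG` is an illustrative label that does not appear in the error table.

```python
from pathlib import Path

# Limits taken from the Audio Service description.
SUPPORTED_EXTENSIONS = {".mp3", ".wav", ".m4a", ".flac"}
MAX_FILE_BYTES = 25 * 1024 * 1024    # 25MB, assuming 1MB = 1024 * 1024 bytes
MAX_DURATION_SECONDS = 2 * 60 * 60   # 2 hours


def check_audio(filename: str, size_bytes: int, duration_seconds: float) -> list[str]:
    """Return a list of problems; an empty list means the file passes all checks."""
    problems = []
    if Path(filename).suffix.lower() not in SUPPORTED_EXTENSIONS:
        problems.append("INVALID_AUDIO_FORMAT")
    if size_bytes > MAX_FILE_BYTES:
        problems.append("FILE_TOO_LARGE")
    if duration_seconds > MAX_DURATION_SECONDS:
        # Hypothetical label: the documented error table has no duration error.
        problems.append("DURATION_TOO_LONG")
    return problems
```

Returning a list instead of raising on the first failure lets a caller report every problem with a file at once.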
Phantom Service
Extracts transcripts from meeting platform URLs using browser automation.
- Supported Platforms: Fathom, Zoom, Google Meet
- Method: Playwright headless browser extraction
- Authentication: Platform-specific session handling
Transcript Service
Analyzes pre-existing text transcripts directly.
- Input Format: Plain text
- Size Limit: 50,000 characters
- Encoding: UTF-8
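To illustrate how one input could be routed to one of the three modes, here is a rough sketch. The function name `select_mode`, the mode labels, and the detection rules (URL prefix, file extension) are assumptions made for illustration; the analyzer's actual routing logic is not described in this document.

```python
MAX_TRANSCRIPT_CHARS = 50_000  # Transcript Service size limit
AUDIO_EXTENSIONS = (".mp3", ".wav", ".m4a", ".flac")


def select_mode(value: str) -> str:
    """Guess which processing mode a single input string belongs to."""
    text = value.strip()
    lowered = text.lower()
    # Anything that looks like a web link goes to URL extraction.
    if lowered.startswith(("http://", "https://")):
        return "url"
    # A single token ending in a supported extension is treated as an audio path.
    if " " not in text and lowered.endswith(AUDIO_EXTENSIONS):
        return "audio"
    # Everything else is treated as a plain-text transcript.
    if len(text) > MAX_TRANSCRIPT_CHARS:
        raise ValueError("transcript exceeds 50,000 characters")
    return "transcript"
```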
Implementation
Audio Processing Workflow
URL Extraction Workflow
API Reference
Analysis Endpoint
Core Components
| Component | Purpose |
|---|---|
| Gemini Files API | Audio upload and transcription |
| Playwright Engine | Browser automation for URL extraction |
| Web Scraper | Website content extraction |
| Document Parser | PDF, DOCX, TXT content extraction |
| LLM Analysis Chain | Sales-focused conversation analysis |
| Prompt Templates | Structured analysis instructions |
| Cost Tracker | Token usage and pricing calculation |
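As an illustration of what the Cost Tracker component does, the sketch below accumulates token counts and converts them to a cost. The class shape and the per-million-token pricing model are assumptions; the prices passed in are placeholders, not real model rates.

```python
from dataclasses import dataclass
from decimal import Decimal


@dataclass
class CostTracker:
    # Hypothetical prices in USD per 1,000,000 tokens; substitute real rates.
    input_price_per_million: Decimal
    output_price_per_million: Decimal
    input_tokens: int = 0
    output_tokens: int = 0

    def record(self, input_tokens: int, output_tokens: int) -> None:
        """Add the token usage of one model call to the running totals."""
        self.input_tokens += input_tokens
        self.output_tokens += output_tokens

    def total_cost(self) -> Decimal:
        """Return the accumulated cost in USD."""
        million = Decimal(1_000_000)
        return (
            self.input_tokens * self.input_price_per_million
            + self.output_tokens * self.output_price_per_million
        ) / million
```

`Decimal` is used instead of `float` so that many small per-call amounts add up without rounding drift.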
Configuration
Audio Processing Settings
Analysis Parameters
Usage Examples
Audio File Analysis
Meeting URL Analysis
Python Implementation
Performance & Limits
Processing Metrics
- Audio Transcription: 1-3 minutes per hour of audio
- URL Extraction: 15-45 seconds depending on platform
- Text Analysis: 5-15 seconds for typical call transcripts
- Concurrent Processing: Up to 10 analyses simultaneously
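A client that submits many calls at once can stay under the limit of 10 simultaneous analyses with a semaphore. This is a minimal sketch: `run_all` is a hypothetical helper, and `analyze` stands in for whatever coroutine actually submits one analysis.

```python
import asyncio

MAX_CONCURRENT_ANALYSES = 10  # concurrency ceiling from the metrics listed here


async def run_all(inputs, analyze):
    """Run analyze(item) for every input, never more than 10 at a time."""
    semaphore = asyncio.Semaphore(MAX_CONCURRENT_ANALYSES)

    async def guarded(item):
        # Each task waits here until one of the 10 slots is free.
        async with semaphore:
            return await analyze(item)

    # gather() returns results in the same order as the inputs.
    return await asyncio.gather(*(guarded(item) for item in inputs))
```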
Rate Limits
- API Requests: 100 analyses per hour per API key
- File Uploads: 50MB total per hour
- Token Usage: Tracked and reported per analysis
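The hourly request limit can also be enforced on the client so that requests are held back before the service rejects them. The sketch below is one possible approach, a sliding-window counter; the class name and the injectable `clock` parameter are illustrative, and the service's own accounting may differ.

```python
import time
from collections import deque


class HourlyLimiter:
    """Sliding-window counter: at most `limit` calls in any `window` seconds."""

    def __init__(self, limit: int = 100, window: float = 3600.0, clock=time.monotonic):
        self.limit = limit
        self.window = window
        self.clock = clock      # injectable so the limiter can be tested without waiting
        self.calls = deque()    # timestamps of accepted calls, oldest first

    def try_acquire(self) -> bool:
        """Record a call and return True, or return False if the window is full."""
        now = self.clock()
        # Drop timestamps that have aged out of the window.
        while self.calls and now - self.calls[0] >= self.window:
            self.calls.popleft()
        if len(self.calls) >= self.limit:
            return False
        self.calls.append(now)
        return True
```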
Platform Limitations
- Fathom: Requires public or shared call links
- Zoom: Cloud recordings only, local recordings not supported
- Google Meet: Requires Google Workspace with recording enabled
Error Handling
Common Error Responses
| Status Code | Error Type | Description | Solution |
|---|---|---|---|
| 400 | INVALID_AUDIO_FORMAT | Unsupported audio file format | Convert to MP3, WAV, M4A, or FLAC |
| 413 | FILE_TOO_LARGE | Audio file exceeds size limit | Compress audio or split into segments |
| 422 | TRANSCRIPTION_FAILED | Unable to process audio content | Check audio quality and format |
| 404 | URL_NOT_ACCESSIBLE | Meeting URL is private or invalid | Verify URL is public and accessible |
| 429 | RATE_LIMIT_EXCEEDED | Too many concurrent requests | Implement request throttling |
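The error table maps naturally onto client-side handling. In the sketch below, `send` is a hypothetical callable that performs one request and returns a `(status, body)` pair. Treating 429 as the only retryable status and using exponential backoff are assumptions, as is the one-to-one mapping from status code to error type.

```python
import time

# Error types from the table above, keyed by HTTP status code.
ERROR_TYPES = {
    400: "INVALID_AUDIO_FORMAT",
    404: "URL_NOT_ACCESSIBLE",
    413: "FILE_TOO_LARGE",
    422: "TRANSCRIPTION_FAILED",
    429: "RATE_LIMIT_EXCEEDED",
}
RETRYABLE = {429}  # assumption: only rate-limit errors are worth retrying unchanged


class AnalysisError(Exception):
    def __init__(self, status: int):
        self.status = status
        self.error_type = ERROR_TYPES.get(status, "UNKNOWN_ERROR")
        super().__init__(f"{status} {self.error_type}")


def call_with_retry(send, max_attempts: int = 3, base_delay: float = 1.0, sleep=time.sleep):
    """Call send() -> (status, body); retry 429 responses with exponential backoff."""
    for attempt in range(max_attempts):
        status, body = send()
        if status < 400:
            return body
        if status in RETRYABLE and attempt < max_attempts - 1:
            sleep(base_delay * 2 ** attempt)  # 1s, 2s, 4s, ... with the defaults
            continue
        raise AnalysisError(status)
```

Errors such as `FILE_TOO_LARGE` are raised immediately because resending the same input cannot succeed; the input has to be fixed first.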

