Whissle

Visit Tool

Whissle is an AI Agents & Automation tool that provides real-time multi-modal AI intelligence from audio, text, and video. It offers a personal AI assistant, live call coaching, deep research, and smart notes, with self-hostable and privacy-first options.

Claim this tool

1View

At a glance

Pricing

Freemium · Usage-based · Open Source

Free tier

Yes

API

Yes

Skill level

Technical

About

What is Whissle?

Whissle is a personal AI assistant and intelligence platform that processes audio, text, and video streams in real-time to extract transcripts, emotion, intent, and actionable insights. Its META-1 model performs transcription and metadata extraction simultaneously, offering lower latency and richer output compared to traditional pipelines. Whissle provides features like live call coaching, deep research capabilities, smart notes, and daily briefings. It is available as a web application, a macOS desktop app with offline support, and a developer-friendly API for streaming speech-to-text and voice intelligence. The platform is open-source, self-hostable via Docker, and emphasizes privacy-first design, allowing users to run the full stack locally.

Best used for

Ideal for developers and researchers who need to integrate real-time voice intelligence into their applications, conduct deep research, and automate note-taking. Especially valuable for those requiring low-latency, privacy-first solutions with advanced metadata extraction capabilities like emotion and intent detection.

Common actions

transcribe audio

extract metadata

analyze voice

coach calls

automate notes

build AI applications

Capabilities

Key features

Real-time speech-to-text
Multi-modal intelligence
Emotion detection
Intent detection
Named entity recognition
Live call coaching
Self-hostable

Target Audience

developerresearcherstartup founderproduct manager

Integrations

Not yet documented

Pricing & Plans

Freemium · Usage-based · Open Source

Not publicly disclosed. Check whissle.ai for current pricing.

FAQs

What metadata does the Whissle Speech-to-Text API extract?

Beyond transcription, Whissle's Speech-to-Text API extracts rich metadata in a single pass, including intent detection, emotion recognition, named entity recognition (NER), speaker diarization, age and gender estimation, and punctuation. The same metadata is also available for text input via their Text Intelligence API.

Is Whissle free to use, and what are the pricing options?

Yes, the personal AI assistant at lulu.whissle.ai is free to use. The Speech-to-Text and Intelligence APIs have usage-based pricing starting at $0.003 per minute. Additionally, Whissle can be self-hosted via Docker at no cost, providing a full local setup.

Can I self-host Whissle, and what are the requirements?

Absolutely. Whissle provides a full Docker Compose setup that allows self-hosting the frontend, gateway (ASR + agent + proxy), and backend locally. It replaces cloud dependencies with SQLite and local storage, requiring only 16 GB RAM and a Gemini API key.

What is Live Assist / call coaching in Whissle?

Live Assist provides real-time AI coaching during phone calls or meetings. It listens to the conversation, detects intent and emotion, and surfaces contextual suggestions, key points, and action items. This all happens in real-time with low latency to support dynamic interactions.

How does Whissle compare to other speech-to-text APIs?

Whissle's META-1 model performs transcription and metadata extraction simultaneously in a single pass. This differs from traditional pipelines that require separate models for each task, resulting in lower latency, lower cost, and richer output from a single API call.

Trending

Subcategories trending in AI Agents & Automation

AI Frameworks & Infra Chatbots & Conversational AI General-Purpose Agents Workflow Agents Personal Assistants RAG & Document AI

Trending

Explore

Browse AI tools by category

Content & Design Productivity & Business Coding & Development AI Agents & Automation Research & Education Wellness & Lifestyle Career Development Marketing & Growth Data & Analytics Customer Support & CX Finance E-commerce