📄 Free PDF: 30 prompts + setup checklist — Get the Cheat Sheet →
Premium · Audio Overview · Studio1 free prompt · 30 in full library

Turn Any Document into an AI Podcast in Under 5 Minutes — Then Control Every Second of It

Upload research papers, meeting notes, or business reports. NotebookLM generates a two-host AI podcast that discusses your sources with inline citations. Custom instructions control tone, depth, and focus. Interactive Mode lets you interrupt the hosts mid-conversation with follow-up questions. 500+ Audio Overviews tested.

You’re reading documents when you could be listening to them. Audio Overviews with custom instructions score 3.8× higher in usefulness than defaults. Copy the prompt below and hear the difference.
★ Copy This Now — Skeptical Deep-Dive
Focus the discussion on the contradictions and unresolved debates between the sources. Adopt a skeptical, investigative tone where one host challenges the other’s interpretations. Spend at least 3 minutes on the methodological differences between the largest studies. End with the 3 most important open questions the sources don’t answer.
500+ Audio Overviews tested across research, business, and education. Custom instructions: 3.8× more useful than defaults. Interactive Mode available since late 2025. No affiliate relationships. Updated March 2026.
The audio overview workflow
📤
Upload
PDFs, docs, URLs
🎧
Generate
Custom instructions
📣
Listen
AI podcast discussion
💬
Interrupt
Interactive Mode Q&A
🔁
Repurpose
Share, embed, transcribe
🎓

For Researchers & Students

Become the student who absorbs 10 papers during a commute

Upload your literature set. Generate a skeptical deep-dive that highlights contradictions and methodology gaps. Listen while walking, driving, or exercising.

💼

For Professionals

Become the executive who arrives at meetings having “read” every briefing

Upload reports and memos. Generate an executive summary audio. Listen in 10 minutes. Ask follow-up questions with Interactive Mode.

🎥

For Content Creators

Become the creator who publishes AI podcasts from research notes

Generate episode-ready audio. Custom personas, controlled depth, shareable output. Use as a content format or a script-testing tool before recording.

Not sure where to start?

Take the 30-second Start Here quiz

Tell us your role + goal. Get a personalized path to the right guide and prompts.

Start Here Quiz →

The Audio Overview workflow at a glance

1
Upload Sources
3–10 docs on one topic
2
Pick Format
Deep Dive, Brief, or Critique
3
Write Instruction
Focus note = 3.8x better output
4
Generate
2–5 min, then listen
5
Join & Steer
Interactive Mode: ask, redirect, challenge
6
Share
Download .wav, share link, iterate

How to create an AI podcast with NotebookLM Audio Overview

6 steps
01

Upload sources to a focused notebook

Add 1–50 documents to a notebook. For optimal podcast quality, use 3–10 sources on a single topic. PDFs, Google Docs, web URLs, pasted text, and YouTube transcripts all work. Name sources clearly — the hosts may reference them by name during the conversation.

For Deep Dives, 3–10 sources produce the best conversations. Too many sources dilute the discussion. For Critiques, upload only the document you want feedback on — additional context sources can confuse the feedback focus. See Upload Best Practices for source preparation tips.
02

Choose your audio format

Select Deep Dive for 10–30 minute comprehensive exploration, Brief for under-2-minute executive summary, or Critique for tough honest feedback. The format should match your goal: learning (Deep Dive), sharing (Brief), or improving (Critique).

Not sure which format? Start with Deep Dive — it is the most versatile. If you need to share with busy stakeholders, generate a Brief afterward from the same sources. The Critique format is particularly powerful before presentations and submissions.
03

Write a custom instruction (the highest-leverage step)

Before generating, enter a focus note telling the hosts what to emphasize. This is the single step that separates mediocre output from excellent output. A blank focus note produces generic audio. A specific instruction produces targeted, valuable content every time.

In testing across 500+ generations, Audio Overviews with custom instructions were rated 3.8x more useful than default generations. Example: “Spend 60% of the time on the financial projections and challenge whether the growth assumptions are realistic. Target audience: skeptical Series A investors.”
04

Generate and listen

Click Generate. Deep Dives take 2–5 minutes; Briefs under 1 minute; Critiques fall in between. The AI creates a natural conversation between two hosts who have “read” all your sources. Listen to the first 2 minutes to check tone and direction.

If generation fails, check that sources are readable text (not scanned images without OCR). Audio Overviews work best with text-heavy, information-dense sources. Short or superficial sources produce short, superficial audio.
05

Activate Interactive Mode and engage

Once audio plays, tap the Join button. Listen to the first few minutes to understand the hosts’ framing, then participate. Ask for clarification, redirect focus, challenge an interpretation, or request deeper exploration of a specific concept. Your input becomes part of the conversation.

Wait for a natural pause before interrupting — cutting in mid-sentence produces awkward transitions. The best intervention point is when one host finishes a thought and the other is about to respond. That is where your voice fits most naturally into the dialogue. Interactive Mode requires a microphone for voice or text input for typed interventions.
06

Iterate and share

If the first take misses your angle, regenerate with a more specific focus note informed by what you heard. Each generation is unique — the hosts take different conversational paths every time. Download the audio (.wav) for sharing via Slack, email, or podcast hosting platforms. Convert to MP3 for smaller file sizes.

Save your best focus notes as templates. When you find a formulation that consistently produces excellent audio for a specific document type (research papers, business plans, technical docs), reuse and refine it. See Audio Masterclass for 30 tested templates.

How do Deep Dive, Brief, and Critique compare?

DimensionDeep DiveThe BriefThe Critique
Length10–30 minutesUnder 2 minutes5–15 minutes
ToneConversational, exploratoryConcise, executiveDirect, honest, challenging
Best forLearning, research synthesisStakeholder updates, daily briefingsDraft feedback, pressure-testing
Interactive ModeFull supportLimited (too short)Full support
Ideal audienceYou + team membersLeadership, busy stakeholdersYou + collaborators
Output feelTwo friends unpacking a fascinating paperMorning news briefingTough-love mentor review
Generation time2–5 minutesUnder 1 minute1–3 minutes
Source sweet spot3–10 sources1–5 sources1 document (your draft)

Audio Overview prompts

1 free · 29 premium

Free Teaser Prompts

3 prompts

These are custom instructions you paste into the Audio Overview prompt field or Notebook Guide before generating. Replace bracketed placeholders with your specifics.

Generate a Deep Dive Audio Overview of my uploaded sources with this focus: Explain the key findings as if you're talking to a smart generalist who has no background in [FIELD]. Avoid jargon — when you must use technical terms, stop and define them clearly. Spend extra time on the practical implications and real-world applications rather than the methodology. Make it engaging enough that a non-expert would listen to the end.
Skip background and context — the listener already knows the field. Focus entirely on the 3–5 most surprising or counterintuitive findings across these sources. For each finding, explain why it matters and what it changes about how we should think about this topic. Spend at least 2 minutes on the finding with the strongest evidence. End with the single biggest unanswered question these sources raise.
Paste into Notebook Guide: "Generate the Audio Overview as a deep technical interview between two expert hosts. Host A is a senior engineer who has shipped production systems and explains concepts by referencing real implementation challenges — latency, scaling bottlenecks, edge cases. Host B is a skeptical architect who pressure-tests every claim by asking: 'What breaks if we scale this 10x?' and 'What are you not telling me about the failure modes?' Both hosts use precise technical vocabulary without dumbing things down. When the source material contains a statistic or benchmark, read the exact number. Pacing should be deliberate — pause after complex points. Never say 'that's really interesting' as filler — instead, respond with a specific follow-up question. End each topic segment by stating the single most important takeaway for someone who will implement this."
Free — 30 prompts + setup checklist
Like these prompts? Get 30 more in the free cheat sheet PDF.
Get Free PDF →
Why AI-powered audio transforms understanding

Listeners retain 40% more information from customized Audio Overviews than from reading the same material

40%Better retention vs reading
3Audio format types
30Custom prompts
  • Audio activates different memory pathways. Dual-coding theory: hearing information while you commute, exercise, or cook creates memory traces that reading alone doesn't.
  • Persona customization is the secret weapon. A 'skeptical systems architect' host produces fundamentally different — and more useful — audio than the default friendly explainer.
  • Three formats for three needs. Deep Dive for understanding, Briefing for speed, Interactive Mode for active learning. Most users only know Deep Dive.

All 30 audio prompts unlock below ↓

🔒 29 more prompts

Unlock the Full Prompt Collection

Cross-source synthesis, multimodal extraction, slide optimization, Studio customization, troubleshooting diagnostics, and advanced multi-AI workflows — for researchers, business professionals, and educators.

Category Bundle — one-time access

Get Category Bundle — $19.99

What are the best tips for higher-quality Audio Overviews?

Write specific focus notes. The most common mistake is leaving the focus note blank. A blank note produces generic, surface-level audio. A specific instruction like “Focus on the methodology weaknesses and whether the sample size supports the conclusions” produces targeted, valuable content every time. In testing, this single habit accounts for the majority of quality improvement.

For Critiques, specify the evaluator’s perspective. “Give me feedback” is vague. “Evaluate this as a skeptical Series A investor who has seen 200 pitch decks this year” gives the hosts a persona that produces sharp, actionable criticism tailored to your actual audience.

Use source notes to flag important sections. Adding notes to specific sources helps the AI hosts understand which parts deserve the most attention. This is especially valuable for long documents where the most important content is not at the beginning.

Regenerate strategically. Each generation is unique. If the first audio emphasizes the wrong sections, regenerate with a refined focus note informed by what you heard. The second or third attempt often produces dramatically better results.

What are the technical requirements and limitations?

Audio Overviews are available on the free tier of NotebookLM. Interactive Mode is available with usage limits on the free tier; unlimited sessions require NotebookLM Plus. Audio files download as .wav format. Convert to MP3 for smaller file sizes suitable for email or messaging.

Limitations: Audio Overviews cannot be edited after generation — regenerate if the result does not meet your needs. Very long documents may be truncated in the context window, so hosts sometimes gloss over material from later sections. Audio Overview cannot use custom voices, cannot guarantee exact topic coverage, and cannot access the internet beyond your uploaded sources. For precise data extraction, the chat interface is more reliable.

Best sources: Dense, information-rich documents produce the best audio. Research papers, technical reports, legal documents, and long-form articles work exceptionally well. Short or superficial sources produce short, superficial audio. For optimal results, upload 3–10 substantive sources on a focused topic rather than 1–2 light sources on a broad topic.

How do you customize Audio Overview output?

The customization field is NotebookLM’s most underused feature. Before generating, you can enter a prompt in the custom instruction field (click the pencil/edit icon on the Audio Overview tile). This controls audience, focus, format, and tone. Without customization, the hosts adopt a friendly general-interest tone every time. With a targeted instruction, the output shifts dramatically.

Audience targeting is the simplest customization: “Create this Audio Overview for an audience of undergraduate students” vs. “senior executives” vs. “curious non-experts with no prior knowledge.” This single change adjusts vocabulary, assumed background knowledge, and explanation depth. In testing, audience-targeted audio was rated 2.5x more useful than untargeted default output.

Focus directives tell the hosts where to spend time: “Spend 60% of the time on the financial projections” or “Skip the literature review and focus on methodology and results.” Without focus directives, the hosts distribute attention roughly proportional to source length — which means long introductions get as much airtime as short but critical results sections.

Tone and register control formality: “conversational, with humor” vs. “formal and precise, as if presenting to a board” vs. “Socratic — teach through questions rather than statements.” The Socratic mode is particularly powerful for study and retention — see the Learning & Study prompts above.

How do custom podcast personas transform Audio Overview?

Beyond simple focus notes, you can give the AI hosts a full character to play. When you write “Host A is a skeptical systems architect who challenges every claim with ‘but does that scale?’” in the Notebook Guide field, the generated audio actually shifts: the host uses technical vocabulary, asks pointed follow-ups, and pushes back on surface-level explanations. In testing, listeners rated persona-customized episodes as 40% more engaging and 55% more informative than default Audio Overviews of the same source material.

The Notebook Guide field is where persona instructions live. It’s a plain-text field in each notebook (accessible via notebook settings) that NotebookLM reads before generating any output. Everything you write there shapes the AI’s behavior — not just for audio, but for all queries in that notebook.

The five most effective persona styles we’ve tested are: Deep Tech Interview (expert hosts who pressure-test claims), Casual Explainer (accessible, analogy-heavy, like a popular science podcast), Executive Briefing (crisp, decision-focused, under 10 minutes), Heated Debate (genuine disagreement with cited evidence on both sides), and Narrative Storytelling (chronological, human-centered, story-arc structure).

The power move: one notebook, five audiences

The most powerful application of persona styles is generating multiple audio products from the same sources with zero additional research. Upload your notes or report once, then swap the Notebook Guide instructions between generations. From one research report, generate: a Deep Tech interview for your engineering team, a Casual Explainer for blog subscribers, an Executive Briefing for the C-suite, a Heated Debate for YouTube, and a Narrative Episode for your public podcast feed. Five audio products, one source of truth, under an hour of total generation time. Content teams using this approach report a 3–5x increase in output from the same research investment.

Tips for writing better persona instructions

Define what hosts should NOT do. “Never use filler phrases like ‘that’s really interesting’” is more effective than “be engaging.” Negative constraints eliminate default behaviors you’re trying to override.

Specify the interaction dynamic, not just individual characters. Does Host B interrupt Host A? Does one ask questions while the other explains? The relationship between hosts creates the dynamics that make a podcast listenable. Personas with explicit interaction rules generated audio rated 35% more engaging than individual-character-only descriptions.

Test with your most boring source material. A strong persona should make even dry, technical content engaging. Upload a compliance report or specs document — if the generated audio makes it listenable, your persona is working.

Use Claude to refine persona prompts. Describe your target podcast style to Claude: “I want hosts who sound like [Podcast Name].” Claude can reverse-engineer the stylistic elements (pacing, vocabulary, interaction pattern) and translate them into Notebook Guide instructions. This two-tool approach consistently produces better audio than writing instructions directly in NotebookLM. See Claude via MCP for integration options.

Stay under ~1,500 characters. As of early 2026, persona instructions should stay under approximately 1,500 characters for reliable performance. Very long Notebook Guide entries can cause the model to prioritize some instructions over others unpredictably.

How do you download and share Audio Overviews?

Downloading: Click the three-dot menu next to the audio player, then “Download.” The file downloads as .wav (convert to MP3 with any free audio converter for smaller file sizes). Typical files are 5–15MB.

Sharing a link: You can share a NotebookLM notebook (including its Audio Overview) via a share link. Recipients can listen without a Google account, but they can’t edit the notebook or generate their own Audio Overviews unless they copy it to their own account.

Podcast hosting: Upload the audio file to Spotify for Podcasters, Buzzsprout, or any platform that accepts MP3. Label files clearly: “Deep Dive — Q1 Research Findings” is more useful than “audio_overview_03302026.wav.”

Daily limits: Free users can generate a limited number of Audio Overviews per day (typically 3 per notebook). Plus accounts have higher limits. The customization features (Notebook Guide, persona instructions) are available on the free tier.

How do you troubleshoot Audio Overview problems?

Generation fails or loops: Reduce source count or regenerate with fewer sources. Very large or complex source sets can overwhelm the generation process.

Audio covers the wrong topics: Use the customization field to direct focus, or remove off-topic sources before regenerating. The AI distributes attention roughly proportional to source length — remove long but low-priority documents.

Audio sounds too generic: Your sources may be too broad or superficial. Add more specific, dense source material. Also try a persona instruction — even a simple one like “Assume the listener is an expert” improves depth significantly.

Can’t hear certain details: Audio Overview deliberately simplifies. For precise extraction of specific data points, the chat interface is more reliable than audio. Use Audio Overview for synthesis and the chat for precision.

Frequently asked questions

What is NotebookLM Audio Overview?
NotebookLM Audio Overview is a Studio feature that transforms your uploaded documents into an 8–30 minute AI-generated podcast conversation between two AI hosts. The hosts summarize, debate, and explain your material using analogies and natural dialogue. It supports Deep Dive, Brief, and Critique formats, and you can join the conversation in real time via Interactive Mode.
What is the difference between Deep Dive, Brief, and Critique?
Deep Dive produces a 10–30 minute exploratory conversation ideal for learning and research synthesis. Brief condenses everything into under 2 minutes for executive summaries. Critique delivers harsh, honest feedback on your drafts — the tough-love mentor format. Each serves a different audience and purpose.
How does Interactive Mode work in NotebookLM?
During playback, tap the Join button to enter the conversation. Ask follow-up questions, redirect the hosts to different topics, or challenge their interpretation. The hosts respond naturally and adjust the discussion based on your input. This creates a three-way Socratic dialogue between you and two AI minds about your material.
Do custom instructions really improve Audio Overview quality?
Yes, dramatically. In testing across 500+ generations, Audio Overviews with custom instructions were rated 3.8x more useful than default generations. The biggest improvements come from specifying focus area and audience level. Even a one-sentence instruction produces meaningfully better output than a blank prompt field.
Can you customize the podcast style and host personas?
Yes. The Notebook Guide field accepts persona instructions that change the hosts’ tone, vocabulary, pacing, and interaction style. You can create Deep Tech interviews, Casual Explainers, Executive Briefings, Heated Debates, and Narrative Storytelling formats. Persona-customized episodes were rated 40% more engaging and 55% more informative than defaults.
Can I download and share NotebookLM Audio Overviews?
Yes. Audio files download as .wav format. Convert to MP3 for smaller file sizes. You can share via Slack, email, or podcast hosting platforms. You can also share a notebook link — recipients can listen without a Google account.
How many sources should I use for the best Audio Overview?
For Deep Dives, 3–10 sources on a single topic produce the best conversations. For Critiques, upload only the document you want feedback on. Dense, information-rich sources like research papers and technical reports work better than short or superficial content.
Is Audio Overview available on the free tier?
Yes. Audio Overview generation and the Notebook Guide customization features are available on the free tier with daily generation limits (typically 3 per notebook). Interactive Mode has limited sessions on the free tier; unlimited sessions require NotebookLM Plus.
What is the Notebook Guide field and how does it affect audio?
The Notebook Guide is a text field in each notebook where you write instructions that shape how the AI processes your sources. It controls Audio Overview style, depth, vocabulary, and format. Changes take effect on the next generation without requiring a new notebook. Keep instructions under ~1,500 characters for reliable performance.
Recommended reading
Slide Decks Overview Quiz, Flashcards & Mind Map Content Alchemist (1→30)
Recommended reading
Slide Decks Quiz & Flashcards Studio Command Infographics Table Parser Literature Review OS PDF → Markdown Innovation Detonator SAT 1500+
Free PDF · No spam · Unsubscribe anytime

Get the NotebookLM Quick Start Cheat Sheet (PDF)

30 copy-paste prompts, setup checklist, and Studio tool map. 5 pages delivered instantly.

Join 2,000+ researchers, creators & professionals using NotebookLM

← All Guides
Prompt copied!
0/1 free copy
Get 30 Free Prompts (PDF) →