Documentation

Help & Documentation

Everything you need to know about using BleepKit to create broadcast-ready clean edits.

New here? Start with this:

1Upload
2AI processes it
3You review every word
4Export clean version
Takes ~2-3 minutesΒ·First song free, no credit cardΒ·AI + your review = zero-risk clean edits

Getting Started

BleepKit combines AI detection with human review to produce clean edits you can trust. Upload your song, let the AI process it, review every flagged word yourself, and export your clean version.

1

Upload

Upload any audio file β€” WAV, MP3, FLAC, or AIFF. Choose your censor mode and output format.

2

AI processes it

The AI separates vocals, identifies the song, transcribes lyrics, and flags profanity automatically.

3

You review every word

Listen to each flagged word in context. Approve words to censor or reject them to keep. You have full control.

4

Export clean version

Download your clean version, stems, and compliance reports β€” proof for labels, distributors, and legal.

AI + human review = zero-risk clean edits

Unlike fully automated tools, BleepKit never ships an edit you haven't reviewed. The AI does the heavy lifting, but you make every final decision. Nothing gets censored that shouldn't be, and nothing slips through.

Uploading Songs

BleepKit upload page

Supported formats

WAV, MP3, FLAC, and AIFF files up to 200 MB. Maximum duration is 30 minutes for songs, 3 hours for podcasts.

Credit usage by duration

DurationCredits
Up to 10 minutes1 credit
10 – 20 minutes2 credits
20 – 30 minutes3 credits

Censor modes

  • Silence β€” Replaces flagged words with silence. The cleanest option for most use cases.
  • Bleep β€” Replaces flagged words with a standard bleep tone. Common for broadcast radio.
  • Reverse β€” Reverses the audio of flagged words. A creative option that keeps the vocal energy.

Output formats

  • WAV β€” Uncompressed, highest quality. Best for mastering and professional workflows.
  • FLAC β€” Lossless compression. Same quality as WAV but smaller file size.
  • MP3 β€” Compressed. Good for distribution and streaming platforms.

Advanced options

  • Loudness normalization β€” Normalize output to a target LUFS value (default: -14 LUFS).
  • Padding β€” Add extra silence around each censored word (0–200 ms). Helps smooth transitions.

Processing Pipeline

When you upload a song, BleepKit runs through several automated steps. Here is what happens behind the scenes:

BleepKit processing pipeline
1

Vocal Separation

AI-powered source separation splits the audio into vocals and instrumental stems with studio-quality precision.

2

Song Identification

The system attempts to identify the song using audio fingerprinting and metadata analysis. This helps find accurate lyrics.

3

Lyrics Retrieval

If the song is identified, lyrics are fetched from online databases. Multiple sources are checked for the best match.

4

Speech-to-Text

The vocal track is transcribed using AI speech recognition, providing word-level timestamps for precise censoring.

5

Profanity Detection

Transcribed words and retrieved lyrics are cross-referenced against profanity databases to flag explicit content.

6

Review Preparation

Flagged words are compiled with timestamps and source information, ready for your review.

Reviewing Flagged Words

The review page gives you full control over which words are censored. Every word flagged by the AI can be approved or rejected before export. This is what makes BleepKit zero-risk β€” you make the final call on every word.

BleepKit review page

Example: flagged words table

#TimestampWordSourceAction
10:42 – 0:43****bothApproved
21:15 – 1:16****sttApproved
32:08 – 2:09hecksttRejected

Each row has a play button to hear the word in context. Approve words to censor them, reject to keep them in the audio.

Bulk actions

Use "Approve All" to censor every flagged word, or "Reject All" to keep them all. Then fine-tune individual words as needed.

Audio playback

Click the play button next to any word to hear a short audio snippet centered on that word. This helps you decide whether to censor it in context.

Source badges

Each flagged word shows where it was detected:

  • stt β€” Detected from speech-to-text transcription
  • lyrics β€” Found in retrieved lyrics
  • both β€” Detected by both transcription and lyrics
  • manual β€” Manually added by you
  • custom_list β€” Matched your custom word list

Censor mode switcher

You can change the censor mode (silence, bleep, or reverse) directly on the review page before exporting. No need to re-upload.

Full transcript

Below the word table, the full transcript shows every word in the song. Already-flagged words are highlighted. Click any unflagged word to add it to the censor list β€” useful for catching words the AI missed.

Downloads & Exports

After approving your word selections, BleepKit exports your files. Five downloads are available:

Clean Version

The full song with all approved words censored using your chosen censor mode.

Instrumental

The instrumental track with vocals completely removed.

Acapella

The clean vocal track only, with approved words censored.

Compliance Report (TXT)

A plain text report listing every flagged word, timestamps, and your decisions.

Compliance Report (PDF)

A professionally formatted PDF with tables, song metadata, and your complete review decisions.

Re-review & re-export

Changed your mind? Click "Re-Review Words" on the download page to go back and adjust your selections. Re-reviews are free and unlimited β€” you only pay once per upload.

Compliance Reports

Proof for labels, distributors, and legal

Every processed song generates a compliance report β€” a detailed record of exactly what was censored, where, and by whom. This is documentation you can hand to a label, distributor, or legal team to prove the edit meets broadcast standards.

TXT Report

Machine-readable plain text with every flagged word, timestamp, detection source, and approval status. Ideal for archiving and automated workflows.

PDF Report

Professionally formatted with BleepKit branding, song metadata, censored word tables, and processing details. Ready to attach to submissions or hand to stakeholders.

Reports include: song/podcast metadata, processing date, censor mode used, total words flagged, individual word timestamps and sources, approval decisions, and output file details.

Settings

Customize BleepKit to match your workflow from the Settings page.

Default preferences

Set your preferred censor mode, output format, content type, and normalization settings. These defaults auto-populate the upload form so you don't have to set them every time.

Custom extra words

Add words that should always be flagged for censoring, even if the AI does not flag them. One word per line. These words will be checked against every song you process.

Custom ignore words

Add words that should never be flagged, even if the AI detects them. Useful for brand names, slang, or context-specific terms that are not actually profanity.

Supported Languages

BleepKit automatically detects the language of your audio and applies language-specific profanity detection. The following languages are currently supported:

Arabic
Bengali
Bulgarian
Catalan
Chinese
Croatian
Czech
Danish
Dutch
English
Finnish
French
German
Greek
Hebrew
Hindi
Hungarian
Indonesian
Italian
Japanese
Korean
Malay
Norwegian
Persian
Polish
Portuguese
Romanian
Russian
Serbian
Slovak
Slovenian
Spanish
Swedish
Tagalog
Tamil
Telugu
Thai
Turkish
Ukrainian
Urdu
Vietnamese

Accuracy is highest for major world languages. Less common languages are supported but results may vary. Language is auto-detected during processing.

Credits & Plans

How credits work

Each song you upload costs credits based on its duration. Credits are deducted when processing begins. Re-reviewing and re-exporting a song does not cost additional credits.

Free trial

Your first song is completely free β€” no credit card required. Podcast users get 15 free minutes.

Podcasts & Video

BleepKit podcast processing

How media cleaning works

Podcast and Video modes use a simplified pipeline optimized for spoken word content. Select β€œPodcast” or β€œVideo” when uploading to activate this mode. Same AI + human review approach β€” you approve every word before export.

What's different from songs?

  • β€’No vocal separation β€” podcasts and videos don't have background instrumentals to remove, so we skip this step entirely.
  • β€’No song identification β€” no fingerprinting or lyrics lookup. Just straight to transcription.
  • β€’Longer duration β€” podcasts and videos can be up to 3 hours (vs 30 minutes for songs).
  • β€’Minute-based pricing β€” uses your media minutes balance instead of song credits. View minute packs.

Video support

Upload MP4, MOV, MKV, or WebM files up to 2 GB. BleepKit extracts the audio, cleans it, and muxes it back into the original video container β€” your video quality is untouched. Perfect for YouTube creators who need to keep content monetization-safe without re-editing.

Downloads

Podcast exports include the clean audio file and compliance reports (TXT + PDF). Video exports include the clean video file with the censored audio track. Instrumental and acapella stems are not available for media content since there is no vocal separation.

Known Limitations

BleepKit is highly accurate, but no AI system is perfect. Here are scenarios where results may vary:

Heavy background noise

Extremely noisy recordings (low-quality live recordings, heavy distortion) may reduce transcription accuracy. Studio-quality audio produces the best results.

Overlapping vocals

When multiple speakers or singers overlap simultaneously, the AI may miss words or flag incorrect timestamps. This is most common in group choruses or crosstalk in podcasts.

Slang and context ambiguity

Some words have both clean and explicit meanings depending on context. The AI flags conservatively β€” you can always reject false positives on the review page.

Rare languages

While 30+ languages are supported, accuracy is highest for major world languages. Less common languages or heavy regional accents may produce lower detection rates.

Mumbled or whispered vocals

Very quiet, mumbled, or whispered vocals may not be accurately transcribed. The full transcript on the review page lets you catch anything the AI missed.

Your safety net: Even when the AI isn't perfect, the review step catches everything. You see every flagged word, hear it in context, and make the final decision. That's why we say AI + human review = zero-risk.

Frequently Asked Questions

General

What is BleepKit?
BleepKit is an AI-powered tool that creates broadcast-ready clean versions of songs, podcasts, and videos. It combines AI detection with human review β€” you approve every censored word before export, so nothing slips through and nothing gets censored that shouldn't be.
Who is BleepKit for?
BleepKit is built for record labels, distributors, A&R teams, video creators, YouTubers, and anyone who needs professional clean edits. Independent artists and studios can also use it.
How long does processing take?
Most songs are processed in 2–3 minutes. Podcasts and videos take longer depending on length. You'll see real-time progress on the processing page.

Quality & Standards

Will this pass Spotify / Apple Music clean edit standards?
Yes. BleepKit produces broadcast-ready clean edits that meet the standards for Spotify, Apple Music, Amazon Music, and all major streaming platforms. The compliance report provides documentation that the edit has been reviewed and approved.
Does this sound better than manual edits?
BleepKit uses AI-powered vocal separation to censor only the vocal track, leaving the instrumental completely untouched. This produces cleaner results than manually editing a mixed-down file, where cuts often create audible artifacts in the music.
Can I use clean edits for YouTube monetization?
Yes. Clean versions created with BleepKit are suitable for YouTube, TikTok, and other platforms that require or prefer clean content. The compliance report can serve as documentation if your content is flagged.
Can I batch process songs?
Currently, songs are processed one at a time. You can upload and queue multiple songs β€” each one enters the pipeline independently. Batch processing is on our roadmap.

Accuracy

What if the AI misses a word?
You can manually add any missed word on the review page by clicking it in the full transcript. The AI catches the vast majority of profanity, but your manual review ensures nothing slips through. This is why BleepKit is zero-risk β€” AI + human review.
What if the AI flags a word that isn't profanity?
Click the checkmark to reject it β€” it will be kept in the final audio. You can also add words to your "ignore list" in Settings to prevent them from being flagged in the future.
Does it support languages other than English?
Yes. BleepKit supports 30+ languages for speech-to-text transcription and profanity detection. The language is auto-detected during processing.

Billing

Is the first song really free?
Yes. Your first song is on us β€” no credit card required. Just sign up and upload.
Do re-reviews cost credits?
No. Re-reviewing and re-exporting a song is free and unlimited. You only pay once per upload.
What happens if I run out of credits?
You can upgrade your plan or purchase additional credits at any time from the Plan page.

Video

What video formats are supported?
BleepKit accepts MP4, MOV, MKV, and WebM files up to 2 GB. The output video is returned in the same format as the original.
Does BleepKit change my video quality?
No. BleepKit only processes the audio track. The video stream is left completely untouched β€” same resolution, same codec, same quality.
What if my video has background music?
BleepKit censors profanity by muting or bleeping the audio at the detected timestamps. If there is background music playing during the censored word, there may be a brief audio dip. For most content this is barely noticeable.
Can I get subtitles from my video?
Yes. After processing, you can download an SRT subtitle file generated from the transcript. This works great for adding captions in video editors like Premiere Pro or DaVinci Resolve.
Do videos use song credits?
No. Videos use your Media Minutes balance, the same as podcasts. You can purchase Media Minute Packs from the Plan page.

Files & Privacy

Is my audio kept private?
Absolutely. Your files are stored securely and are only accessible to you. We never share or use your audio for training purposes.
Can I delete my processed files?
Yes. You can delete any job from your dashboard at any time.
How long are my files stored?
Audio and video files are automatically deleted 60 days after processing is complete. You’ll receive an email reminder 10 days before deletion if you have email notifications enabled in Settings. Compliance reports are kept permanently and remain available after files are removed. You can always re-upload and reprocess a file if needed.

Still need help?

Our team is here to help. Reach out and we will get back to you as soon as possible.

hello@bleepkit.com