Transcription Advanced#
Context#
Adding context can improve transcription accuracy. Context can be beneficial to correctly transcribe uncommon spoken words, such as:
entity names (people, company or product names, etc.)
technical jargon (in medicine, engineering, science, etc.)
Context can be any text (e.g. a summary or a related document) or just a list of relevant words. It can also contain text or phrases that might not be present in the audio. Omnio will only use context if necessary.
Transcribe the audio. Here's the relevant context:
XiangXYZ Electronics
TitanHex Technologies
ExoBook ZX5
Diagon TekGear Z7
OmegaDrive T5
QuantumGear E9
Timestamps#
Transcription with timestamps includes timestamps in [MI:SS] format.
Transcribe the audio with timestamps.
You can also specify how frequently you want to insert timestamps.
Transcribe the audio with timestamps.
Insert timestamps every 50 characters on average.
Verbatim#
Verbatim transcription is a word-for-word transcription of spoken language. This means that every single word, including fillers, pauses and false starts, is transcribed exactly as it was heard.
Create a verbatim transcript of the audio.
Clean verbatim#
Clean verbatim transcription removes filler words, stammers and interjections from other speakers (e.g. “mm-hmm”, “um”).
Bob: Hi, um, is this Alice?
Alice: It is, yeah.
Bob: Alice, how-how is it going? My name, um, is Bob.
Transcribe the audio in clean verbatim form.
Profanity#
Omnio can mask, remove or tag profanity.
Transcribe the audio with profanity masked.
Person: I’m done with this s***, you f****** a******.
Transcribe the audio with profanity removed.
Person: I’m done with this [profanity removed], you [profanity removed] [profanity removed].
Transcribe the audio with profanity tagged.
Person: I’m done with this [profanity:shit], you [profanity:fucking] [profanity:asshole].
Personally identifiable information#
Omnio can remove or tag personally identifiable information (PII), such as names, addresses, dates of birth, and phone numbers.
Transcribe the audio with personal information removed.
Transcribe the audio with personal information tagged.