How To Correct Automated Transcripts With Our Editor

We provide a browser based-editor which can be used to quickly correct the automated transcripts. Click the Edit Transcript button to launch it.

Scribie Editor

The first thing you will notice is the audio waveform at the top. That is the audio player.  Clicking anywhere on it will take you to the corresponding word in the transcript.

The first row of buttons are the controls. Each button also has a corresponding keyboard shortcut so that you don’t have to use the mouse which saves a lot of time. The important shortcuts to remember are CTRL+P to play/pause and CTRL+O to rewind (CMD for Mac).

The second row of buttons are some controls for the text editor. Hover the mouse over the button to get a description of what the button does. It’s mostly self-explanatory.

You will also notice some text underlined in blue and red. The red ones are spelling mistakes. Run the spell check to correct those. The blue ones are where our speech recognition engine was not confident enough and so those may be mistakes. You can right click on those and choose Play Word to check the corresponding audio.

The following are the list of corrections which tend to be required in the automated transcripts:

  • Mistakes: These are words which are incorrectly transcribed. Most of these words will have blue underlines.
  • Speaker Turns: Our speech recognition engine misses around 40% of the turns. So some paragraphs may actually have two speakers in them (we are working to improve it).
  • Punctuations: There may be some missing periods. The commas and other punctuations are mostly correct, although we only provide the start quote. The end quote has to be manually inserted.
  • Capitalization: Some of the capitalized words may be wrong. Some other words may need to be capitalized.

We recommend the 2-pass approach to make the corrections. First play and check the blue underlines. Those are the low-hanging fruits and you can get them out of the way fast.

Next, play the audio from the beginning and make corrections as you go along. Whenever you notice a mistake, pause, make the correction, and resume play. Rinse and repeat till you reach the end of the file. Increasing the playback speed can also help in cases where the accuracy is more than 80%.

Once you are done with the edits, Click the Download button at the bottom for the Word Document or other formats.

Editor download files

Effectively, it takes around 3-4 times the duration of the file to correct the automated transcript, if you include the time for replays. It is also easy to lose focus on long files. So, remember to take breaks. Without the automated transcript, you may have to spend 8-10 times the duration of the file.

Of course, if you do not have the time, our transcribers will be happy to make the corrections for you. We guarantee 99% accuracy for our manual transcripts. Please do try it out.

Did Trump Say I or I’d?

The Jury is out on what President Trump said: I or I’d. But what does the AI think? We put our free automated transcripts service to test on the following clip.

And here’s the result.

But with that being said, a president has been extremely generous with what he said. I like him a lot. I have a great relationship with them, as you know, have a great relationship with prime minister abe in japan, and I probably have a very good relationship with m gun f not care. I have relationships with people to surprise.

So our AI agrees with WSJ. President Trump did say ‘I’. So there you go!

The transcript is missing few words towards the end and we are working to fix it. However, if you have a clean audio file then head here to get a free automated transcript!

No Room for Errors

No room for errors

If you have used Scribie’s service before, you probably know about our high standards in terms of the quality and the accuracy of the transcripts.

So a very important question that comes to mind is, how is Scribie able to churn out 99.9% accurate transcripts all the time, while some industry players are afraid of even claiming that benchmark? Continue reading “No Room for Errors”

Are Automated Transcribing Softwares Good Enough? Not for New York Times

If you are ChristineMcM, a New York Times commentator you probably know a too much about how automatic transcribing software can mess things up for you.

As reported by The Daily Dot, she had something to say about a recent Trump article but had to take a phone call in the middle of her comment. Her automatic transcription software heard and posted the whole conversation.

Yes, you read that right.

This is what it ended up posting.

This might be funny, but this shows the state that we are currently in with respect to automated transcribing.

Transcribing still continues to be mostly done by humans to avoid such gaffes.

Although she later clarified the mistake, it left those close to her and her followers in a state of a fix. Some even suspected that she might be having a neurological episode.

Here is her clarification:

Having understood these problems, Scribie is not looking to go the same route.

Instead, we use technology and AI to help humans transcribe faster and better.

The industry is far away from completely eliminating the human factor in the transcribing chain (unless you can afford such a gaffe).

For the time being the best way to get your file transcribed is a human with cutting-edge technology that enables efficiency and high accuracy.

 

 

Affordable Transcription Pricing

Affordable Transcription Pricing Scribie

Demystifying Transcription Pricing

 

If you’ve ever shopped around for the best transcription service then you’ve probably experienced how confusing and frustrating transcription pricing can be.

Many services aren’t upfront with their pricing (or accuracy level) and require you to submit for a “free quote.” Others may ostensibly offer a rock-bottom price only to nickel and dime you for features that are already standard in most services. And others are just downright disingenuous about the entire process—good luck getting in touch with them! Continue reading “Affordable Transcription Pricing”

Speech Recognition Software Falls Short on Transcription

Human Transcription is better

The Foibles of Speech Recognition

 

In this day and age, more and more of what we are doing is becoming automated. One example would be banking. You don’t have to go to a bank anymore to deposit or transfer money. You can do that from an app or just log on to their website. Some banks don’t even have physical buildings. Human interaction and the component of business is becoming more and more limited. Continue reading “Speech Recognition Software Falls Short on Transcription”

A Note From Our Founder: Why We Charge By Difficulty Level

Starting this year, we made an important policy decision; to charge for transcription of audio/video files based on the difficulty level instead of just by the audio minute. It is an important policy change and can destroy our business. As the founder of Scribie, I firmly believe that it is the right thing to do.

Let’s start by defining the difficulty level. The industry standard for transcription of a clean audio file is around 4 times the duration of the file (1 hour long file takes around 4 hours to transcribe). Any issues in the audio file such as accent, background noise, distortion, distant speakers etc., adds a multiple to that time. E.g., files with accented/distant speakers can take twice the time to transcribe just because those parts have to replayed over and over again. That multiple is what we define as the difficulty level of the file.

It is, however, a tricky thing to measure as the customer and the transcriber will have differing opinions. We internally have a voting system and an algorithm which flags difficult files for our admins. The algorithm uses a statistically significant sample size and we believe this is the closest we can arrive to an un-biased consensus opinion.

Once we have measured the difficulty level, we can estimate the cost which it will add to process of transcription. There are four ways we can account for the cost.

  1. Compromise on the accuracy
  2. Absorb the cost ourselves
  3. Ask our transcribers to spend the additional time and effort for free
  4. Charge our customers more

The first option is the easiest one. It’s a good compromise as long as it’s acceptable to both parties. However, it is a slippery slope and has disastrous consequences for a meritocratic system like ours. We grade our transcribers by the mistakes at each step of the transcription process and we cannot trust the grades if there are unknown number of mistakes. Also, it contradicts our quality guarantee.

Until last year we used to absorb the cost. It was viable since our rates were higher. It was however grossly unfair to our transcribers since we demanded the same accuracy without any additional compensation but still graded them based on the same criteria. We lost many good transcribers as a result. I still feel terrible about that.

The third option is a non-starter since basic economics defeats it. We pay our transcribers by the audio hour and a flat rate just leads to a terrible quality transcript most of the time.

That brings us to the fourth option; pass the cost on to the customer. Logically it is the best option to choose, but also the riskiest since it is hard to argue with customers. Operationally, it enables us to compensate our transcribers by the difficulty level which in turn distributes the effort required evenly across our 4-step process and optimizes the accuracy/effort ratio.

Since implementing this policy we have lost many customers and have even been ridiculed publicly. But that is a price we are prepared to pay. We firmly believe that our customers deserve the best quality transcript irrespective of the file and our transcribers deserve the best possible compensation for their efforts. Charging by the difficulty level is the only way we do both and achieve our goal of maximum possible accuracy.

Good Audio = Lower Prices

How To Get Cheaper Transcriptions

Want to pay less for your transcripts? If you use the following tips and tricks you can improve your audio and pay up to 4x less than what you would pay with bad audio — that could save you hundreds of dollars!

Seems simple, huh? It is… but many people often don’t realize they are recording bad audio. Here are a few quick and easy tips for getting the best quality audio with the least amount of effort.

 

1. Microphone & Recording Device

First off, determine what type of microphone pattern best suits what situation you need to record. If you’re recording a one-on-one interview, you’ll want a mic that has a narrow pickup pattern (unidirectional). If you’re recording a group of people (board meeting, focus group, etc.) you’ll want an omnidirectional mic as it has a larger pickup pattern and will capture all of the speakers. While you always want to be in a quiet environment with any mic, it’s important to note that omnidirectional mics are more susceptible to unwanted background noise so make sure you have a nice and quiet room.  Here are a few microphones and recorders we recommend at Scribie:

  1. Zoom H4n  – Awesome recorder with many recording options $219.99 zoom_h4npro_4_channel_handy_recorder_1464752813000_1253811
  2. Tascam DR 05 –  Best bang for the buck recorder $99.99   
  3.  Shure VP64A Nice mic low-priced and durable  $78.75
  4. Sennheiser MD 46  Great unidirectional mic for high noise environment $199.95
  5. MXL AC-404 Portable USB Conferencing Microphone – Perfect omnidirectional mic for recording groups – $83.99

2. Recording Environment

Ideally, you will need to be in a quiet location free of any ambient noise. This is crucial to capturing excellent audio because background noises such as a coffee shop, bar, or your neighbors’ car door slamming party can make it difficult for transcribers to hear your speakers’ voices. And even though our transcribers are awesome, transcribing audio with a noisy background is no easy feat and can take 3x to 4x longer to ensure our 98% accuracy.

cardoorslam
Picture from reddit user kidbopper

 

 

3. Microphone Positioning

Have the recorder on a stand or at least some type of buffer between it and whatever platform you use. This helps by separating the mic from any vibrations that may cause unwanted sound. If you are using a unidirectional microphone, try and position the mic 6″-12″ inches in front of the subject, with the mic aimed directly at their mouth. If you are using an omnidirectional mic, position the mic in the middle of the group on a stand. Always do a sound check before you begin recording by plugging in a headset and having people talk at normal voice levels.

4. Recording Settings

Recording settings are important, so make sure your recorder is set to record at a proper setting. We recommend setting it to record WAV files at 48kHz 28bit. This setting makes sure you are able to capture the human voice at an optimal level.

 

Don’t have a mic or recording interviews on your iPhone?

No problem, you can still produce great audio. Fstoppers shows how they use the iPhone to grab audio for video interviews and you can use the same methods for your audio interviews as well.

  1. Get the mic as close as possible to the subject.
  2. Use the stock iPhone recording app or download iTalk by Griffin.
  3. Export audio.

 

 

 

When YouTube Captions Go Wrong

Human Transcription > Computer Transcription

Have you ever used Google Voice’s visual voicemail option? How about YouTube’s closed captioning service? If so, you’ve probably encountered a wildly inaccurate and hilarious transcript.

Rhett McLaughlin and James Lincoln, the comedy duo behind Rhett & Link, used this amusing side effect and turned it into a series of hilarious skits on YouTube.

The concept is similar to the Telephone Game. A message is passed from person to person until the original message is mostly unrecognizable.

Here’s what they did:

Step 1: Record a short script.

Step 2: Upload it to YouTube.

Step 3: Record a new video with the garbled transcripts that YouTube produced.

Step 4: Repeat.

The result is a funny and an incoherent message similar to that of the famous “Bad Lip Reading” videos.

These skits were filmed between 2011 and 2013 and demonstrate just how inaccurate Googles’ automatic transcription services used to be. Since then, Google’s automated voice transcription service has improved significantly – hence the reason the series eventually fizzled out.

Given these modest improvements, automated transcription services still pale in comparison to the level of accuracy that human transcription services, such as Scribie, can provide.

We believe the English language, in all its complexity, nuance, and beauty will never be completely mastered by artificial intelligence.  And while this video is in jest, it’s an excellent example of why knowledge work will always require a human component to maintain quality assurance.

Why Scribie Charges Extra For Non-North American Accents

We have often been asked “Why does Scribie charge extra for non-North American accents?” so we decided to clarify and expand on the reasons behind our decision. Here’s what you need to know:

images

Most files that are uploaded on Scribie for transcription are from the US, or at least the speakers are mostly North Americans–so for that reason, we specialize in transcribing North American accents. Naturally, it makes sense that most of our transcribers are based in the US. And this ensures they are familiar with the accents, which in turn ensures flawless transcription in much less time. The result? Faster turnaround, better accuracy, and happier customers.

Generally, we have a flat rate for all high-quality audio files with North American speakers. However, we do charge a bit extra ($.50 per minute) for high difficulty files due to the time these files are. Similarly, we charge a little more for files with a non-North American accent. The main reason is, these accents require specialized transcribers that are familiar with wider accent patterns and can transcribe files with non-US speakers at the same accuracy level. The problem is that these transcribers are rare, which means, if we get too many files in this category, the files tend to get stuck in the pipeline. And nobody likes that.

Basically, we charge more for these files because it costs more to transcribe these file types. It also keeps the transcribers motivated and encourages them to continue to work on such files.

So in closing, the pay rate occasionally changes, but what doesn’t change is Scribie’s commitment to one of the highest levels of accuracy in the industry  Upload your files now and find it out for yourself.