We are happy to announce that you can now prioritize the free automated transcripts and get it ASAP for as low as 10¢/min. Automated transcripts require a lot of CPU and GPU power. Therefore we queue these files up and process them one by one. Our processing queue is a FIFO queue, First In First Out. So new files are added to the back of the queue.
Sometimes our queue gets backed up and it may take a long time one particular file to reach the front of the queue. With this feature, you can pay to get in front of the queue. Your file will be pushed to the front and the processing will be started as soon as the current one finishes. Click the Redeem Now link and make the payment to prioritize. Here’s a screenshot of how it looks in your account.
Try out our free automated transcripts today!
Our latest speech and language models have been released. There are several new features in this release. The following is a list:
Acoustic Model: This is our fourth acoustic model trained on our data. The dataset contained mostly accented speakers (eg. Indian, African, Irish etc.). It also contained some noisy files. The accuracy of the automated transcript on accented files should be better now.
Language Model: We have added more data to our language model and doubled its size. The model now model has now been trained on around 46 million lines and has improved the WER by around 2%.
Punctuations: The biggest feature of this release is expanded punctuations. We now support all types of punctuations including quotes and hyphens. To our knowledge, nobody else including Google Web Speech, AWS Transcribe and Speechmatics supports quotes.
Speaker Turns: We also have updated our speaker turns model. The accuracy of the model is around 80% on long paragraphs. The automated transcripts will be better segmented now. We are currently working on adding speaker diarization to the automated transcript and it should be out soon. We do speaker turns a bit differently and do not require the number of speakers as an input. That is also one of our unique features. Google Web Speech does not support multi-speaker files and AWS Transcribe and Speechmatics require the number of speakers as an input for diarization.
This release also fixes the issue of missing predictions where some words, especially near speaker turns were not being transcribed. The automated transcripts should now capture all utterances, except filler words. We also benchmarked our model with LibriSpeech Clean and our internal dataset. The following are our numbers.
For comparison, PaddlePaddle numbers are the following:
As you can see, for conversational audio, our models outperform PaddlePaddle by a wide margin. We are working on improving our models for non-conversational audio as well. Our ASR is a DeepSpeech-based system and therefore a comparison with PaddlePaddle is a good benchmark for us. The Continual Learning blog post has some more details on how we trained our DeepSpeech models.
The automated transcripts are free currently, so try it out today!
The Jury is out on what President Trump said: I or I’d. But what does the AI think? We put our free automated transcripts service to test on the following clip.
And here’s the result.
But with that being said, a president has been extremely generous with what he said. I like him a lot. I have a great relationship with them, as you know, have a great relationship with prime minister abe in japan, and I probably have a very good relationship with m gun f not care. I have relationships with people to surprise.
So our AI agrees with WSJ. President Trump did say ‘I’. So there you go!
The transcript is missing few words towards the end and we are working to fix it. However, if you have a clean audio file then head here to get a free automated transcript!
UPDATE 09/27: We no longer provide free automated transcripts. You have to now pay for it at 10 cents per minute of audio.
Getting your automated transcripts from Scribie is a fairly intuitive process. But here is a step-by-step guide on it.
- Go to scribie.com/transcription/free
- Click on “Upload Files” button.
- You have many ways to add your file. Once you have chosen wait for the file to be uploaded.
- Once you are on this page, click on “Auto”
- You will get a pop up to create an account or to sign-in if you already have an account.
- Once you click on “Auto” again, you will get this message.
- Wait for the email from Scribie on yourom registered mail id on the status of your file.
- Once you get the confirmation mail, click on the link provided and you will be directed to the screen with the button ” Edit transcript”.
- Click on “Edit Transcript” to start editing.