We have simplified our pricing and merged the Budget/Regular/Rush transcripts into a single type called Manual. The transcription rate for Manual transcript type is $0.80/minute with a 36 hours turnaround. In effect, we no longer support the $0.60/minute rate with the 5 day turnaround or the Rush option with 12 hour turnaround.
This pricing structure leaves us with just two types of transcripts; Automated and Manual. These names are self-explanatory. Automated transcripts are never touched by our human transcribers and you have to edit the transcript yourself to the desired accuracy level. The manual transcripts are done by our transcribers and we ensure that the accuracy is 99% or more.
This change is effective from 1st Jan 2019. Please visit the update pricing page for more.
We are pleased to announce that account credits are now supported on Scribie. Account credits are funds which you can add to your Scribie account and use it to pay for transcript orders, automated transcripts or any other payments on Scribie. Your credit card will be charged only once and all subsequent payments will be charged to your account credits. Account credits can be also be shared with team members and they do not expire. However, account credits cannot be withdrawn or transferred. You can only use it for payments on Scribie.com.
Account credits can be added from the settings page.
Click the Add Credits button to add credits and enter the amount. That will lead you to the invoice page where your credit card will be charged. On successful payment, the credits will show up in your account immediately.
Credits will be automatically applied (provided the setting is enabled) to all invoices and you can just pay with your account credit to place an order. The following screenshot shows the invoice page after the credit has been applied.
If the invoice amount is less than available credits, you only have to pay the difference as shown in the following screenshot.
Any refunds for payments made with account credits will be sent to account credits by default. If part of the payment has been made with account credits, then the refund will be split between the payment method and account credits. However, you can choose to have all refunds sent to account credits as well from the settings page. Credit Card refunds take a few days to hit the bank, but account credits refund will show up immediately in your account.
Account credits can also be used for any orders placed through the API. This is especially useful for playing around with our system.
As usual, our customer success team is always available if you need any assistance, and your comments feedback is most welcome.
We now provide subtitles along with the automated transcripts for $0.25/min of the audio. We support the SRT and VTT formats. You can order the subtitles from the button as shown in the screenshot below.
That will lead to the invoice page where you can make the payment. After the payment, the subtitle file will be replaced with the links to download the files.
The SRT and VTT buttons will download the file. The YouTube button will upload the file to your YouTube account and add it as a caption.
Try out our free automated transcription service with subtitles today!
We are happy to announce that you can now prioritize the free automated transcripts and get it ASAP for as low as 10¢/min. Automated transcripts require a lot of CPU and GPU power. Therefore we queue these files up and process them one by one. Our processing queue is a FIFO queue, First In First Out. So new files are added to the back of the queue.
Sometimes our queue gets backed up and it may take a long time one particular file to reach the front of the queue. With this feature, you can pay to get in front of the queue. Your file will be pushed to the front and the processing will be started as soon as the current one finishes. Click the Redeem Now link and make the payment to prioritize. Here’s a screenshot of how it looks in your account.
Try out our free automated transcripts today!
We provide a browser based-editor which can be used to quickly correct the automated transcripts. Click the Edit Transcript button to launch it.
The first thing you will notice is the audio waveform at the top. That is the audio player. Clicking anywhere on it will take you to the corresponding word in the transcript.
The first row of buttons are the controls. Each button also has a corresponding keyboard shortcut so that you don’t have to use the mouse which saves a lot of time. The important shortcuts to remember are CTRL+P to play/pause and CTRL+O to rewind (CMD for Mac).
The second row of buttons are some controls for the text editor. Hover the mouse over the button to get a description of what the button does. It’s mostly self-explanatory.
You will also notice some text underlined in blue and red. The red ones are spelling mistakes. Run the spell check to correct those. The blue ones are where our speech recognition engine was not confident enough and so those may be mistakes. You can right click on those and choose Play Word to check the corresponding audio.
The following are the list of corrections which tend to be required in the automated transcripts:
- Mistakes: These are words which are incorrectly transcribed. Most of these words will have blue underlines.
- Speaker Turns: Our speech recognition engine misses around 40% of the turns. So some paragraphs may actually have two speakers in them (we are working to improve it).
- Punctuations: There may be some missing periods. The commas and other punctuations are mostly correct, although we only provide the start quote. The end quote has to be manually inserted.
- Capitalization: Some of the capitalized words may be wrong. Some other words may need to be capitalized.
We recommend the 2-pass approach to make the corrections. First play and check the blue underlines. Those are the low-hanging fruits and you can get them out of the way fast.
Next, play the audio from the beginning and make corrections as you go along. Whenever you notice a mistake, pause, make the correction, and resume play. Rinse and repeat till you reach the end of the file. Increasing the playback speed can also help in cases where the accuracy is more than 80%.
Once you are done with the edits, Click the Download button at the bottom for the Word Document or other formats.
Effectively, it takes around 3-4 times the duration of the file to correct the automated transcript, if you include the time for replays. It is also easy to lose focus on long files. So, remember to take breaks. Without the automated transcript, you may have to spend 8-10 times the duration of the file.
Of course, if you do not have the time, our transcribers will be happy to make the corrections for you. We guarantee 99% accuracy for our manual transcripts. Please do try it out.
Our latest speech and language models have been released. There are several new features in this release. The following is a list:
Acoustic Model: This is our fourth acoustic model trained on our data. The dataset contained mostly accented speakers (eg. Indian, African, Irish etc.). It also contained some noisy files. The accuracy of the automated transcript on accented files should be better now.
Language Model: We have added more data to our language model and doubled its size. The model now model has now been trained on around 46 million lines and has improved the WER by around 2%.
Punctuations: The biggest feature of this release is expanded punctuations. We now support all types of punctuations including quotes and hyphens. To our knowledge, nobody else including Google Web Speech, AWS Transcribe and Speechmatics supports quotes.
Speaker Turns: We also have updated our speaker turns model. The accuracy of the model is around 80% on long paragraphs. The automated transcripts will be better segmented now. We are currently working on adding speaker diarization to the automated transcript and it should be out soon. We do speaker turns a bit differently and do not require the number of speakers as an input. That is also one of our unique features. Google Web Speech does not support multi-speaker files and AWS Transcribe and Speechmatics require the number of speakers as an input for diarization.
This release also fixes the issue of missing predictions where some words, especially near speaker turns were not being transcribed. The automated transcripts should now capture all utterances, except filler words. We also benchmarked our model with LibriSpeech Clean and our internal dataset. The following are our numbers.
|LibriSpeech Clean||Read speech||14.53%||5.85%|
For comparison, PaddlePaddle numbers are the following:
|LibriSpeech Clean||Read speech||5.4%||1.9%|
As you can see, for conversational audio, our models outperform PaddlePaddle by a wide margin. We are working on improving our models for non-conversational audio as well. Our ASR is a DeepSpeech-based system and therefore a comparison with PaddlePaddle is a good benchmark for us. The Continual Learning blog post has some more details on how we trained our DeepSpeech models.
The automated transcripts are free currently, so try it out today!
We are back with our spring special promotion. Avail a 10% discount on all orders at Scribie with the SPRING18 discount code. It will be valid till May 20th, 2018. Don’t forget to apply before ordering!