Download the following zip file and then unzip it.IdentityPoolIdOutput-The identity pool ID, which allows you to securely connect to S3 and Lambda.VoiceTranslatorBucket-The S3 bucket where you host your application, and where audio files are stored.VoiceTranslatorLambda-The name of the Lambda function to be invoked from your web application.VoiceTranslatorLink-The link to your webpage.In the Outputs section of the stack shown in the following screenshot, you find the following four parameters:.On the last step of the wizard, acknowledge that CloudFormation creates IAM After 10–15 minutes, the stack has been created. Go through the wizard and create the stack by accepting the default values.Sign into the console, and then choose Launch Stack, which launches a CloudFormation stack in your AWS account.Accelerate the speed of delivering the file with CloudFront.Īs I mentioned earlier, I created an AWS CloudFormation template to create all the necessary resources.Automatically play the translated audio to the user.You could return the audio file itself, but for simplicity, save it on S3 and just return its name. Save the output audio file back to S3 with the Lambda function, and then return the file name to your page (JavaScript invocation).Convert the new translated text into speech with Amazon Polly.Translate the transcribed text from one language to another with Amazon Translate.Convert audio into text with Amazon Transcribe.In the input of the function, provide the name of audio file (that you saved earlier in Amazon S3), and pass the source and target language parameters. Save the input audio file on S3 and invoke a Lambda function.Gain secure access to S3 and Lambda from the browser with Amazon Cognito.This is also the place to save the input audio file recorded in the browser. Host your page on Amazon S3, which simplifies the whole solution.Allow access to your site with Amazon CloudFront, which allows you to get an HTTPS link to your page and which is required by some browsers to record audio.The following diagram shows all the components of this process, which I later describe in detail. To make this process a bit easier, you can use an AWS CloudFormation template, which initiates the application. Amazon Polly - This text-to-speech service uses advanced deep learning technologies to synthesize speech that sounds like a human voice.Ī diagrammatic representation of how these three services relate is shown in the following illustration.Amazon Translate - This neural machine-translation service delivers fast, high-quality, and affordable language translation.Amazon Transcribe supports real-time transcriptions, which help achieve near real-time conversion. Amazon Transcribe - This fully managed and continuously trained automatic speech recognition (ASR) service takes in audio and automatically generates accurate transcripts.The heart of this application consists of an AWS Lambda function that connects the following three AI language services: About this blog postĪmazon Polly, Amazon Transcribe, Amazon Translate, AWS Lambda, Amazon CloudFront, Amazon S3 In this post, I show how you can connect multiple services in AWS to build your own application that works like a bit like the Babel fish. “The Babel fish is small, yellow, leech-like-and probably the oddest thing in the universe… If you stick one in your ear, you can instantly understand anything said to you in any form of language.”ĭouglas Adams, The Hitchhiker’s Guide to the Galaxy Wouldn’t it be even cooler to build your own real-time voice translator application using AWS services? It would be similar to the Babel fish in The Hitchhiker’s Guide to the Galaxy: Just imagine-you say something in one language, and a tool immediately translates it to another language.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |