I am implementing a Voice-Recognition recording feature in my app which saves the recording as a.gp3 audio file using MediaRecorder library. I now want to apply Speech-to-Text conversion on the audio file, does anyone know a way to do it? (Not using SpeechRecognizer) because the voice input needs to arrive from an audio file
Thanks everyone in advance! :)
Related
I've been struggling to find the information on how to record an audio through a mic, save it as a mp3 or wav file (if possible) and then store it in an array using Kotlin. Does anybody know how to do that?
I would the android speech-to-text system analyze an audio file instead a normal microphone flow, is it possible?
It is ofcourse possible. Take a look at Google Cloud Speech, it accepts a local audio file or better, an URL to that audio file.
Here is offical docs.
I have a small video clip and an audio file. The problem is how to write code to merge them into a single file. i have never written code for multimedia applications for android and don't know if the merging is possible with android media framework. Is there any third party library to do that?
Can we right a merging code in Java and call it in Android?
Please guide me through this. Thanks
You can try INDE Media for Mobile, tutorials are here: https://software.intel.com/en-us/articles/intel-inde-media-pack-for-android-tutorials
It has a sample demonstating how to substitute audio track in mp4 video file by another audio track from another mp4 file. It's here: https://github.com/INDExOS/media-for-mobile/blob/master/Android/samples/effects/src/com/intel/inde/mp/effects/SubstituteAudioEffect.java
In sample app it's under Audio Effect:
I am trying to apply new audio pitch for my video file.
I want to replace audio of my existing video by android code. Is it possible in android. can any one help me to do this
Is there any Android library support this facility to change audio for video file?
Any Help is appreciated...!!!
Thanks in Advance
This is not the exact solution but you can try it:
When you play the video file set its volume to zero.
At the same time use the service in which you can play any audio you want.
As service runs at background it has no effect on your video streaming.
When your video end,end your service.
And remember to set volume of your audio file.
You can change video pitch easily by following these steps :-
Add this library in your android project : https://gitlab.com/soundtouch/soundtouch.
follow this link for integrate this library process : https://stackoverflow.com/a/52425255/7899427.
After successfully integrate this library first of all extract audio file in WAV audio format. You can use ffmpeg library for do this.
Than using the soundtouch library you can change the audio file pitch and tempo rate and export new audio file in WAV format.
After complete these steps merge that converted audio file with video file.
Now will get the converted audio file with same video file.
Happy coding.
When I press and hold the search key on my Droid it starts the built in Android voice recognition.
While it is listening, if I speak the words "Note to self" and then keep talking, when done I get an email ready to send complete with the STT translation and the amr encoded audio file as an attachment.
I want to include this same functionality in my own program. More specifically, I need access to the audio file that is (or appears to be) created by speechrecognizer.
How do I access this audio file?
I am using api level 7. If I try to record and use speechrecognizer at same time it fails.
Any help appreciated.
I don't think the com.google.android.voicesearch application provides access to the recorded audio files. The files are temporarily stored in /data/data/com.google.android.voicesearch/files after the user makes a recording, but I'm not sure how you would get read access to those files.