I want to create an android app that inputs the user's voice and modulate it in real-time.
Therefore, I wanted to ask if there are any libraries or functions to input the speech/voice and change its pitch,etc.
Any in built functions...?
Thank you.
Related
I know this has been a topic covered for 8 years, but I haven't been able to find a recent answer to this question.
I'm working on an Android Auto app in my spare time. I'm trying to build out a version of the License Plate Game; when you spot an out-of-state license plate you log it to the app. And hopefully you can find all 50.
Speaking with my wife last night, we realized the smartest way to achieve this would be to activate speaking from the car's steering wheel. From there, an utterance of something like Add Utah to the License Plate Game would add Utah to your list of license plates you've seen. A user clearly wouldn't want this app in the foreground for the duration of a car ride, so a background voice action would be the way to go.
The Android Auto portion would just be for fun and to learn how to develop for Android Auto with some kind of minimal UI.
Documentation on Voice Actions for Android is minimal when it comes to custom voice utterances. Would I really need to create a Google Assistant extension for an app that is intended to store data and have all logic hosted locally within the project?
Thanks for any and all help!
Good day ma'am/sirs! I'm new to android app developing, and I'm really in need of help. I'm developing a Speech-to-Text app though its not the usual STT apps that are available on app-stores. I'm using pocketsphinx for offline speech recognition and conversion, and Android Studio IDE.
My app has three main features and those are:
Record - here is where the user will be able to record his/her speech. The recorded speech will be saved into the device's storage.
Library - Here is where the user will be able to see his/her recorded speech and converted audio-to-text files. Also the convert feature where the user may convert his/her recorded speech into text files.
Edit - here is where the user will be able to edit his/her audio/text files. Only cut, delete, and modify(only text) are the available features.
My main problem is, is it actually possible to convert a recorded speech into text by using pocketsphinx? To make it more understandable, I've tried demos of pocketsphinx and what I've experienced through it is when you speak through your device, it directly converts what you said. Unlike my idea, where you may record your speech, and convert it into text whenever you want. I'm so confused if its possible, if yes, may someone tell/explain to me how? If no, may someone tell/explain to me the other ways to follow my idea? Thanks in advance!
I want to create a application android that use pandorabot with Speech Interfaces like CallMOM
https://play.google.com/store/apps/details?id=com.pandorabots.callmom
Is there any example ?
Thanks
The algorithm is pretty simple:
recognize user input
send it to pandorabots
get result back
synthesize it with TTS engine
go to step 1
You can learn more about speech recognition on Android from answer on Is there a way to use the SpeechRecognizer API directly for speech input?
You can learn more about TTS on Android from answer on Android TTS speech synthesizer
Here is an example project that should give you the basics idea of building an Android app that talks to the Pandorabots API.
I googled around and found the regular speech-api from google. But I think this isn't what I need. I need continious voice recognition and the ability to launch other actions when a specific word is spoken. Is there anything in the android sdk that I can use?
If not: Is it possible to implement third-party libraries? (If yes: which - and what do I have to think about when implement a third-party-library?)
Edit: I thought about this again. I have to recognize just one 'word' (that probably won't be in googles-speech-databases). I have the chance to record it. That means, I'm able to continiously match the incoming audio-stream against my recording. That should work without a database. But I'm new to android-development. Do you have suggestions for APIs to use for recording and matching the recorded? Or is there any better way to continiously wait for a specifig 'word' to occur and then process any further actions?
btw: if that wasn't clear described: the app should continue to record and watch for the word to occure again when the reaction is done.
Is there anything in the android sdk that I can use?
No, sorry.
Does anyone have experience with java voice recognition and localization?
I'm thinking to build an android application, with some basic voice recognition options, but I want to implement localization for that based on some translate tool, maybe Google translate, and users can update his "dictionary" with new languages from remote dictionary...this project is in first phase, and I'm still brainstorming, so does anyone have some experience or is something like that even possible?
Why not just use Android's built-in speech recognition? It's REALLY easy (you just set up an Intent then catch it when it returns) and the results are surprisingly good.
android.speech
I'm not sure exactly what you're trying to do, but this will allow you to specify the language to recognize.