Process audio data in Google+ Hangout API - android

Is it possible to intercept audio data using google+ hangout api? I writing an app using g+ hangout for android and I would like to process the audio. To be precessive, I want to denoise speech and use speech-to-text (e.g. google search, sphinx) to make basic voice commands.
Because I have full control of the android app it doesn't matter for me if I will have a callback with audio data from hangout or I can record audio using android AudioRecorder and then somehow forward those data to google hangout (Though the latter solution would be better because we can denoise on the android device). Actually I would be happy with any feasible workaround that may work at this stage of the API.

The Hangouts API is not going to help you develop this feature.
What you need is a platform agnostic API for accessing hangouts data. The API is instead intended to solve a different problem. It allows you to write HTML/JavaScript applications that run inside the canvas of hangouts running on desktop web browsers.

One possible "workaround" that I'm currently investigating, myself—
publish the hangout "on air"
get the youtube live id (available as of 2012-08-22, roughly... since Hangout API 1.2) ~ gapi.hangout.onair.getYouTubeLiveId() https://developers.google.com/+/hangouts/api/gapi.hangout.onair#gapi.hangout.onair.getYouTubeLiveId (note that this can only be grabbed by the host?)
grab http://www.youtube.com/watch?v=${LIVEID} // suggestion: look at youtube-dl: http://rg3.github.com/youtube-dl/documentation.html
and then use ffmpeg to process the flv
Information for this answer was primarily grabbed from Downloading videos in flv format from youtube. and http://grokbase.com/t/gg/google-plus-developers/128fbteedb/google-hangout-api-url-to-youtube-stream-screenshot-and-hangout-topic

Related

Is mandatory to have Youtube app installed to use Youtube api

I was reading several tutorials about how to use the Youtube Api. Just to find out after I implemented it. That it was not working when the original Youtube App was not installed on device. In the tutorials I read this was not mentioned. Just in original doc's it says:
The API client library interacts with a service that is distributed as a part of the YouTube app for the Android platform
If that is the case I think it is an exclusion criterion. Cause it forces the user to install another 20mb sized app.
So my question would be if there is another way to play youtube Videos.
One possibility is to use HTML5 player in a web-view. But It may have compatibility issues with older android OS and experience may vary from device to device.

Is there any API to make a video call in Google glass?

In the latest update to Glass, Google dropped the Hangouts feature. Since the Glass development kit is fairly new does anyone know of any API available to do a video chat using Glass?
Any inputs will be appreciated. At present I am planning to use SIP as done in Android. Can the same be applied to Glass?
This is what they said on the Google + page:
Video calls – We hold ourselves to high standards for the features
that we build, and video calls aren’t living up to these standards.
Explorers have told us so directly, and fewer than 10% of them use
video calls. For this reason, we’ve made the hard decision to remove
video calls from Glass until the experience is better. We don’t know
when that will be, but in the meantime, keep an eye on MyGlass as more
Glassware is built and released – we’re already seeing the developer
community work on other video streaming services. We’ve always said
that feedback from Explorers shapes Glass, and this is no exception.
I think your SIP approach is the way to go for now.

Google speech recognition library or API

Google has recently made great progress with their speech recognition software, which is used in several open source products, e.g. Chromium Web Speech and Android Handsfree texting. I would like to use their speech recognition as part of my server stack, however I can't find much about it.
Is the text recognition software available as a library or package? Or alternatively, can I call chromium from another program to transcribe some audio file to text?
The Web Speech API's are designed only to be used in the context of either Chrome or Android. There is a lot of work that goes on in the client so there is no public server to server API that would just take an audio file and process it.
If you search github you find tools such as https://gist.github.com/alotaiba/1730160 but I am pretty certain that this method of access is 100% not supported, endorsed or confirmed to keep working.
The method previously stated at https://gist.github.com/alotaiba/1730160 does work for me. I use it on a daily basis in my home automation programs. I use a python script to capture audio and determine what is useful audio or just noise, then it sends the little audio snippet to google and returns the text all under a second!! I have successfully integrated it into my programs and if you google around you will find even more people that have as well!

YouTube Java API vs Android REST Client implementation

I am trying to make an Android app which will pull uploaded videos from a channel and display them in a list view. Users will be able to watch videos, subscribe to the channel, comment, and like videos.
I have looked at using an Atom feed to get the video info from YouTube (I have a working app), but I came across the Java API for YouTube today, which abstracts all the parsing that's needed, if I implement the REST client code myself.
My question is which is the recommended solution for this? And which will perform better?
for sure " DON'T INVENT THE WHEEL ! "
I build smaller application for android and believe me using YouTube API is much better than anything else.
if you are seeking to play videos inside your Application try to see also Open-Youtube-player !
also you should use the API for other features and for its performance you don't need to add more leakage to your application you need it fast and light.

Which TTS is google translate using? Is there a way to use this TTS in android app too?

I found for certain languages the sound files from google translate sound by far most real (compared to other TTS engines availlable for Android).
Is the google translate app using a TTS engine?
Is there a way to play these TTS from a android app?
Here is a link to a TechCrunch article on using the unofficial Google TTS API [http://techcrunch.com/2009/12/14/the-unofficial-google-text-to-speech-api/]. You pass in the text in a RESTful API and it returns you an MP3. So if you can make an HTTP request from your Android application and play the MP3 that is returned you could use the Google TTS engine. The downside is that it is "unofficial" so it could be changed or removed without warning. There are probably other services available like this that are supported. But you will need an Internet connection and have to deal with latency issues with this approach, which may or may not be an issue depending on the application.
Android uses TTS engine, but the engine is determined by the firmware provider. The most-commonly seen one is Pico.
Here is the Android documentation on using TTS:
android.speech.tts.TextToSpeech
Android's TTS does NOT require an internet connection. It generates the sounds as needed.
Here is some code that uses Android TTS. It handles some of the gotchas involved:
https://github.com/gmilette/Say-the-Magic-Word-

Categories

Resources