I know that google goggles will translate a picture to text. What I want to do is allow the user to take a picture (no problem), translate that picture to text(?) and then do some processing on the text (no problem).
Is there any API on android that allows this? or is there any way to programtically communicate with google goggles? I would hate to have the user use goggles then select the saved file with my app.....
Take a look at Is there any free OCR library for Android? and What kind of OCR Java library should I use in Android? if you want libraries that do the translation on the phone itself.
However, Google also provides a WebAPI that does this. http://googlesystem.blogspot.com/2009/09/google-docs-ocr.html
Just remember that OCR (optical character recognition) is, as of now, an inexact science and you won't get flawless transcription in all cases...
check this example it's used in google translate to get text from image it's use google services
https://github.com/googlesamples/android-vision/tree/master/visionSamples/ocr-codelab
Related
I am looking for a solution to develop an attendance system. The problem is I cannot find a good library to work in (or maybe I don't even know). I want to implement Face Recognition based on the previously saved faces when a user signs up. I also want to recognize a person even with a face mask on. Can you help me by providing a path to use?
I have a sample Application https://play.google.com/store/apps/details?id=com.ttv.facedemo
This is implemented using FaceOnLive SDK which is paid, I want to implement the same feature using OpenCV.
For my bachelor thesis I have to make an app that recognize logos, eg : I see the logo of a car and I want to find out what car it is. I take a picture of the car’s logo and the app should recognize the image and send me back the Word „Mercedes” so that I can search for information about the car online. I would like that no matter the position of the logo or the light or the color, the app to recognize what that logo represents.
I have tried with recognize.im API but it doesn't work well because it is callibrated and adjusted for comparison not classification, and I definitely need classification.
I would like to go with on-cloud recognition, but on-device would work too (in which case what algorithm should I use ?).
Thank you very much
There are few APIs which provide general image recognition such as Google Vision or Imagga. These services can give you some general information about the scene - for example it can tell whether there is a car in the image or not.
However, your car brand recognition task is very specific and you might achieve better results by using customizable service such as vize.ai, which allows you to train your task-specific API endpoint. In order to train it, you need to prepare example images with logos for each car brand you want to recognize (30 to 50 images per brand). You upload these images to vize.it using the web browser interface and you'll get the API endpoint to a classifier trained for your task. Then you can simply classify new images by calling the API.
Edit:More details added (as requested by ρяσѕρєя K)
Disclaimer: I'm working at vize.it.
Edit: Link changed
I am trying to make a photo organization app in android in which i was planning on using facial recognition. Since Google+ already has a facial recognition mechanism in place i was wondering how can I integrate it with my app. Basically my app will pull photos from the users G+ account and display them in a customized and better organized layout. Is their anyway to extract the facial recognition output, as in the x-y coordinates of the face or anything else, using a query in the G+ API ? I have been scouring the net but was not able to find anything concrete on this.
Thanks
Ayush
P.S : This is my first question and I hope I was able to make myself clear enough :)
I don't believe that is part of the G+ API. If you really want to detect the outline of the face, look into some other API's that do such.
Here's a list of several face detection API's
http://blog.mashape.com/post/53379410412/list-of-40-face-detection-recognition-apis
If you're dead set on using G+ the photo's would probably have to already have the face tagged. Then you'd have to rely on some sort of scrapping method to get that javascript code and decode/parse it.
You'd be better off using a face detection API.
I'm trying to create a camera activity for taking photos to be OCR'd. Here's what I wish to accomplish:
A resizable box in the middle of the camera preview to indicate which particular area will be created into a Bitmap and processed by the OCR engine.
Continuous autofocus (done)
I'm using tesseract btw.
If anyone would care to point me to some reference / examples / tutorials, that would be great.
There's a viewfinder rectangle here:
https://github.com/rmtheis/android-ocr/blob/master/android/src/edu/sfsu/cs/orange/ocr/CaptureActivity.java
I've been doing something similar. Right now, I'm just sending the whole photo to a webservice and processing it with OCRfeeder, which will perform segmentation on the image and send each part with text in it to tesseract. I've been getting much better accuracy that way. In addition, you might want to perform some preprocessing to clean up the image first.
There can be two general approaches.
You can resize the image before sending it to OCR engine. Keep in mind that Tesseract engine you use has some kind of feature - it requires some space between characters and image borders, sometimes more than expected.
The second approach is to use field-level recognition, when you specify coordinates of the text block and send the full image to OCR engine. Have a look at http://www.ocrsdk.com, it's a cloud OCR SDK with web api recently launched by ABBYY, it's in beta, so for now it's free to use. It has a field level recognition methods and Android code samples. I work # ABBYY and can provide additional info on our products if necessary
I'm looking at developing an app that could benefit from having a image recognition system. I've seen this sort of thing in iPhone and Android apps. Take a picture of a book and the app takes you to Amazon where you can find that book. I'm not looking for general image recognition, but more the ability to pick a single image out of a library of about 10k images.
Any ideas of what services are available for this sort of thing?
Google Goggles does something similar to Amazon Remembers. It uses OCR if text can be identified and they want to use it with the similar image search from Google Images. I think they generate some kind of hash for an image with the feature that if the images are similar the images are similar to.
My best guess would be try to start with the character recognition and do a text search for the title of your card. This means your user has to make a very clear image maybe even in a specific position. But for a first application this would be great already. As somebody playing magic I would buy the tool for trading and cataloging my cards.
Actually, while short of getting an actual Amazon employee to tell you there is no way to confirm this, I am fairly certain that the Amazon Remembers feature you refer to is actually the work of crowd sourcing- using lots of people combing through data to make it appear like a computer is doing it. I think they may actually be using there own Mechanical Turk system.
Edit: Also, I found this SO question that might interest you. It is specifically for playing cards, but some of the answers (such as the machine learning example) can be modified to be more helpful for what you want to do with magic cards.