I want to build an image recognition app and need help figuring out where to start. I need someone to explain a few things:
Which offline libraries are best to use with Xamarin for image processing?
In case more performance is needed, which are the best image processing libraries for iOS and Android, to use on each platform separately?
It does not matter if the library is in C or C++; what I want is documentation to follow.
My idea of "best" is:
Well documented.
Easy to implement on Xamarin or on each platform individually.
The main functions I am looking for are for object recognition in a still image, NOT at runtime with the camera.
I would also like to know whether there is any good documentation on the fundamentals of image processing and edge detection.
Thanks
Depends on what you want to do - image detection is a big topic.
A couple of places to start are:
Microsoft Cognitive Services
https://blog.xamarin.com/performing-ocr-for-ios-android-and-windows-with-microsoft-cognitive-services/
This is an on-line service and can do OCR, facial recognition, even describing what is in an image.
OpenCV
This is a fully featured computer vision library written in C++, with iOS and Android wrappers that you can bind to use from Xamarin.
http://opencv.org
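On the edge detection fundamentals asked about in the question, here is a minimal sketch of a Sobel edge detector in plain Python. It is only meant to show the idea (convolve with two 3x3 kernels, then threshold the gradient magnitude); it is not how OpenCV implements it, and the function names, the toy image, and the threshold of 100 are made up for illustration.

```python
# Minimal Sobel edge detector on a grayscale image stored as a list of rows.
# Pure Python for illustration only; a real app would use a library's optimized filters.
import math

# Standard 3x3 Sobel kernels for horizontal and vertical intensity changes.
SOBEL_X = [[-1, 0, 1],
           [-2, 0, 2],
           [-1, 0, 1]]
SOBEL_Y = [[-1, -2, -1],
           [0, 0, 0],
           [1, 2, 1]]


def sobel_magnitude(image):
    """Return the gradient magnitude for every interior pixel (border pixels stay 0)."""
    height, width = len(image), len(image[0])
    result = [[0.0] * width for _ in range(height)]
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            gx = gy = 0
            for ky in range(3):
                for kx in range(3):
                    pixel = image[y + ky - 1][x + kx - 1]
                    gx += SOBEL_X[ky][kx] * pixel
                    gy += SOBEL_Y[ky][kx] * pixel
            result[y][x] = math.hypot(gx, gy)
    return result


def edge_mask(image, threshold):
    """Mark pixels whose gradient magnitude exceeds the threshold (an arbitrary value here)."""
    return [[1 if m > threshold else 0 for m in row]
            for row in sobel_magnitude(image)]


# Toy image: dark left half, bright right half, so there is one vertical edge.
toy = [[0, 0, 0, 255, 255, 255] for _ in range(5)]
mask = edge_mask(toy, threshold=100)
```

In the toy image the mask lights up only in the two columns on either side of the dark-to-bright transition, which is the basic behaviour most edge detectors build on.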
Related
I want to integrate face recognition in an Android app.
1) The first time, the user signs up by scanning their face (the image will be stored on a server or locally).
2) The next time the user tries, the app should recognize the user from the stored image.
I tried exploring AWS face recognition but got confused while setting it up.
There are multiple ways you can do this.
There is a simple class for this purpose in the Android SDK called FaceDetector. Note that it does face detection only, not recognition.
There are also more advanced libraries like OpenCV, which is available for many languages and platforms, including Java and Android. But if you are not familiar with the Android NDK or computer vision, I recommend sticking with FaceDetector.
Note that if you start with OpenCV, it is going to be hard at first, because you are entering a field called computer vision, which overlaps heavily with machine learning, and you will immediately feel that you are walking in completely new territory.
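To make the difference between detection and recognition concrete, here is a rough sketch of the sign-up and later-match flow from the question. It assumes that some face model (not named in this thread) has already turned each face image into a fixed-length feature vector; the FaceStore class, the example vectors, and the 0.6 distance cutoff are all hypothetical and only illustrate the matching step.

```python
import math


def euclidean_distance(a, b):
    """Distance between two feature vectors of equal length."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


class FaceStore:
    """Hypothetical in-memory store mapping a user id to an enrolled feature vector."""

    def __init__(self, max_distance):
        # Matches farther away than this are rejected (value must be tuned per model).
        self.max_distance = max_distance
        self._enrolled = {}

    def enroll(self, user_id, features):
        """Step 1: store the features captured at sign-up."""
        self._enrolled[user_id] = list(features)

    def identify(self, features):
        """Step 2: return the closest enrolled user, or None if nobody is close enough."""
        best_id, best_dist = None, float("inf")
        for user_id, stored in self._enrolled.items():
            dist = euclidean_distance(stored, features)
            if dist < best_dist:
                best_id, best_dist = user_id, dist
        return best_id if best_dist <= self.max_distance else None


# Made-up vectors standing in for the output of a real face model.
store = FaceStore(max_distance=0.6)
store.enroll("alice", [0.1, 0.9, 0.3])
store.enroll("bob", [0.8, 0.2, 0.5])
```

A later scan would go through the same feature extraction and then store.identify(...). Whether the store lives on the server or on the device is the design choice the question already raises.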
I'm fairly familiar with Android application development and have created quite a few apps in Android Studio. Now I'd like to build an app which can detect a sheet of A4 paper in an image and read its content. Can you name a couple of libraries/APIs? I've checked out OpenCV and Vuforia, but couldn't find anything for this particular problem.
Thanks a lot!
The best (free) option for OCR (Optical Character Recognition) would be the Tesseract engine.
Please note however, that you'll need some background in image processing to get reasonable results.
There are libraries that do the preprocessing for you, but most of them are commercial projects.
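As an idea of what such preprocessing looks like: a typical step before OCR is binarizing the image so that ink and paper are cleanly separated. Below is a minimal sketch of Otsu's thresholding method in plain Python. It is one illustrative choice, not a required step for any particular OCR engine, and the function names and sample pixel values are made up.

```python
def otsu_threshold(pixels):
    """Return the 0..255 threshold that maximizes between-class variance.

    `pixels` is a flat list of 8-bit grayscale values; values <= threshold
    are treated as one class (dark), the rest as the other (bright).
    """
    histogram = [0] * 256
    for p in pixels:
        histogram[p] += 1

    total = len(pixels)
    total_sum = sum(value * count for value, count in enumerate(histogram))

    best_threshold, best_variance = 0, -1.0
    weight_dark, sum_dark = 0, 0
    for t in range(256):
        weight_dark += histogram[t]
        if weight_dark == 0:
            continue  # no pixels in the dark class yet
        weight_bright = total - weight_dark
        if weight_bright == 0:
            break  # every pixel is already in the dark class
        sum_dark += t * histogram[t]
        mean_dark = sum_dark / weight_dark
        mean_bright = (total_sum - sum_dark) / weight_bright
        variance = weight_dark * weight_bright * (mean_dark - mean_bright) ** 2
        if variance > best_variance:
            best_variance, best_threshold = variance, t
    return best_threshold


def binarize(pixels, threshold):
    """Map dark pixels (ink) to 0 and bright pixels (paper) to 1."""
    return [1 if p > threshold else 0 for p in pixels]


# Made-up scan line: dark ink values around 30-40, paper around 200-220.
scan_line = [30, 35, 40, 210, 220, 200, 205, 38]
binary_line = binarize(scan_line, otsu_threshold(scan_line))
```

Real photos of a page usually also need perspective correction and uneven-lighting compensation, which is where the image processing background mentioned above comes in.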
Note that I'm very new to machine learning.
I was thinking about a real-world example of using machine learning
and neural networks in an application, and I want to try it with a
mobile application that can handle image recognition with the front
camera after taking a picture of something (a cat, for example).
I really need advice on tools I can use to rapidly prototype this application with a Python backend that I will call via REST.
Thanks in advance.
If you are new to machine learning algorithms, I suggest that you use an API from Google or Microsoft to get familiar with the flow and how it works. Once you understand what the inputs and outputs are, you can try to replace the API with your own neural net, train it properly, and collect results.
Machine learning is not an easy concept, and if you start big, there is a good chance that you'll get discouraged before you finish building it. The API will give you a functional prototype very quickly and thus help you stay motivated to pursue it further.
But to answer your question more directly, TensorFlow by Google is probably the most sophisticated tool for machine learning in general right now.
There is an excellent course on deep learning with TensorFlow made by Google on Udacity.
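The "start with an API, swap in your own net later" idea can be sketched on the Python backend side like this. It is a minimal WSGI app using only the standard library; the /classify path, the JSON shape, and stub_classifier are assumptions for illustration, not any vendor's actual API. The point is that the mobile app only ever sees the REST endpoint, so the classifier behind it can be replaced without touching the client.

```python
import json


def stub_classifier(image_bytes):
    """Placeholder model. A real backend would call a hosted vision API here,
    and later a self-trained network, without changing the endpoint."""
    label = "cat" if image_bytes else "unknown"
    return {"label": label, "confidence": 0.0}


def make_app(classifier):
    """Build a WSGI app exposing POST /classify around any classifier callable."""

    def app(environ, start_response):
        is_post = environ.get("REQUEST_METHOD") == "POST"
        if not is_post or environ.get("PATH_INFO") != "/classify":
            body = json.dumps({"error": "use POST /classify"}).encode("utf-8")
            start_response("404 Not Found", [("Content-Type", "application/json")])
            return [body]

        # Input: the raw image bytes sent by the mobile app in the request body.
        length = int(environ.get("CONTENT_LENGTH") or 0)
        image_bytes = environ["wsgi.input"].read(length)

        # Output: whatever the classifier returns, serialized as JSON.
        body = json.dumps(classifier(image_bytes)).encode("utf-8")
        start_response("200 OK", [("Content-Type", "application/json"),
                                  ("Content-Length", str(len(body)))])
        return [body]

    return app


app = make_app(stub_classifier)

# To try it locally with the standard library:
#   from wsgiref.simple_server import make_server
#   make_server("", 8000, app).serve_forever()
```

Once the prototype works end to end, make_app(my_network_classifier) is the only line that has to change, where my_network_classifier is whatever function wraps your own model.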
You can follow PyImageSearch. It has a lot of material on image processing, such as face recognition and a license plate recognition system. It also uses neural networks.
Use an image recognition API, like Google Vision.
It is easy and fast to put into an application, and a lot more effective if you do not have experience and resources in ML.
I have done something similar for our company website. It is based on Caffe, though.
You can go through the source code here
However, it is a segmentation demo. You need to modify it a little.
We are working on a mobile application which has to be developed for the iOS and Android platforms. Most of the application is straightforward: it displays content from a back-end CMS, for which we are considering using PhoneGap. The application has one module in which the user can browse through a virtual house and modify colors or replace a particular item, such as a chair. This can easily be done using Unity3d, which we already have in place. Using Unity3d for the whole application is probably going to be overkill and would require a lot of effort on the application front end, while if we go the PhoneGap route, managing the 3D part is going to be a challenge.
My question is whether there is any way to export the Unity scene and use it in a PhoneGap build, so that we get the best of both packages. If not, what are the options for either building the whole application (UI and services) in Unity3d, or doing something in PhoneGap that gets the 3D part sorted?
Thanks in advance.
Figured out the solution.
I am using PhoneGap for everything. For the 3D module, I use a panorama image and display it with https://github.com/nicekei/jQuery-html5-canvas-panorama-plugin for interactivity. There are many other mobile-friendly jQuery plugins available; I chose the one above. You can find more at http://www.jquery4u.com/plugins/10-jquery-panorama-image-display-plugins/. For the walkthrough, I use annotations that link to another page, which shows a 3D panorama for that room.
You can render stills from 3D software and stitch them using any panoramic stitcher; Photoshop also does the job. Hope it helps.
Is there an Android framework that can be used in an app to recognize a 3D image and send the user to a video? This should fall under augmented reality, but so far everything I have seen uses a 2D image to produce a 3D image on the screen. My situation is the reverse of that. I tried using Vuforia but couldn't get the SDK to work, and Unity needs an Android license. DroidAR doesn't seem to fit the bill either. Alternatively, are there any tutorials on this matter? Thanks.
I have not used the feature, but Metaio has a "markerless" 3D object tracking feature as well as the ability to do video playback within the SDK. If you would rather simply redirect to a video (on YouTube, for example), I am sure that would not be exceptionally difficult.
http://www.metaio.com/software/mobile-sdk/features/
Metaio's mobile SDK is similar to Vuforia, so if you had trouble with that, you might have difficulty getting this one up and running too. If your programming skills aren't up to that, you might consider looking into Junaio, an AR browser made by Metaio. With Junaio you simply create a content channel rather than having to build the app from scratch. Again, I have not actually tried this feature yet, but the documentation seems to indicate that 3D tracking is available in Junaio:
http://www.junaio.com/develop/quickstart/3d-tracking-and-junaio/
Good luck!