web crawl the app information from Android market - android

I want to get a dataset of Android application information, which includes application name, package name, version, permission requested, etc.
The official Android application market is Google Play. There are millions of applications on the market. I want to get at least tens of thousands of application information from it, and store it into a csv file. For instance, here's an application's link:
https://play.google.com/store/apps/details?id=de.ralphsapps.snorecontrol
The problem is how to get the list of applications' url?
How to to parse the information from webpage?
Is there any good web crawler suitable for this kind of job? Or is there any scripting language, such as python, has such kind of crawl functions?
Thanks.

Google Play Store has its own format to display information objects in HTML.
Write your own HTML parser for all these to get information you need.
It's best to use JSoup for this job at JSoup.org ,
or refer to my sample tutorial on JSoup as a parser:
Parsing HTML using JSoup

Related

Building an offline android app with data from pdf

I have data in the pdf format(Not in english). I want to load the data to my app and app will be offline. What are the elements i shall be using in android to make the app which is like reading data chapterwise? As the data i have is huge,how to load it offline from pdf? What is the efficient way?
This is reference link for the app:
https://play.google.com/store/apps/details?id=com.winjit.hclite&hl=en
Getting the text of a pdf is a hard task in android. You can check this other similar questions to this topic which all have no straight forward answer: Link, Link, Link.
If you cant workaround this there are commercial libarys out there which you can find pretty easy with your search engine of choice (I don't want to promote them here).

Localize strings returned by a web service

What would be the best design to localize strings in Android that come from a web-service hosted on Google app engine?
My Android application calls a REST web-service that returns items. Each item has a "tags" attribute that contains strings.
Server-side, these tags are stored in the datastore, in English, in the item (with a StringListProperty).
The handler is hardly more complex than:
class MyHandler(webapp2.RequestHandler):
def get(self):
item = MyModel.get_by_id(id)
self.response.write(json.encode(item))
I wonder what design recommendations you can suggest to localize this "tags".
Localize server-side? This does not sound very usual (flickr, stackoverflow, etc.) don't do that. And, then, how to handle the localization? (see code snippet above)
Localize client-side? Then, what is the most efficient way to do that?
In www.cloud4apps.com you can find a free cloud web service that works similar to StackOverflow that enables the power of crowd-sourcing where users post and vote for best entries.
Then the services determine which string resources to use in your app based on votes and your approval or automatically if desired.
You can upload your current Android resource files into the online tool and Download latest translations as Android resource files or use the SDK to pull data in real-time.
Their SDK implements caching that allows your app to still use text resources even when offline.

Retrieving Movie Ratings from Websites/Blogs for Android App

Is it possible to retrieve movie/songs rating from different website or a blog and show it in android application as to which blog or website has given how much rating to a movie/song and show it in one place.
P.S: I am beginner at android application development and got this idea to make a app which could fetch ratings for a particular songs or movie . So if it is possible it will be great if you give me a little headstart as to how will it will be done .
Thanks.
I would recommend something like a external server or database that does crawling work for you.
You would need something like a list of websites/blogs you get your ratings from and you would need to write something like a crawling script for each website. That means downloading the HTML-file of the site and parse it for your information. This data must be stored in your database or the server must do this dynamically.
Now your device connects to the database/server to get the analysated data. If you let your device do the whole parsing-work, it would kill the net-traffic and would take ages to load.

getting specific information from a website in android

i want to write a program that gets the match dates from this link http://www.goal.com/en/teams/germany/148/fc-bayern-munich-news
and use it in my program i just want the dates and the matches how can i do this? in andorid
I'd write an Activity to display the data, which calls an AsyncTask to connect to the site and download the HTML. I'd then use some kind of parser to grab the data I want and save it to a database.
Have you written Java before? If not I'd start out by learning the language. Download Eclipse and write a simple program that can connect to the site and grab the HTML. Then add the parser.
Once you are that far, do the Hello World tutorial, then work your way through the other tutorials. Also learn about the Android Application Lifecycle. At that point you can start thinking about moving your code over to the Android framework.
EDIT
Here are some links to information about potential parsers & parsing approaches.
Tag Soup
What HTML parsing libraries do you recommend in Java
Two HTML parsing links
You could also consider using (hushed voice) regex/pattern matching.

Read HTML file from Website and convert into App

I am looking into developing an App that will convert a website into more readable data for an android app. I am at university and have an online notice board which can be viewed on the web but if possible I would like to transfer this into an app on android to make it more easy to read on mobile devices.
What I thinking is that the app would go to the website where the notice board is held and read in the html code to display each notice in a list adapter view. Each notice is within its own div so I assume I could use that to split each notice up into its own button on the list adapter view. Is this possible and if so how I can go about doing this. I have tried google for an answer but I have not yet found a solution to this problem.
Thanks for your help
It seems overly complicated to me. I wouldn't handle all that using Android. I'd crawl the data on a machine (server) and then I'd convert all needed data to JSON and have the Android (client) fetch the data using a simple JSON parser.
In my opinion that would be the easiest solution if you don't have access to the server the website is hosted on to get it generate a JSON feed for you directly.
EDIT: In answer to your comment Boardy.
Here is the official website of the JSON project in order to get an understanding of what it is. Then if you have access to the webserver providing that page (I assume it is a PHP based site) and want to modify or add the functionality of providing a JSON feed then you should also take a look at the PHP JSON documentation.
To parse JSON on Android check out this SO question and also don't forget to take a look at the official Android documentation on their JSON implementation.

Categories

Resources