Last week Nickycakes wrote an auto scraper, auto friend adder, auto message sender, and auto comment poster for a major social networking site in under a day. The software works flawlessly and could potentially bring in thousands of dollars from spam comment posts with affiliate links if the user were so inclined. But that’s not the point of this post.
Scraping content from other websites is one of the best ways to set up fully automated content rich websites. What types of websites? Local business directories can be created in under an hour by scraping contact information from a yellow pages style site. Weather data can be scraped from numerous sites to be reformatted and turned into a widget of some sort. Wikipedia can be easily scraped for a blurb of relevant info on pretty much any topic. Ebay can be scraped for bid information. The possibilities are pretty much endless, but one thing is for sure. There’s nothing better than making fully automated websites that update themselves with new content and pull in money without any real work after they’ve been created.
Nickycakes has been building content scrapers of different kinds for a while using php, but wanted to step up his game and learn how to make scrapers that properly handled logging into websites, storing cookies, and actually looking like a web browser. All that stuff can be easily accomplished with php/curl. So a couple weeks ago, the Cakes ordered a book off amazon called Webbots, Spiders, and Screen Scrapers. You can probably find it at borders books or something as well.
In that book was enough information to write scripts for logging on, submitting forms, parsing pretty much anything you could imagine, etc. Writing a social networking scraper/adder/commenter was a breeze.
So if you don’t want to buy a book, there are quite a few resources online to help you get started. Here are a few:
PHP CURL manual
Smaxor’s newbie curl form submitting tutorial
Using libcurl with PHP
Make sure you grab this toolbar which will help you dissect any webpage’s forms quick and easy like:
Webmaster Toolbar
Source: http://www.nickycakes.com/scraping-websites-for-fun-and-profit/
Note:
Delta Ray is experienced web scraping consultant and writes articles on Yelp Data Scraping, Linkedin Profile Scraping, Yellowpages Data Scraping, eBay Product Scraping, Amazon Product Scraping, Tripadvisor Data Scraping, Linkedin Email Scraping, Screen Scraping Services, Yelp Review Scraping and yellowpages data scraping.
Scraping content from other websites is one of the best ways to set up fully automated content rich websites. What types of websites? Local business directories can be created in under an hour by scraping contact information from a yellow pages style site. Weather data can be scraped from numerous sites to be reformatted and turned into a widget of some sort. Wikipedia can be easily scraped for a blurb of relevant info on pretty much any topic. Ebay can be scraped for bid information. The possibilities are pretty much endless, but one thing is for sure. There’s nothing better than making fully automated websites that update themselves with new content and pull in money without any real work after they’ve been created.
Nickycakes has been building content scrapers of different kinds for a while using php, but wanted to step up his game and learn how to make scrapers that properly handled logging into websites, storing cookies, and actually looking like a web browser. All that stuff can be easily accomplished with php/curl. So a couple weeks ago, the Cakes ordered a book off amazon called Webbots, Spiders, and Screen Scrapers. You can probably find it at borders books or something as well.
In that book was enough information to write scripts for logging on, submitting forms, parsing pretty much anything you could imagine, etc. Writing a social networking scraper/adder/commenter was a breeze.
So if you don’t want to buy a book, there are quite a few resources online to help you get started. Here are a few:
PHP CURL manual
Smaxor’s newbie curl form submitting tutorial
Using libcurl with PHP
Make sure you grab this toolbar which will help you dissect any webpage’s forms quick and easy like:
Webmaster Toolbar
Source: http://www.nickycakes.com/scraping-websites-for-fun-and-profit/
Note:
Delta Ray is experienced web scraping consultant and writes articles on Yelp Data Scraping, Linkedin Profile Scraping, Yellowpages Data Scraping, eBay Product Scraping, Amazon Product Scraping, Tripadvisor Data Scraping, Linkedin Email Scraping, Screen Scraping Services, Yelp Review Scraping and yellowpages data scraping.
Delta Ray is experienced web scraping consultant and writes
articles on YellowPages
Data Scraping, Tripadvisor
Data Scraping, Linkedin
Email Scraping, Amazon Product Scraping, Website Harvesting, IMDb Data
Scraping, Yelp Review Scraping, Screen Scraping Services, Yelp Review Scraping
and yellowpages data scraping.
No comments:
Post a Comment