How to "scrape" HTML and display it on your own site?

For questions and problems with the CMS core. This board is NOT for any 3rd party modules, addons, PHP scripts or anything NOT distributed with the CMS made simple package itself.
Post Reply
thumbsucker
Forum Members
Forum Members
Posts: 73
Joined: Thu Feb 09, 2006 1:38 pm

How to "scrape" HTML and display it on your own site?

Post by thumbsucker »

Anyone know the easiest way to "scrape" the HTML from another site and display it in your own?
lemkepf
Forum Members
Forum Members
Posts: 163
Joined: Tue Oct 18, 2005 8:30 pm

Re: How to "scrape" HTML and display it on your own site?

Post by lemkepf »

Don't. It's called stealing. Don't be a ******.



p.s. this stuff really pisses me off.
Last edited by lemkepf on Tue Mar 07, 2006 10:44 pm, edited 1 time in total.
thumbsucker
Forum Members
Forum Members
Posts: 73
Joined: Thu Feb 09, 2006 1:38 pm

Re: How to "scrape" HTML and display it on your own site?

Post by thumbsucker »

Don't be stupid.  It's legal when it's a government data source, which by law can't be copyrighted.  Get a clue.  ???
lemkepf
Forum Members
Forum Members
Posts: 163
Joined: Tue Oct 18, 2005 8:30 pm

Re: How to "scrape" HTML and display it on your own site?

Post by lemkepf »

You also have to realize that 99% of people wanting to do this are people trying to steal content. I've been a victim twice. I've also had clients that have been victims. It's a nasty battle to try and get that content taken down ....

Anyways, to answer your question... you can use a linux command "wget".

Code: Select all

wget -r --progress=binary -w1 -Q1000m -v -t3 -nH -np -k -l10 -Dwww.yoursite.com http://www.yoursite/index.html
Post Reply

Return to “CMSMS Core”