Click here to Skip to main content
15,917,456 members
Please Sign up or sign in to vote.
1.00/5 (2 votes)
Hi, I tried to web scrape a site using webclient but it has include files and it doesn't scrape the actual output that is produced from include file. could someone please help me how to web scrape the actual html that you see when you do inspect in firebug in morzilla

thanks
Posted
Comments
Sergey Alexandrovich Kryukov 22-Sep-13 11:17am    
Not enough information.
—SA

1 solution

First of all, Web scraping is something irrelevant to ASP.NET, "include files" or other server-side technology:
http://en.wikipedia.org/wiki/Web_scraping[^].

All what matters is HTTP request you can make and HTTP response. All details of server-side behavior are hidden from the client side. You can achieve all the Web server user can get, perhaps a little more, but no less.

Please see my comment to the question: you did not describe your problem. So, I can advice just one thing: instead of System.Net.WebClient use more general and hence powerful class System.Net.HttpWebRequest,
http://msdn.microsoft.com/en-us/library/system.net.httpwebrequest.aspx[^],
http://msdn.microsoft.com/en-us/library/system.net.webrequest.aspx[^].

Please see my past answers:
get specific data from web page[^],
How to get the data from another site[^].

—SA
 
Share this answer
 

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)



CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900