Click here to Skip to main content
15,910,471 members
Please Sign up or sign in to vote.
3.00/5 (1 vote)
See more:
Hi there,

I am a beginner to RSS and HTML Parsing.
So I want to practice my skill.

There are heaps of options in this catalogue on that page.

So I want to retrieve all the catalogue and turn them into a well-formed XML file for making a RSS.

Is there any idea about how to do it?
It would be better if someone could kindly provide code fragments.

Thanks in advance.
Posted

I have an article that uses HtmlAgilityPack to parse some of the Codeproject article/user pages. You could use that as an example.

CodeProject Article Scraper, Revisited[^]

I don't rely completely on it, but there is some code in ther that uses it.
 
Share this answer
 
Comments
franva 26-Jan-11 22:54pm    
thanks mate, :)
I don't think anyone has attempted to parse the site map yet. Why don't you give it a shot? If you run into specific issues, post here and someone will be sure to help you out. But it would be quite unlikely that you'd find a guy to write the whole parser for you from scratch.
 
Share this answer
 
Comments
franva 24-Jan-11 7:10am    
Hi Nishant,

Thanks to your reply :)
And what do you mean --- why don't you give it a shot?
The reanson why I want to parser sitemap is I want to transform it to a RSS file. That's the only reason.
I saw the example code HTML2RSS in HTML Agility Pack, but I have no idea about the XPath field, so..... I just want to know some simple and effective way to implement it.

Once again, thank you a lot ^_^

Winston

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)



CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900