Click here to Skip to main content
15,890,527 members
Please Sign up or sign in to vote.
1.00/5 (1 vote)
See more:
Writing a program in java that is going to analyze a huge amount of data that displaces in a special website; but not sure how to get the html codes from the website into the java program automatically.
Is there a special method, library, topic covering this problem or a link that can be helpful to find the solution

What I have tried:

I am trying DOM html JS tutorial but not sure if it's the suitable case.
Is Jsoup the solution?
Posted
Updated 2-Feb-22 21:59pm
v3

1 solution

If the website provides an API to access the data, then use that rather than trying to extract the data from the HTML.

Otherwise, you'll have to resort to Web scraping[^]. But be aware that you may be breaching the website's terms of use, and end up liable to prosecution. And the site owners are free to change the structure of their HTML markup at any time, or to take other steps to prevent your program from scraping their data.
 
Share this answer
 

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)



CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900