Hierarchy of Categories and Classifying Wikipedia Articles using XML Dump

9 Apr 2021 | CPOL | 10 min read
How to classify Wikipedia articles using an XML dump
A hierarchical object is built from the relationships between categories and their parent categories. A classifier then uses it to detect whether an article belongs to a given parent category, even a distant one several levels up the hierarchy.
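The idea in this summary can be illustrated with a minimal sketch. It is not the article's own implementation: the function names `build_parents` and `belongs_to`, the `max_depth` parameter, and the sample category pairs are all hypothetical, and the sketch assumes the child-to-parent pairs have already been extracted from the dump. Because category links can form cycles, the walk keeps a set of visited categories.

```python
from collections import deque

# Hypothetical (child, parent) pairs; in practice they would be extracted
# from the category links found in the XML dump.
PAIRS = [
    ("Quantum mechanics", "Physics"),
    ("Physics", "Natural sciences"),
    ("Natural sciences", "Science"),
    ("Science", "Natural sciences"),  # deliberate cycle for illustration
]


def build_parents(pairs):
    """Map each category to the set of its direct parent categories."""
    parents = {}
    for child, parent in pairs:
        parents.setdefault(child, set()).add(parent)
    return parents


def belongs_to(article_categories, target, parents, max_depth=None):
    """Walk upward, breadth first, from the article's own categories.

    Returns True if `target` is one of those categories or one of their
    ancestors within `max_depth` levels (no limit when max_depth is None).
    """
    seen = set(article_categories)
    queue = deque((category, 0) for category in seen)
    while queue:
        category, depth = queue.popleft()
        if category == target:
            return True
        if max_depth is not None and depth >= max_depth:
            continue  # do not climb above the depth limit
        for parent in parents.get(category, ()):
            if parent not in seen:  # guards against cycles
                seen.add(parent)
                queue.append((parent, depth + 1))
    return False


parents = build_parents(PAIRS)
print(belongs_to(["Quantum mechanics"], "Science", parents))
print(belongs_to(["Quantum mechanics"], "Science", parents, max_depth=1))
```

A depth limit of this kind is one possible way to keep very distant, loosely related ancestors from matching; whether the original classifier does anything similar is not stated here.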

License

This article, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)


Written By
United States
Programmer, developer, and researcher. Worked for several software companies, including Microsoft.
PhD in Theoretical Physics, D.Sc. in Solid State Physics.
Enjoys discovering patterns in nature, history, and society that contradict public opinion or are hidden from it.

Written By
Instructor / Trainer National Technical University of Ukraine "Kiev Pol
Ukraine
Professor Vladimir Shatalov works at the National Technical University of Ukraine 'Kyiv Polytechnic Institute', Slavutych branch, where he teaches Computer Science. His research interests include Data Mining, Artificial Intelligence, Theoretical Physics, and Biophysics.
His research also covers the mechanisms by which non-thermal electromagnetic and acoustic fields affect bio-liquids, and the effects of irradiation on the physical and chemical properties of water.
