data-raw/dmoz/readme.md

DMOZ

DMOZ is a volunteer run open-content directory. It groups similar content into a hierarchical categorization scheme. The data can be downloaded at: http://rdf.dmoz.org/. The structure of the schema and content files are regularly updated and can be downloaded from: http://rdf.dmoz.org/rdf/.

We parsed the XML data using the scripts posted here. The resulting domain level zipped csv which carries english translation of the categories can be accessed here. For all the data and subdomain level data, go here.

For a list of all the DMOZ categories, see categories.txt.



themains/rdomains documentation built on April 23, 2023, 8:53 a.m.