The USPTO Linked Patent Dataset release

Dear all,

We are happy to announce USPTO Linked Patent Dataset release.

Patents are widely used to protect intellectual property and a measure of innovation output. Each year, the USPTO grants over 150, 000 patents to individuals and companies all over the world. In fact, there were more than 200, 000 patent grants issued in the US in 2013. However, accessing, searching and analyzing those patents is often still cumbersome and inefficient.

Our dataset is the output of converting USPTO XML patents data into RDF from the years 2002 – 2016. This supports the integration with other data sources in order to further simplify use cases such as trend analysis, structured patent search & exploration and societal progress measurements.

The USPTO Linked Patent Dataset contains 13,014,651 entities where 2,355,579 are patents. Other entities represent Applicant, Inventor, Agent, Examiner (primary and secondary),  and assignee. All these entities amount to c.a. 168 million triples are describing the patents information.

The complete description of the dataset and SPARQL endpoint are available on the DataHub:

We really appreciate feedback and are open to collaborations.
If you happen to have use cases utilizing this dataset, please contact us.


About Mofeed Hassan

I am a Ph.D. student in AKSW. My Work concerns Link Discovery specially Supervised instance interlinking in semantic web
This entry was posted in Dataset Release and tagged , , , . Bookmark the permalink.