Data Download Tables
Table Name | Size | Description | # of Rows | Origin |
---|---|---|---|---|
application | 0.045 GB | Information on the applications for granted patents | 6,114,791 | raw |
assignee | 0.012 GB | Disambiguated assignee data | 392,640 | disamb |
claim | 10.416 GB | Full text of patent claims, including dependency and sequence | 86,415,009 | raw |
cpc_current | 0.805 GB | Current CPC classification data for all patents (applied retrospectively to all patents) | 30,803,091 | raw (from separate classification files) |
cpc_group | 0.02 MB | Lookup table of current CPC groups | 656 | raw (from separate classification files) |
cpc_subgroup | 5.118 MB | Lookup table of current CPC subgroups | 259,436 | raw (from separate classification files) |
cpc_subsection | 0.003 MB | Lookup table of current CPC subsections | 127 | raw (from separate classification files) |
foreigncitation | 0.633 GB | Citations made to foreign patents by US patents | 20,703,512 | raw |
government_interest | 0.003 GB | Raw government interest statements on all patents (where available) | 122,888 | raw |
government_organization | 0.003 MB | Organization names and related agency hierarchy parsed from the government interest statements on all patents (where available) | 137 | processed |
inventor | 0.036 GB | Disambiguated inventor data | 3,682,415 | disamb |
ipcr | 0.255 GB | International Patent Classification data for all patents (as of publication date) | 10,518,656 | raw |
lawyer | 4.871 MB | Disambiguated lawyer data | 163,762 | disamb |
location | 3.247 MB | Disambiguated location data, including latitude and longitude | 130,524 | disamb |
location_assignee | 14.011 MB | Metadata table for many-to-many relationships | 5,493,878 | disamb (linking table) |
location_inventor | 52.898 MB | Metadata table for many-to-many relationships | 14,286,655 | disamb (linking table) |
mainclass | 0.002 MB | Lookup table of original USPC main classes (as of patent publication date) | 1,242 | raw |
mainclass_current | 0.007 MB | Lookup table of current USPC main technology classes (applied retrospectively to all patents) | 532 | raw (from separate classification files) |
nber | 0.104 GB | NBER classification data for all patents up to May 2015 | 5,077,298 | raw (from separate classification files) |
nber_category | 0.0 MB | Lookup table for NBER categories | 6 | raw (from separate classification files) |
nber_subcategory | 0.001 MB | Lookup table for NBER subcategories | 37 | raw (from separate classification files) |
otherreference | 2.336 GB | Non-patent citations mentioned in patents (e.g. articles, papers, etc.) | 29,177,088 | raw |
patent | 1.125 GB | Data on granted patents | 6,114,791 | raw |
patent_assignee | 0.083 GB | Metadata table for many-to-many relationships | 5,492,850 | disamb (linking table) |
patent_contractawardnumber | 0.87 MB | Contract or award numbers parsed from the government interest statements on all patents (where available) | 132,766 | processed |
patent_govintorg | 0.324 MB | Metadata table with patent-to-organization relationships linked to the government_organization table | 141,422 | processed |
patent_inventor | 0.098 GB | Metadata table for many-to-many relationships | 14,281,512 | disamb (linking table) |
patent_lawyer | 0.032 GB | Metadata table for many-to-many relationships | 7,025,875 | disamb (linking table) |
rawassignee | 0.355 GB | Raw assignee information as it appears in the source text and XML files | 5,493,878 | raw |
rawinventor | 0.763 GB | Raw inventor information as it appears in the source text and XML files | 14,286,655 | raw |
rawlawyer | 0.326 GB | Raw lawyer information as it appears in the source text and XML files | 7,026,536 | raw |
rawlocation | 0.697 GB | Raw location data for inventors and assignees, as it appears in the source text and XML files | 20,108,757 | raw |
subclass | 0.396 MB | Lookup table of original USPC subclasses (as of patent publication date) | 265,338 | raw |
subclass_current | 1.939 MB | Lookup table of current USPC subclasses (applied retrospectively to all patents) | 174,673 | raw (from separate classification files) |
usapplicationcitation | 0.72 GB | Citations made to US patent applications by US patents | 22,236,006 | raw |
uspatentcitation | 2.438 GB | Citations made to US granted patents by US patents | 84,508,523 | raw |
uspc | 0.522 GB | USPC classification data for all patents | 20,868,391 | raw |
uspc_current | 0.54 GB | Current USPC classification data for all patents up to May 2015 | 22,271,739 | raw (from separate classification files) |
usreldoc | 0.219 GB | U.S. related documents (post-2005 patents only) | 6,660,685 | raw |
wipo | 15.321 MB | WIPO technology fields for all patents | 8,074,534 | raw (from separate classification files) |
wipo_field | 0.001 MB | Lookup table of WIPO technology fields | 70 | raw (from separate classification files) |
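Several of the tables listed above (patent_inventor, patent_assignee, patent_lawyer, location_inventor, location_assignee) are linking tables for many-to-many relationships: one patent can have several inventors, and one inventor can appear on several patents. The sketch below illustrates how such a linking table is typically used in a join. It runs against a tiny in-memory SQLite database with made-up rows. The column names (`id`, `title`, `name_last`, `patent_id`, `inventor_id`) are assumptions chosen for illustration, not the documented PatentsView schema, so check the actual column names in the downloaded files before adapting it.

```python
import sqlite3

# Hypothetical miniature schema. Table names follow the listing above;
# all column names and rows are illustrative assumptions.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE patent (id TEXT PRIMARY KEY, title TEXT);
CREATE TABLE inventor (id TEXT PRIMARY KEY, name_last TEXT);
CREATE TABLE patent_inventor (patent_id TEXT, inventor_id TEXT);

INSERT INTO patent VALUES ('P1', 'Widget'), ('P2', 'Gadget');
INSERT INTO inventor VALUES ('I1', 'Lee'), ('I2', 'Garcia');
-- P1 has two inventors; inventor I1 appears on two patents.
INSERT INTO patent_inventor VALUES ('P1', 'I1'), ('P1', 'I2'), ('P2', 'I1');
""")


def inventors_per_patent(connection):
    """Return (patent id, inventor last name) pairs via the linking table."""
    query = """
        SELECT p.id, i.name_last
        FROM patent AS p
        JOIN patent_inventor AS pi ON pi.patent_id = p.id
        JOIN inventor AS i ON i.id = pi.inventor_id
        ORDER BY p.id, i.name_last
    """
    return connection.execute(query).fetchall()


rows = inventors_per_patent(conn)
for patent_id, last_name in rows:
    print(patent_id, last_name)
```

The same two-step join pattern would apply to the other linking tables, with the disambiguated entity table (assignee, lawyer, or location) taking the place of inventor.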
The PatentsView database is sourced from USPTO-provided text and XML data on published patent applications (2001 to the most recent update) and granted patents (1976 to the most recent update). The current PatentsView database is available upon request as a MySQL dump. The patent applications database, which contains all granted and non-granted applications, is also available upon request. After March 2016, the applications database will not contain the same inventor IDs as the PatentsView database. Only inventors on granted applications can be matched between the PatentsView and applications databases, using the granted application ID.
This work was created through a government contract funded by the Office of the Chief Economist in the US Patent and Trademark Office. Users are free to use, share, or adapt the material for any purpose, subject to the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/).
Attribution should be given to PatentsView (www.patentsview.org) for use, distribution, or derivative works.
Starting from the PatentsView database, simple assignee and lawyer disambiguations are performed, and the patents are geocoded with a location-based disambiguation. The data are then fed into the inventor disambiguation algorithm, which identifies clusters of inventor names determined to refer to the same individual. Because the disambiguation of inventor identities is an ongoing effort, the PatentsView data tables are likely to contain observable errors. The team welcomes feedback as we continue to improve our disambiguation methodology.
For more information, visit the Methods and Sources section of the website.