patentr: Access USPTO Bulk Data in Tidy Rectangular Format

Converts TXT and XML bulk data curated by the United States Patent and Trademark Office (USPTO) into tidy, rectangular format. Data can be converted directly after download from the USPTO bulk data website, eliminating the need for users to wrangle multiple data formats to build large patent databases. Data details can be found on the USPTO website <https://bulkdata.uspto.gov/>. Currently, all three formats can be converted to rectangular CSV format: (1) TXT data (1976-2001); (2) XML format 1 data (2002-2004); and (3) XML format 2 data (2005-current). Relevant literature that uses data from USPTO includes Wada (2020) <doi:10.1007/s11192-020-03674-4> and Plaza & Albert (2008) <doi:10.1007/s11192-007-1763-3>.
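
The year ranges in the description imply a simple lookup from issue year to source format. The sketch below is illustrative only: it is written in Python, it is not part of patentr, and the function name and returned labels are assumptions made for this example. It encodes only the ranges stated above.

```python
def uspto_format_for_year(year: int) -> str:
    """Return the bulk data format implied by the description for a given year.

    Hypothetical helper for illustration; not part of the patentr API.
    The year boundaries come from the package description.
    """
    if year < 1976:
        # The description lists no bulk data format before 1976.
        raise ValueError("no bulk data format is listed for years before 1976")
    if year <= 2001:
        return "TXT"
    if year <= 2004:
        return "XML format 1"
    # 2005 through the current year
    return "XML format 2"


if __name__ == "__main__":
    for y in (1976, 2001, 2002, 2004, 2005):
        print(y, uspto_format_for_year(y))
```

A converter that accepts an arbitrary year would need a dispatch of this kind before parsing, since the three formats are structured differently.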

Version: 0.1.4
Depends: R (≥ 2.10)
Imports: Rcpp (≥ 1.0.5), utils, lubridate (≥ 1.7.9), magrittr (≥ 2.0), dplyr (≥ 1.0.2), rlang (≥ 0.4.7), xml2 (≥ 1.3.2), progress (≥ 1.2.2)
LinkingTo: Rcpp
Suggests: testthat, covr, knitr, readr, rmarkdown, tibble
Published: 2021-09-12
Author: Raoul Wadhwa [aut, cre], James Yu [aut], Hayley Beltz [aut], Milind Desai [aut], Jacob Scott [aut], Peter Erdi [aut]
Maintainer: Raoul Wadhwa <raoulwadhwa at gmail.com>
BugReports: https://github.com/JYProjs/patentr/issues
License: MIT + file LICENSE
URL: https://JYProjs.github.io/patentr/
NeedsCompilation: yes
CRAN checks: patentr results

Documentation:

Reference manual: patentr.pdf
Vignettes: Introduction to patentr

Downloads:

Package source: patentr_0.1.4.tar.gz
Windows binaries: r-devel: patentr_0.1.4.zip, r-release: patentr_0.1.4.zip, r-oldrel: patentr_0.1.4.zip
macOS binaries: r-release (arm64): patentr_0.1.4.tgz, r-oldrel (arm64): patentr_0.1.4.tgz, r-release (x86_64): patentr_0.1.4.tgz
Old sources: patentr archive

Linking:

Please use the canonical form https://CRAN.R-project.org/package=patentr to link to this page.