What is included in the LexisNexis® IP DataDirect dataset?
What global patent data coverage does IP DataDirect provide?
How is the patent data structured and delivered?
What additional data enrichments are included?
How frequently is the patent data updated?
Can I customize or select the data I receive?
What is included in the LexisNexis® IP DataDirect dataset?
LexisNexis® IP DataDirect provides access to one of the world’s most comprehensive collections of global patent data. The dataset aggregates information from patent offices around the world and delivers it in a standardized format designed for large-scale research, analytics, and system integration.
The IP DataDirect dataset includes more than 178 million patent documents from over 109 patent authorities, with full-text coverage from 70 jurisdictions.
Patent publications are enriched with additional metadata and delivered in a harmonized format to simplify integration with internal patent databases, analytics platforms, and research systems.
What global patent data coverage does IP DataDirect provide?
IP DataDirect collects and standardizes patent data from a wide range of global patent authorities, providing access to a large repository of patent publications and associated metadata.
The dataset includes:
178M+ patent documents
109+ patent authorities
70 authorities with full-text coverage
112M+ searchable PDFs
102M+ publications with images
Major full-text authorities include:
United States (US)
European Patent Office (EP)
World Intellectual Property Organization (WO)
China (CN)
Japan (JP)
Korea (KR)
Germany (DE)
United Kingdom (GB)
France (FR)
Canada (CA)
Additional bibliographic and abstract data is available from a broader set of international patent authorities.
Here you can find the data coverage for each authority and the data upload schedule: Data Coverage.
How is the patent data structured and delivered?
IP DataDirect provides patent data at the document level, meaning each record corresponds to an individual patent publication.
Each publication is delivered in a standardized XML format based on WIPO Standard ST.36, ensuring consistency across data from multiple patent offices.
This standardized structure allows organizations to ingest and process patent data consistently across jurisdictions and systems.
Patent publications may include:
Bibliographic information
Abstracts and titles
Patent classifications
Citations
Legal status events
Patent family relationships
Ownership and assignee information
Associated documents such as high-resolution images and optimized PDF files can also be delivered alongside the structured XML data.
What additional data enrichments are included?
In addition to the original patent office data, IP DataDirect provides several enhancements that improve data usability and analysis.
These enhancements include:
Standardized and normalized entity names
Corporate ownership and affiliation mapping
Patent family relationships (multiple family definitions)
Forward and backward citation links
Standardized legal status events
Machine-translated patent text
Claim tagging
Links to scientific literature such as Scopus
These value-added elements help organizations perform more advanced analytics and improve data consistency across jurisdictions.
How frequently is the patent data updated?
IP DataDirect provides frequent updates to ensure access to the most current patent information.
New and updated patent publications are typically made available within hours of being received from source patent offices, allowing organizations to maintain near real-time patent databases.
In the Data Coverage page here, scroll down to find more information as to the Data Upload Schedule.
Can I customize or select the data I receive?
IP DataDirect allows organizations to tailor the dataset they receive based on their specific needs.
Data feeds can be configured using parameters such as:
Patent authority
Technology classifications
Date ranges
Publication types
Specific data elements
This flexibility allows organizations to retrieve only the data relevant to their workflows, reducing processing overhead and simplifying integration with internal systems.
Summary
LexisNexis® IP DataDirect delivers one of the most comprehensive global patent datasets available, combining large-scale patent office coverage with harmonized data structures and value-added enhancements.
By standardizing patent publications from multiple jurisdictions into a single XML-based format and enriching them with normalized metadata, IP DataDirect enables organizations to build powerful internal patent databases, analytics platforms, and research tools using trusted global patent data.
If you have further questions please reach out to a member of the team. Complete this form and we'll be in touch shortly.