
Workflow for Patent Text Analysis Using HPC Resources

Image
A diagram outlining the workflow for data processing, text analysis, and statistical analysis using high-performance computing (HPC) resources. It highlights key steps: data cleaning, text analysis with NLP techniques, and statistical modeling, supported by advanced computational infrastructure.
Marek Giebel/DeiC

A three-step process for patent text analysis, supported by HPC resources:

Data Cleaning and Pre-processing: Organises and formats raw patent data for analysis.

Text Analysis Techniques: Applies NLP methods to assess readability, word types, and linguistic complexity.

Statistical Analysis: Conducts regression and correlation analyses to extract trends and insights.

Computational Resources: HPC infrastructure is critical for managing large datasets and performing complex analyses efficiently.
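
The three steps above can be sketched on a toy scale as follows. This is a minimal illustration, not the authors' pipeline: the source does not say which readability measure, tools, or models were used, so the Flesch Reading Ease score, the vowel-group syllable heuristic, the Pearson correlation, and all function names here are assumptions chosen for the example. Word-type analysis (part-of-speech tagging) is left out because it would need an external NLP library.

```python
import math
import re


def clean_text(raw: str) -> str:
    """Step 1 (assumed form): drop tag-like markup and collapse whitespace."""
    no_tags = re.sub(r"<[^>]+>", " ", raw)
    return re.sub(r"\s+", " ", no_tags).strip()


def count_syllables(word: str) -> int:
    """Rough heuristic: one syllable per run of vowels, never fewer than one."""
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))


def flesch_reading_ease(text: str) -> float:
    """Step 2 (assumed metric): Flesch Reading Ease; higher means easier to read."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z]+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")
    syllables = sum(count_syllables(w) for w in words)
    return (
        206.835
        - 1.015 * (len(words) / len(sentences))
        - 84.6 * (syllables / len(words))
    )


def pearson(xs: list[float], ys: list[float]) -> float:
    """Step 3 (assumed statistic): Pearson correlation of two equal-length series."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


if __name__ == "__main__":
    # Made-up snippets standing in for patent abstracts; not real data.
    raw_abstracts = [
        "<p>A device holds a cup.  It is small.</p>",
        "<p>An apparatus comprising a plurality of interconnected modules "
        "configured to facilitate thermal regulation.</p>",
        "<p>The method heats water. The water then cools slowly in a tank.</p>",
    ]
    cleaned = [clean_text(a) for a in raw_abstracts]
    scores = [flesch_reading_ease(a) for a in cleaned]
    lengths = [float(len(a.split())) for a in cleaned]
    print("readability scores:", scores)
    print("correlation with word count:", pearson(lengths, scores))
```

On a real patent corpus each abstract can be scored independently, so this kind of per-document work is straightforward to spread across many cores or nodes, which is where HPC resources help.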

Created by Anne Rahbek-Damm and Marek Giebel

Revised
10 Dec 2024