TL;DR: The attempt in this paper to automate the whole process not only helps create final patent maps for topic analyses, but also facilitates or improves other patent analysis tasks such as patent classification, organization, knowledge sharing, and prior art searches.
Abstract: Patent documents contain important research results. However, they are lengthy and rich in technical terminology such that it takes a lot of human efforts for analyses. Automatic tools for assisting patent engineers or decision makers in patent analysis are in great demand. This paper describes a series of text mining techniques that conforms to the analytical process used by patent analysts. These techniques include text segmentation, summary extraction, feature selection, term association, cluster generation, topic identification, and information mapping. The issues of efficiency and effectiveness are considered in the design of these techniques. Some important features of the proposed methodology include a rigorous approach to verify the usefulness of segment extracts as the document surrogates, a corpus- and dictionary-free algorithm for keyphrase extraction, an efficient co-word analysis method that can be applied to large volume of patents, and an automatic procedure to create generic cluster titles for ease of result interpretation. Evaluation of these techniques was conducted. The results confirm that the machine-generated summaries do preserve more important content words than some other sections for classification. To demonstrate the feasibility, the proposed methodology was applied to a real-world patent set for domain analysis and mapping, which shows that our approach is more effective than existing classification systems. The attempt in this paper to automate the whole process not only helps create final patent maps for topic analyses, but also facilitates or improves other patent analysis tasks such as patent classification, organization, knowledge sharing, and prior art searches.
TL;DR: In this article, a conceptual framework is developed showing the use of patent information in core areas of technology management, such as competitor monitoring, technology assessment, R&D portfolio management, identification and assessment of potential sources for the external generation of technological knowledge, especially by means of mergers and acquisitions, and human resource management.
TL;DR: A network-based analysis is proposed, an alternative method for citation analysis that provides richer information and thus enables deeper analysis since it takes more diverse keywords into account and produces more meaningful indexes.
TL;DR: Text mining is used to transform patent documents into structured data to identify keyword vectors and principal component analysis is employed to reduce the numbers of keyword vectors to make suitable for use on a two-dimensional map.
TL;DR: In this article, the business potential of polymer solar cells is reviewed and the market opportunities analyzed on the basis of the currently reported and projected performance and manufacturing cost of polysilicon solar cells.