Skip navigation

A patent mining approach to accurately identifying Innovative Industrial Clusters based on the multivariate DBSCAN algorithm

A patent mining approach to accurately identifying Innovative Industrial Clusters based on the multivariate DBSCAN algorithm

Zeng, Siping, Wang, Ting, Lin, Wenguang, Chen, Zhizhen (Ryan) ORCID logoORCID: https://orcid.org/0000-0001-6656-5854 and Xiao, Renbin (2024) A patent mining approach to accurately identifying Innovative Industrial Clusters based on the multivariate DBSCAN algorithm. Systems, 12:321. pp. 1-28. ISSN 2079-8954 (Online) (doi:10.3390/systems12090321)

[thumbnail of VoR]
Preview
PDF (VoR)
47811_CHEN_A_patent_mining_approach_to_accurately_identifying_Innovative_Industrial_Clusters.pdf - Published Version
Available under License Creative Commons Attribution.

Download (7MB) | Preview

Abstract

Innovative Industrial Clusters (IIC), characterized by geographical aggregation and technological collaboration among technology enterprises and institutions, serve as pivotal drivers of regional economic competitiveness and technological advancements. Prior research on cluster identification, crucial for IIC analysis, has predominantly emphasized geographical dimensions while overlooking technological proximity. Addressing these limitations, this study introduces a comprehensive framework incorporating multiple indices and methods for accurately identifying IIC using patent data. To unearth latent technological insights within patent documents, Latent Dirichlet Allocation (LDA) is employed to generate topics from a collection of terms. Utilizing the applicants’ names and addresses recorded in patents, an Application Programming Interface (API) map systems facilitates the extraction of geographic locations. Subsequently, a Multivariate Density‑Based Spatial Clustering of Applications with Noise (MDBSCAN) algorithm, which accounts for both technological and spatial distances, is deployed to delineate IIC. Moreover, a bipartite network model based on patent geographic information collected from the patent is constructed to analyze the technological distribution on the geography and development mode of IIC. The utilization of the model and methodologies is demonstrated through a case study on the China flexible electronics industry (FEI). The findings reveal that the clusters identified via this novel approach are significantly correlated with both technological innovation and geographical factors. Moreover, the MDBSCAN algorithm demonstrates notable superiority over other algorithms in terms of computational precision and efficiency, as evidenced by the case analysis.

Item Type: Article
Uncontrolled Keywords: bipartite network; density‑based spatial clustering; innovative industrial clusters; latent Dirichlet allocation; patent analysis
Subjects: H Social Sciences > H Social Sciences (General)
H Social Sciences > HD Industries. Land use. Labor
Q Science > QA Mathematics
Faculty / School / Research Centre / Research Group: Greenwich Business School
Greenwich Business School > School of Accounting, Finance and Economics
Last Modified: 12 Oct 2024 09:28
URI: http://gala.gre.ac.uk/id/eprint/47811

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year

View more statistics