Technological competitiveness of China's internet platformers: comparison of Google and Baidu by using patent text information

Agrawal

Gans

and

Goldfarb

(

2018

Prediction Machines: The Simple Economics of Artificial Intelligence

Harvard Business School Press

Arts

Cassiman

and

Gomez

J.C.

(

2017

), “

Text matching to measure patent similarity

”,

Strategic Management Journal

, Vol.

No.

, pp.

Bell

and

Pavitt

(

1993

), “

Technological accumulation and industrial growth: contrasts between developed and developing countries

”,

Industrial and Corporate Change

, Vol.

No.

, pp.

157

210

Biancotti

and

Ciocca

(

2018

), “

Regulating data superpower in the age of AI

”,

Realtime Economic Issues Watch, October 23, 2018

Peterson Institute for International Economics

Cho

D.S.

Kim

D.J.

and

Rhee

D.K.

(

1998

), “

Latecomer strategies: evidence from the semiconductor industry in Japan and Korea

”,

Organization Science

, Vol.

No.

, pp.

489

505

Chorzempa

Triolo

and

Saks

(

2018

), “

China’s social credit system: a mark of progress or a threat to privacy?

”,

Peterson Institute for International Economics

Policy Brief 18-14

Economist

(

2020

), “

Special report: the data economy

”,

The Economist, Feb 22, 2020

London

Fagerberg

and

Godinho

M.M.

(

2005

), “

Innovation and catching-up

”,

The Oxford Handbook of Innovation

Oxford University Press

New York, NY

, pp.

514

543

Fan

(

2006

), “

Catching up through developing innovation capability: evidence from China’s telecomequipment industry

”,

Technovation

, Vol.

No.

, pp.

359

368

Goldfarb

and

Trefler

(

2018

), “

AI and international trade

”,

NBER Working Paper #24254

Cambridge MA

Kashani

E.S.

Radosevic

Kiamehr

and

Gholizadeh

(

2022

), “

The intellectual evolution of the technological catch-up literature: bibliometric analysis

”,

Research Policy

, Vol.

No.

, p.

104538

Kim

(

1998

), “

Crisis construction and organizational learning: capability building in catching-up at Hyundai motor

”,

Organization Science

, Vol.

No.

, pp.

506

521

Lee

(

2013

Schumpeterian Analysis of Economic Catch-up: Knowledge, Path-Creation, and the Middle-Income Trap

Cambridge University Press

London

McInnes

Healy

and

Melville

(

2018

), “

UMAP: uniform manifold approximation and projection for dimension reduction

”,

6, Dec 2018, arXiv preprint arXiv:1802.03426

Mathews

J.A.

(

2006

), “

Dragon multinationals: new players in 21st century globalization

”,

Asia Pacific Journal of Management

, Vol.

No.

, pp.

Miao

Song

Lee

and

Jin

(

2018

), “

Technological catch-up by east Asian firms: trends, issues, and future research agenda

”,

Asia Pacific Journal of Management

, Vol.

No.

, pp.

639

669

Mikolov

Chen

Corrado

and

Dean

(

2013

), “

Efficient estimation of word representations in vector space

”,

In ICLR

Motohashi

(

2020

), “

Science and technology co-evolution in AI: empirical understanding through a linked dataset of scientific articles and patents

”,

RIETI Discussion Paper Series 20-E-010

RIETI

Tokyo Japan

Motohashi

and

Zhu

(

2023

), “

Identifying technology opportunity using dual-attention model and technology-market concordance matrix

”,

Technological Forecasting and Social Change

, Vol.

197

, p.

122916

Motohashi

Koshiba

and

Ikeuchi

(

2019

), “

A method of extracting content information from patent documents and comparison of their characteristics by applicant type by using the vector space model of distributed expressions

”,

NISTEP Discussion Paper No. 175

MEXT

Japan, Tokyo

, (

in Japanese

Nagaoka

Motohashi

and

Goto

(

2010

), “

Patent statistics as an innovation indicator

”, in

Hall

and

Rosenberg

(Eds),

Handbook of the Economics of Innovation

Elsevier Science

North Holland

, Vol.

Park

and

Geum

(

2022

), “

Two-stage technology opportunity discovery for firm-level decision making: GCN-based link-prediction approach

”,

Technological Forecasting and Social Change

, Vol.

183

, p.

121934

Sugawara

Kobayashi

and

Iwasaki

(

2016

), “

On approximately searching for similar word embeddings

”,

Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics

Trajtenberg

(

2018

), “

Artificial intelligence as the next GPT: a Political-Economy perspective

”,

NBER Working Paper #24245

Cambridge MA

Wang

Roijakkers

and

Vanhaverbeke

(

2014

), “

How fast do Chinese firms learn and catch up? Evidence from patent citations

”,

Scientometrics

, Vol.

No.

, pp.

743

761

Wang

Chen

Wang

and

Kuo

(

2019

), “

Evaluating word embedding models: methods and experimental results

”,

APSIPA Transactions on Signal and Information Processing

, Vol.

No.

, p.

e19

Younge

K.A.

and

Kuhn

J.M.

(

2016

Patent-to-Patent Similarity: A Vector Space Model

SSRN

k-means++ was used to assign all words derived by the Skip-gram model into 24 clusters. We chose the number of clusters arbitrarily. The words in each cluster were presented in the form of word cloud. The Skip-gram model assumes that similar words are more likely to appear in the same context (window). Therefore, in fact, the words in each cluster are supposed to be associative and related, not exactly to be similar.

Appendix 2. Document cluster labels

Instead of labeling document clusters only by the word clouds, we also adopted the patent titles as complementary information. We picked up ten patents of each cluster, which were nearest to its centroid.

Appendix 3. Tuning of explored range in NGT

NGT has a primary parameter ϵ that defines the explored range for the graph, allowing us to achieve higher precision. As the “No Free Lunch” theorem, the more extensive the explored range, the higher the precision, the longer the search time. To investigate the relationship between the explored range ϵ and accuracy, we randomly collect n patents from the corpus. Denote N_true(i) as the true nearest 200 neighbors of patent i, and N_ngt(i, ϵ) the approximated nearest 200 neighbors of patent i given by NGT. Then, the accuracy of given ϵ value is calculated by the following:

A c c u r a c y (ϵ) = \frac{1}{n} \sum_{i = 1}^{n} \frac{l e n (N_{t r u e} (i) \cap N_{n g t} (i, ϵ))}{200}

In our case, we collected a random sample of 500 patents and set the range of ϵ from 0.05 to 1 with a step 0.05. The following figures shows the change of accuracy by tuning the value of ϵ. For the following results, we set the ϵ as 0.35, which had a 0.997 accuracy rate and plausible running time in the experiment.

2023

Kazuyuki Motohashi and Chen Zhu.

Figure 1.

Research framework

Figure 2.

Internet-related patents by application year

Figure 3.

Distribution of cosine similarity between pairs within patent families

Figure 4.

Histograms of cosine similarity between pairs within patent families

Figure 5.

Word crowd of clustering results

Figure 6.

UMAP visualization of patent contents and clustering results

Figure 7.

Composition of patent contents by country

Figure 8.

Comparison of Google and Baidu patents

Figure 9.

RCA of Google/Baidu patents in each country

Figure 10.

Cosine similarity of 200th nearest patents

Figure 11.

Share of USPTO patents in 200 neighbors by country

Figure 12.

Graphical interpretation of NGT results

Figure 13.

Cumulativeness of Google and Baidu patents

Figure 14.

Impact of Google and Baidu patents

Figure A1.

Word cloud results for word embedding

Figure A2.

Tuning explored range in NGT analysis

Table 1.

Cosine similarity between within patent family pairs

Country	Mean	SD	Min	25%	50%	75%	Max
US	0.97	0.05	0.25	0.99	1.00	1.00	1.00
CN	0.95	0.07	0.53	0.92	0.98	0.99	1.00
USCN	0.97	0.05	0.28	0.96	0.99	1.00	1.00
US*	0.95	0.10	0.14	0.99	1.00	1.00	1.00
CN*	0.89	0.14	0.24	0.84	0.97	0.99	1.00
USCN*	0.94	0.10	0.11	0.94	0.98	1.00	1.00

Country	Mean	SD	Min	25%	50%	75%	Max
US	0.97	0.05	0.25	0.99	1.00	1.00	1.00
CN	0.95	0.07	0.53	0.92	0.98	0.99	1.00
USCN	0.97	0.05	0.28	0.96	0.99	1.00	1.00
US*	0.95	0.10	0.14	0.99	1.00	1.00	1.00
CN*	0.89	0.14	0.24	0.84	0.97	0.99	1.00
USCN*	0.94	0.10	0.11	0.94	0.98	1.00	1.00

Note:

(*) denoted the results of TF-IDF weighted document embedding

Source: Created by authors

Table A1.

Document cluster labels

Labels	nearest10_title	IPC
0	Method and device for obtaining combined image	G06K9/62
0	Digital image visualized management and retrieval for communication network	G06F17/30
0	Terminal device, intelligent mobile phone, and face identification-based authentication method and system	G06K9/00
0	Remote sensing image significance target detection method and system based on Hadoop	G06F17/30
0	Method for detecting over-exposure area in monitoring video image combining multiple features	G06K9/62
0	Method and system for detection of representative area of automatic quasi object type image	G06F17/30
0	Station identification method and device	G06K9/00
0	Method for generating and applying image search code technique	G06F17/30
0	Image matching method and image matching device	G06K9/62
0	Method and system for replacing background images of smart camera in real time	G06F3/0484
1	Distributed storage method and apparatus, and data processing method and apparatus	G06F17/30
1	Massive real-time data synchronization system based on private cloud storage	H04L29/08
1	Distribution and utilization global total data transmission and storage method and device and electronic equipment	G06F17/30
1	Data rapid distribution method and device	H04L29/06
1	Method for acquiring and converting data of metering system of intelligent transformer substation	G06F17/30
1	Method of pre-caching or pre-fetching data utilizing thread lists and multimedia editing systems using such pre-caching	G06F3/06
1	Database normalization storage system and method suitable for use in multi-model satellite testing	G06F17/30
1	Data audits based on timestamp criteria in replicated data bases within digital mobile telecommunication system	G06F17/30
1	Write operation control method, system and device and computer storage medium	G06F3/06
1	Smart storage platform apparatus and method for efficient storage and real-time analysis of big data	G06F3/06
2	Context-based photograph sharing platform for property inspections	G06F17/30
2	Systems and methods for constructing and using models of memorability in computing and communications applications	G06F3/048
2	Systems and methods for constructing and using models of memorability in computing and communications applications	G06F3/048
2	Systems and methods for constructing and using models of memorability in computing and communications applications	G06F3/048
2	Incentives for content consumption	G06Q30/00
2	Method and apparatus for locating errors in documents via database queries, similarity-based information retrieval and modeling the errors for error resolution	G06F17/30
2	Method and system for electronic display of photographs	G06F17/30
2	Three-dimensional web crawler	G06F17/30
2	Intelligent integrating system for crowdsourcing and collaborative intelligence in human- and device- adaptive query-response networks	G06F17/30
2	Methods and systems for annotation of digital information	G06F17/24
3	Intelligent liquid warehousing device	G06K9/00
3	Internet-of-things-based water level monitoring system for water conservancy and hydropower engineering	H04L29/08
3	Touch control input device used for electronic information equipment	G06F3/041
3	Output device and wearable display	G09G5/00
3	Diversified reinforced tablet computer system	G06F1/16
3	Force touch module, preparation method thereof, touch screen panel and display device	G06F3/041
3	Luminous band display type sliding touch bar and display method of touch luminous band	G06F3/041
3	Economical skin-pattern-acquisition and analysis apparatus for access control; systems controlled thereby	G06K9/00
3	Shield machine posture solving device based on VBA writing	G06F9/44
3	Touch-control module, touch screen and intelligent device and stereo touch-control method	G06F3/041
4	Method for understanding questions in question type automatic question-answer systems on basis of rule	G06F17/27
4	Data searching method and system based on semantic analysis	G06F17/27
4	Information searching method based on metadata	G06F17/30
4	Relevancy priority ordering method used for environmental protection regulation retrieval	G06F17/30
4	Information management, retrieval and display system and associated method	G06F17/30
4	Information management, retrieval and display systems and associated methods	G06F7/00
4	Information management, retrieval and display system and associated method	G06F17/30
4	Method of indexing words in handwritten document images using image hash tables	G06F17/30
4	Method for searching pattern matching index	G06F17/30
4	System, method and program product for answering questions using a search engine	G06F17/30
5	Search engine method based on keyword resolution scheduling	G06F17/30
5	Method and system for automatically converting dynamic form page to HTML5 page	G06F17/22
5	Automatic access of electronic information through machine-readable codes on printed documents	G06F12/00
5	Electronic commerce system for updating information	G06F12/00
5	Web service multithreading file uploading system	H04L29/08
5	System and method for creating and posting media lists for purposes of subsequent playback	G06F3/0482
5	System and method for creating and posting media lists for purposes of subsequent playback	G06F15/16
5	System and method for creating and posting media lists for purposes of subsequent playback	G06F15/16
5	Pay per record system and method	H04L29/06
5	Dynamic generation of target files from template files and tracking of the processing of target files	G06F7/00
6	Wired security access control device of financial industry network and access method of wired security access control device	H04L29/06
6	Vehicle identification system and method	G06F17/30
6	Control system	G06F3/16
6	Plug type audio device and signal processing method	G06F3/16
6	Touch display device and touch display method	G06F3/041
6	Method and device for playing audio data in sound card signal input channel in real time	G06F3/16
6	Portal access control system	G06F7/04
6	Method and device for displaying states of ports of switch	H04L12/24
6	Computer control system	G06F3/00
6	Login method and device for user identified by radio frequency	G06F21/00
7	Device, method and equipment for information data interaction for processing information data	G06F17/30
7	Smart instant interaction technology for use in radius range of position	G06F17/30
7	Information processing method, terminal and electronic device	G06F17/30
7	System information security monitoring method and device, computer device and storage medium	G06Q10/10
7	Novel electronic device information collection and selective information orientation distribution method	H04L29/06
7	Interested object information acquisition method and system with mobile terminals coordinating with cloud terminal	H04L29/08
7	Information display method and device	H04L12/58
7	Method and device for feeding back information, and terminal	H04L12/58
7	Method, device and system for storing social networking service (SNS) content	G06F17/30
7	Method and system for automatically ordering dishes and settling account	G06Q30/02
8	Facial action unit strength estimation-based expression analysis method	G06K9/00
8	Spatial data matching method based on machine learning	G06F17/30
8	Method for quickly sorting electroencephalograph signal based on threshold analysis	G06F3/01
8	Intelligent analysis method for components of camera scene image	G06K9/62
8	Method and system for generating radio frequency identification data into tripping origin destination) matrix on the basis of Spark	G06F17/30
8	Target identification method based on geometry reconstruction and multi-scale analysis	G06K9/00
8	Time sequence similarity measurement method based on self-adaptive piecewise statistical approximation	G06F17/30
8	Judgment standard establishment method for identifying red and black time sequence through resistance method	G06K9/62
8	Data flow abnormality detection and multiple verification method based on enhancement-type angle abnormality factor	G06F17/30
8	Wi-Fi-based indoor personnel passive detection method	G06K9/00
9	Systems and methods of network operation and information processing	G06F15/16
9	Systems and methods of network operation and information processing	G06F17/30
9	Systems and methods of network operation and information processing, including engaging users of a public-access network	G06F15/16
9	Systems and methods of network operation and information processing, including use of unique/anonymous identifiers throughout all stages of information processing and delivery	G06F15/16
9	Video broadcast creation method and system, access device and management device	H04L29/06
9	System and method for realizing signaling firewall based on signaling point-free access technology	H04L29/06
9	Network device access authentication method in network video monitoring	H04L29/06
9	System and method for simulating an application for subsequent deployment to a device in communication with a transaction server	G06F7/00
9	Method and system for managing personal information	G06Q30/00
9	Method for monitoring resource utilization of server	H04L12/24
10	Off-line engine system based on software as a service (SaaS) mode	G06F17/30
10	System and method for providing a messaging application program interface	G06F3/00
10	Integrated chaining process for continuous software integration and validation	G06F9/44
10	Method for implementing configuration clause processing of policy-based network in cloud component software system	H04L29/06
10	Method for providing a virtual execution environment on a target computer using a virtual software machine	G06F9/44
10	Frame driving method of application construction platform	G06F9/44
10	Internal control management system capable of applying response type shared application architecture	G06F9/44
10	Computer flexible management construction system and interface storage and explanation method	G06F9/44
10	Method and system for connecting words, phrases, or symbols within the content of transmitted data to URI or IP address	G06F17/30
10	Realization method and system for device control by using HTTP interface	H04L29/08

Labels	nearest10_title	IPC
0	Method and device for obtaining combined image	G06K9/62
0	Digital image visualized management and retrieval for communication network	G06F17/30
0	Terminal device, intelligent mobile phone, and face identification-based authentication method and system	G06K9/00
0	Remote sensing image significance target detection method and system based on Hadoop	G06F17/30
0	Method for detecting over-exposure area in monitoring video image combining multiple features	G06K9/62
0	Method and system for detection of representative area of automatic quasi object type image	G06F17/30
0	Station identification method and device	G06K9/00
0	Method for generating and applying image search code technique	G06F17/30
0	Image matching method and image matching device	G06K9/62
0	Method and system for replacing background images of smart camera in real time	G06F3/0484
1	Distributed storage method and apparatus, and data processing method and apparatus	G06F17/30
1	Massive real-time data synchronization system based on private cloud storage	H04L29/08
1	Distribution and utilization global total data transmission and storage method and device and electronic equipment	G06F17/30
1	Data rapid distribution method and device	H04L29/06
1	Method for acquiring and converting data of metering system of intelligent transformer substation	G06F17/30
1	Method of pre-caching or pre-fetching data utilizing thread lists and multimedia editing systems using such pre-caching	G06F3/06
1	Database normalization storage system and method suitable for use in multi-model satellite testing	G06F17/30
1	Data audits based on timestamp criteria in replicated data bases within digital mobile telecommunication system	G06F17/30
1	Write operation control method, system and device and computer storage medium	G06F3/06
1	Smart storage platform apparatus and method for efficient storage and real-time analysis of big data	G06F3/06
2	Context-based photograph sharing platform for property inspections	G06F17/30
2	Systems and methods for constructing and using models of memorability in computing and communications applications	G06F3/048
2	Systems and methods for constructing and using models of memorability in computing and communications applications	G06F3/048
2	Systems and methods for constructing and using models of memorability in computing and communications applications	G06F3/048
2	Incentives for content consumption	G06Q30/00
2	Method and apparatus for locating errors in documents via database queries, similarity-based information retrieval and modeling the errors for error resolution	G06F17/30
2	Method and system for electronic display of photographs	G06F17/30
2	Three-dimensional web crawler	G06F17/30
2	Intelligent integrating system for crowdsourcing and collaborative intelligence in human- and device- adaptive query-response networks	G06F17/30
2	Methods and systems for annotation of digital information	G06F17/24
3	Intelligent liquid warehousing device	G06K9/00
3	Internet-of-things-based water level monitoring system for water conservancy and hydropower engineering	H04L29/08
3	Touch control input device used for electronic information equipment	G06F3/041
3	Output device and wearable display	G09G5/00
3	Diversified reinforced tablet computer system	G06F1/16
3	Force touch module, preparation method thereof, touch screen panel and display device	G06F3/041
3	Luminous band display type sliding touch bar and display method of touch luminous band	G06F3/041
3	Economical skin-pattern-acquisition and analysis apparatus for access control; systems controlled thereby	G06K9/00
3	Shield machine posture solving device based on VBA writing	G06F9/44
3	Touch-control module, touch screen and intelligent device and stereo touch-control method	G06F3/041
4	Method for understanding questions in question type automatic question-answer systems on basis of rule	G06F17/27
4	Data searching method and system based on semantic analysis	G06F17/27
4	Information searching method based on metadata	G06F17/30
4	Relevancy priority ordering method used for environmental protection regulation retrieval	G06F17/30
4	Information management, retrieval and display system and associated method	G06F17/30
4	Information management, retrieval and display systems and associated methods	G06F7/00
4	Information management, retrieval and display system and associated method	G06F17/30
4	Method of indexing words in handwritten document images using image hash tables	G06F17/30
4	Method for searching pattern matching index	G06F17/30
4	System, method and program product for answering questions using a search engine	G06F17/30
5	Search engine method based on keyword resolution scheduling	G06F17/30
5	Method and system for automatically converting dynamic form page to HTML5 page	G06F17/22
5	Automatic access of electronic information through machine-readable codes on printed documents	G06F12/00
5	Electronic commerce system for updating information	G06F12/00
5	Web service multithreading file uploading system	H04L29/08
5	System and method for creating and posting media lists for purposes of subsequent playback	G06F3/0482
5	System and method for creating and posting media lists for purposes of subsequent playback	G06F15/16
5	System and method for creating and posting media lists for purposes of subsequent playback	G06F15/16
5	Pay per record system and method	H04L29/06
5	Dynamic generation of target files from template files and tracking of the processing of target files	G06F7/00
6	Wired security access control device of financial industry network and access method of wired security access control device	H04L29/06
6	Vehicle identification system and method	G06F17/30
6	Control system	G06F3/16
6	Plug type audio device and signal processing method	G06F3/16
6	Touch display device and touch display method	G06F3/041
6	Method and device for playing audio data in sound card signal input channel in real time	G06F3/16
6	Portal access control system	G06F7/04
6	Method and device for displaying states of ports of switch	H04L12/24
6	Computer control system	G06F3/00
6	Login method and device for user identified by radio frequency	G06F21/00
7	Device, method and equipment for information data interaction for processing information data	G06F17/30
7	Smart instant interaction technology for use in radius range of position	G06F17/30
7	Information processing method, terminal and electronic device	G06F17/30
7	System information security monitoring method and device, computer device and storage medium	G06Q10/10
7	Novel electronic device information collection and selective information orientation distribution method	H04L29/06
7	Interested object information acquisition method and system with mobile terminals coordinating with cloud terminal	H04L29/08
7	Information display method and device	H04L12/58
7	Method and device for feeding back information, and terminal	H04L12/58
7	Method, device and system for storing social networking service (SNS) content	G06F17/30
7	Method and system for automatically ordering dishes and settling account	G06Q30/02
8	Facial action unit strength estimation-based expression analysis method	G06K9/00
8	Spatial data matching method based on machine learning	G06F17/30
8	Method for quickly sorting electroencephalograph signal based on threshold analysis	G06F3/01
8	Intelligent analysis method for components of camera scene image	G06K9/62
8	Method and system for generating radio frequency identification data into tripping origin destination) matrix on the basis of Spark	G06F17/30
8	Target identification method based on geometry reconstruction and multi-scale analysis	G06K9/00
8	Time sequence similarity measurement method based on self-adaptive piecewise statistical approximation	G06F17/30
8	Judgment standard establishment method for identifying red and black time sequence through resistance method	G06K9/62
8	Data flow abnormality detection and multiple verification method based on enhancement-type angle abnormality factor	G06F17/30
8	Wi-Fi-based indoor personnel passive detection method	G06K9/00
9	Systems and methods of network operation and information processing	G06F15/16
9	Systems and methods of network operation and information processing	G06F17/30
9	Systems and methods of network operation and information processing, including engaging users of a public-access network	G06F15/16
9	Systems and methods of network operation and information processing, including use of unique/anonymous identifiers throughout all stages of information processing and delivery	G06F15/16
9	Video broadcast creation method and system, access device and management device	H04L29/06
9	System and method for realizing signaling firewall based on signaling point-free access technology	H04L29/06
9	Network device access authentication method in network video monitoring	H04L29/06
9	System and method for simulating an application for subsequent deployment to a device in communication with a transaction server	G06F7/00
9	Method and system for managing personal information	G06Q30/00
9	Method for monitoring resource utilization of server	H04L12/24
10	Off-line engine system based on software as a service (SaaS) mode	G06F17/30
10	System and method for providing a messaging application program interface	G06F3/00
10	Integrated chaining process for continuous software integration and validation	G06F9/44
10	Method for implementing configuration clause processing of policy-based network in cloud component software system	H04L29/06
10	Method for providing a virtual execution environment on a target computer using a virtual software machine	G06F9/44
10	Frame driving method of application construction platform	G06F9/44
10	Internal control management system capable of applying response type shared application architecture	G06F9/44
10	Computer flexible management construction system and interface storage and explanation method	G06F9/44
10	Method and system for connecting words, phrases, or symbols within the content of transmitted data to URI or IP address	G06F17/30
10	Realization method and system for device control by using HTTP interface	H04L29/08

Source: Created by authors

Abramovitz

(

1986

), “

Catching up, forging ahead, and falling behind

”,

The Journal of Economic History

, Vol.

No.

, pp.

385

406

Agrawal

Gans

and

Goldfarb

(

2018

Prediction Machines: The Simple Economics of Artificial Intelligence

Harvard Business School Press

Arts

Cassiman

and

Gomez

J.C.

(

2017

), “

Text matching to measure patent similarity

”,

Strategic Management Journal

, Vol.

No.

, pp.

Bell

and

Pavitt

(

1993

), “

Technological accumulation and industrial growth: contrasts between developed and developing countries

”,

Industrial and Corporate Change

, Vol.

No.

, pp.

157

210

Biancotti

and

Ciocca

(

2018

), “

Regulating data superpower in the age of AI

”,

Realtime Economic Issues Watch, October 23, 2018

Peterson Institute for International Economics

Cho

D.S.

Kim

D.J.

and

Rhee

D.K.

(

1998

), “

Latecomer strategies: evidence from the semiconductor industry in Japan and Korea

”,

Organization Science

, Vol.

No.

, pp.

489

505

Chorzempa

Triolo

and

Saks

(

2018

), “

China’s social credit system: a mark of progress or a threat to privacy?

”,

Peterson Institute for International Economics

Policy Brief 18-14

Economist

(

2020

), “

Special report: the data economy

”,

The Economist, Feb 22, 2020

London

Fagerberg

and

Godinho

M.M.

(

2005

), “

Innovation and catching-up

”,

The Oxford Handbook of Innovation

Oxford University Press

New York, NY

, pp.

514

543

Fan

(

2006

), “

Catching up through developing innovation capability: evidence from China’s telecomequipment industry

”,

Technovation

, Vol.

No.

, pp.

359

368

Goldfarb

and

Trefler

(

2018

), “

AI and international trade

”,

NBER Working Paper #24254

Cambridge MA

Kashani

E.S.

Radosevic

Kiamehr

and

Gholizadeh

(

2022

), “

The intellectual evolution of the technological catch-up literature: bibliometric analysis

”,

Research Policy

, Vol.

No.

, p.

104538

Kim

(

1998

), “

Crisis construction and organizational learning: capability building in catching-up at Hyundai motor

”,

Organization Science

, Vol.

No.

, pp.

506

521

Lee

(

2013

Schumpeterian Analysis of Economic Catch-up: Knowledge, Path-Creation, and the Middle-Income Trap

Cambridge University Press

London

McInnes

Healy

and

Melville

(

2018

), “

UMAP: uniform manifold approximation and projection for dimension reduction

”,

6, Dec 2018, arXiv preprint arXiv:1802.03426

Mathews

J.A.

(

2006

), “

Dragon multinationals: new players in 21st century globalization

”,

Asia Pacific Journal of Management

, Vol.

No.

, pp.

Miao

Song

Lee

and

Jin

(

2018

), “

Technological catch-up by east Asian firms: trends, issues, and future research agenda

”,

Asia Pacific Journal of Management

, Vol.

No.

, pp.

639

669

Mikolov

Chen

Corrado

and

Dean

(

2013

), “

Efficient estimation of word representations in vector space

”,

In ICLR

Motohashi

(

2020

), “

Science and technology co-evolution in AI: empirical understanding through a linked dataset of scientific articles and patents

”,

RIETI Discussion Paper Series 20-E-010

RIETI

Tokyo Japan

Motohashi

and

Zhu

(

2023

), “

Identifying technology opportunity using dual-attention model and technology-market concordance matrix

”,

Technological Forecasting and Social Change

, Vol.

197

, p.

122916

Motohashi

Koshiba

and

Ikeuchi

(

2019

), “

A method of extracting content information from patent documents and comparison of their characteristics by applicant type by using the vector space model of distributed expressions

”,

NISTEP Discussion Paper No. 175

MEXT

Japan, Tokyo

, (

in Japanese

Nagaoka

Motohashi

and

Goto

(

2010

), “

Patent statistics as an innovation indicator

”, in

Hall

and

Rosenberg

(Eds),

Handbook of the Economics of Innovation

Elsevier Science

North Holland

, Vol.

Park

and

Geum

(

2022

), “

Two-stage technology opportunity discovery for firm-level decision making: GCN-based link-prediction approach

”,

Technological Forecasting and Social Change

, Vol.

183

, p.

121934

Sugawara

Kobayashi

and

Iwasaki

(

2016

), “

On approximately searching for similar word embeddings

”,

Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics

Trajtenberg

(

2018

), “

Artificial intelligence as the next GPT: a Political-Economy perspective

”,

NBER Working Paper #24245

Cambridge MA

Wang

Roijakkers

and

Vanhaverbeke

(

2014

), “

How fast do Chinese firms learn and catch up? Evidence from patent citations

”,

Scientometrics

, Vol.

No.

, pp.

743

761

Wang

Chen

Wang

and

Kuo

(

2019

), “

Evaluating word embedding models: methods and experimental results

”,

APSIPA Transactions on Signal and Information Processing

, Vol.

No.

, p.

e19

Younge

K.A.

and

Kuhn

J.M.

(

2016

Patent-to-Patent Similarity: A Vector Space Model

SSRN

Kim

Lee

and

Kwak

(

2017

), “

Standards as a driving force that influences emerging technological trajectories in the converging world of the internet and things: an investigation of the M2M/IoT patent network

”,

Research Policy

, Vol.

No.

, pp.

1234

1254