We show how a deep neural network can be designed to learn meaningful representations from high-dimensional and heterogeneous categorical features in patent data using self-supervised learning. Based on each firm’s technology portfolio and each patent’s co-classification information, we propose a novel recommendation system for firms seeking new convergence opportunities through representations of convergence items and firms. The results of this work are expected to recommend convergence opportunities in multiple technology fields by considering the target firm’s potential preference. First, we create a technology portfolio consisting of a set of patents owned by each firm. Then, we train a neural network to extract latent representations of firms and technology convergence items. Despite a lack of indicators related to a firm’s latent preference for a convergence item, a self-supervised neural network can capture the similarity with semantic information of firm’s latent preference that is implicitly present in patent’s co-classification information in each firm’s technology portfolio. We then calculate the similarity between the vector of a target firm and convergence items for recommendation. The top N similar convergence items that have the highest scores are recommended as the new convergence items for the target firm. We apply our framework to the dataset of patents granted by the United States Patent and Trademark Office between 2011 and 2015. The results indicate that the recent development in theories and empirical studies of deep representation learning can shed new light on extracting valuable information from the structured part of patent data.
Bibliographical noteFunding Information:
This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIP) (2016R1A2A1A05005270) and (MSIT) (2020R1A2C2005026). Jungpyo Lee is currently working as an AI Research Engineer in Mobigen, Korea.
© 2020, Akadémiai Kiadó, Budapest, Hungary.
All Science Journal Classification (ASJC) codes
- Social Sciences(all)
- Computer Science Applications
- Library and Information Sciences