Concepedia

Publication | Open Access

Singling Out Individual Inventors from Patent Data

32

Citations

7

References

2011

Year

TLDR

Recent studies aim to identify individual inventors from patent data using heuristics on names and other disclosed information. This paper proposes a methodology to identify individual inventors in European Patent Office applications. The method follows a three‑step process: parsing to clean inventor fields, matching to cluster similar names, and filtering with additional data and scoring to isolate distinct inventors. Applying the algorithms to a large set of EPO inventors yields figures illustrating the method’s performance.

Abstract

An increasing number of studies have sprung up in recent years seeking to identify individual inventors from patent data. Different heuristics have been suggested to use their names and other information disclosed in patent documents in order to find out, “who is who,” in patents. This paper contributes to this literature by setting forth a methodology to identify them using patents applied to the European Patent Office (EPO hereafter). As in the large part of this literature, we basically follow a three-steps procedure: (1) the parsing stage, aimed at reducing the noise in the inventor’s name and other fields of the patent; (2) the matching stage, where name matching algorithms are used to group possible similar names; (3) the filtering stage, where additional information and different scoring schemes are used to filter out these potential same inventors. The paper includes some figures resulting of applying the algorithms to the set of European inventors applying to the EPO for a large period of time.

References

YearCitations

Page 1