Publication | Open Access
Singling Out Individual Inventors from Patent Data
Citations: 32 | References: 7 | Year: 2011
Recent studies aim to identify individual inventors from patent data using heuristics on names and other disclosed information. This paper proposes a methodology to identify individual inventors in European Patent Office applications. The method follows a three‑step process: parsing to clean inventor fields, matching to cluster similar names, and filtering with additional data and scoring to isolate distinct inventors. Applying the algorithms to a large set of EPO inventors yields figures illustrating the method’s performance.
A growing number of studies in recent years have sought to identify individual inventors from patent data. Various heuristics have been proposed that use inventors’ names and other information disclosed in patent documents to determine “who is who” in patents. This paper contributes to this literature by setting out a methodology to identify inventors using patent applications filed at the European Patent Office (EPO hereafter). As in much of this literature, we follow a three-step procedure: (1) the parsing stage, aimed at reducing noise in the inventor’s name and other fields of the patent; (2) the matching stage, where name-matching algorithms are used to group potentially similar names; (3) the filtering stage, where additional information and different scoring schemes are used to decide which of these candidate matches refer to the same inventor. The paper includes some figures resulting from applying the algorithms to the set of European inventors applying to the EPO over a long period of time.
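The three stages can be illustrated with a minimal Python sketch. It is not the paper's implementation: the similarity measure (`difflib.SequenceMatcher`), the 0.9 threshold, the record fields (`city`, `applicant`, `ipc`), the one-point-per-shared-attribute scoring, and the sample records are all assumptions chosen for illustration.

```python
import re
import unicodedata
from difflib import SequenceMatcher
from itertools import combinations


def parse_name(raw):
    """Parsing stage: remove accents, punctuation, and case differences."""
    text = unicodedata.normalize("NFKD", raw)
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    text = re.sub(r"[^A-Za-z ]+", " ", text).upper()
    return " ".join(text.split())


def candidate_pairs(records, threshold=0.9):
    """Matching stage: pair records whose parsed names are similar enough.

    The threshold is an assumed value, not one taken from the paper.
    """
    parsed = [parse_name(r["name"]) for r in records]
    pairs = []
    for i, j in combinations(range(len(records)), 2):
        if SequenceMatcher(None, parsed[i], parsed[j]).ratio() >= threshold:
            pairs.append((i, j))
    return pairs


def score_pair(a, b):
    """Filtering stage: one point per shared attribute (hypothetical scheme)."""
    score = 0
    score += a["city"] == b["city"]
    score += a["applicant"] == b["applicant"]
    score += a["ipc"][:4] == b["ipc"][:4]  # same IPC subclass
    return score


def same_inventor_pairs(records, min_score=2):
    """Keep only the name-matched pairs with enough supporting evidence."""
    return [
        (i, j)
        for i, j in candidate_pairs(records)
        if score_pair(records[i], records[j]) >= min_score
    ]


# Made-up records for illustration only.
records = [
    {"name": "Müller, Hans", "city": "Munich", "applicant": "Applicant A", "ipc": "H04L 9/00"},
    {"name": "MULLER HANS", "city": "Munich", "applicant": "Applicant A", "ipc": "H04L 12/28"},
    {"name": "Muller, Hans", "city": "Hamburg", "applicant": "Applicant B", "ipc": "B64C 1/00"},
    {"name": "Dupont, Marie", "city": "Paris", "applicant": "Applicant C", "ipc": "A61K 31/00"},
]

print(same_inventor_pairs(records))  # [(0, 1)]
```

In this toy run all three "Muller Hans" records are grouped by the matching stage, but only the first two share a city, an applicant, and an IPC subclass, so the filtering stage keeps just that pair. Comparing every pair of records is quadratic and would not scale to a full set of EPO inventors; a real pipeline would need some form of blocking before matching.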