A Study of Query Reformulation for Patent Prior Art Search with Partial Patent Applications

Mohamed Reda Bouadjenek (1), Scott Sanner (2), Gabriela Ferraro (3)
(1) ZENITH - Scientific Data Management
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier, CRISAM - Inria Sophia Antipolis - Méditerranée
Abstract: Patents are used by legal entities to protect their inventions and represent a multi-billion dollar industry of licensing and litigation. In 2014, 326,033 patent applications were approved in the US alone, a number that has doubled in the past 15 years and that makes prior art search a daunting but necessary task in the patent application process. In this work, we investigate the efficacy of prior art search strategies from the perspective of the inventor who wishes to assess the patentability of their ideas before writing a full application. While much of the literature inspired by the evaluation framework of the CLEF-IP competition has aimed to assist patent examiners in assessing prior art for complete patent applications, less of this work has focused on patent search with queries representing partial applications. In the (partial) patent search setting, a query is often much longer than in other standard IR tasks; e.g., the description section may contain hundreds or even thousands of words. While the length of such queries may suggest query reduction strategies to remove irrelevant terms, the intentional obfuscation and general language used in patents suggest that it may help to expand queries with additional relevant terms. To assess the trade-offs among these pre-application prior art search strategies, we comparatively evaluate a variety of partial application search and query reformulation methods. Among numerous findings, querying with a full description, perhaps in conjunction with generic (non-patent-specific) query reduction methods, is recommended for best performance. However, we also find that querying with an abstract represents the best trade-off between writing effort and retrieval efficacy (i.e., querying with the description section leads to only marginal improvements), and that for such relatively short queries, generic query expansion methods help.
Document type:
Conference paper
ICAIL: International Conference on Artificial Intelligence and Law, Jun 2015, San Diego, United States. 2015, ICAIL'2015: 15th International Conference on Artificial Intelligence and Law

https://hal-lirmm.ccsd.cnrs.fr/lirmm-01134828
Contributor: Mohamed Reda Bouadjenek
Submitted on: Thursday, May 7, 2015 - 11:49:29
Last modified on: Thursday, January 11, 2018 - 17:01:51

File

ICAIL2015.pdf
Files produced by the author(s)

License

Copyright (All rights reserved)

Identifiers

  • HAL Id: lirmm-01134828, version 3

Citation

Mohamed Reda Bouadjenek, Scott Sanner, Gabriela Ferraro. A Study of Query Reformulation for Patent Prior Art Search with Partial Patent Applications. ICAIL: International Conference on Artificial Intelligence and Law, Jun 2015, San Diego, United States. 2015, ICAIL'2015: 15th International Conference on Artificial Intelligence and Law. 〈lirmm-01134828〉
