TY - JOUR
T1 - Leveraging label hierarchy using transfer and multi-task learning
T2 - A case study on patent classification
AU - Aroyehun, Segun Taofeek
AU - Angel, Jason
AU - Majumder, Navonil
AU - Gelbukh, Alexander
AU - Hussain, Amir
N1 - Publisher Copyright:
© 2021 Elsevier B.V.
PY - 2021/11/13
Y1 - 2021/11/13
N2 - When labels are organized into a meaningful taxonomy, the parent-child relationship between labels at different levels can give the classifier additional information not deducible from the data alone, especially with limited training data. As a case study, we illustrate this effect on the task of patent classification—the task of categorizing patent documents based on their technical content. Existing approaches do not take this additional information into consideration. Experiments on two patent classification datasets, WIPO-alpha and USPTO-2M, show that our regularized Gated Recurrent Unit (GRU) architecture already gives a performance improvement, with a micro-averaged precision score using the top prediction of 0.5191 and 0.5740 on the two datasets, respectively. However, knowledge transfer along the label hierarchy gives a further significant improvement on WIPO-alpha, raising the score to 0.5376, and a small improvement on USPTO-2M, to 0.5743. Our analyses reveal that incorporating label information improves performance on classes with fewer examples and makes the model robust to errors that result from predicting closely related labels.
AB - When labels are organized into a meaningful taxonomy, the parent-child relationship between labels at different levels can give the classifier additional information not deducible from the data alone, especially with limited training data. As a case study, we illustrate this effect on the task of patent classification—the task of categorizing patent documents based on their technical content. Existing approaches do not take this additional information into consideration. Experiments on two patent classification datasets, WIPO-alpha and USPTO-2M, show that our regularized Gated Recurrent Unit (GRU) architecture already gives a performance improvement, with a micro-averaged precision score using the top prediction of 0.5191 and 0.5740 on the two datasets, respectively. However, knowledge transfer along the label hierarchy gives a further significant improvement on WIPO-alpha, raising the score to 0.5376, and a small improvement on USPTO-2M, to 0.5743. Our analyses reveal that incorporating label information improves performance on classes with fewer examples and makes the model robust to errors that result from predicting closely related labels.
KW - Machine learning
KW - Multi-task learning
KW - Natural language processing
KW - Neural networks
KW - Patent classification
KW - Transfer learning
UR - http://www.scopus.com/inward/record.url?scp=85115652725&partnerID=8YFLogxK
U2 - 10.1016/j.neucom.2021.07.057
DO - 10.1016/j.neucom.2021.07.057
M3 - Article
AN - SCOPUS:85115652725
SN - 0925-2312
VL - 464
SP - 421
EP - 431
JO - Neurocomputing
JF - Neurocomputing
ER -