Salience Rank: Efficient Keyphrase Extraction with Topic Modeling

lib:396c637b49fe9d74 (v1.0.0)

Authors: Nedelina Teneva, Weiwei Cheng
Where published: ACL 2017 7

Abstract URL: https://www.aclweb.org/anthology/P17-2084/

Topical PageRank (TPR) uses the latent topic distribution inferred by Latent Dirichlet Allocation (LDA) to rank noun phrases extracted from documents. The ranking procedure consists of running PageRank K times, where K is the number of topics used in the LDA model. In this paper, we propose a modification of TPR, called Salience Rank. Salience Rank only needs to run PageRank once and extracts comparable or better keyphrases on benchmark datasets. In addition to its quality and efficiency benefits, our method has the flexibility to extract keyphrases with varying tradeoffs between topic specificity and corpus specificity.
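
The contrast the abstract draws (K PageRank runs for TPR versus a single run for Salience Rank) can be illustrated with a minimal Python sketch. Everything below is an assumption made for illustration and is not taken from the paper: the function names, the dictionary-based word graph, the per-topic teleport vectors, and in particular the linear mix of topic specificity and corpus specificity controlled by `alpha`. It also assumes a symmetric co-occurrence graph with no isolated words and strictly positive teleport weights.

```python
def personalized_pagerank(graph, teleport, damping=0.85, iters=100, tol=1e-10):
    """Power iteration on a weighted word graph.

    graph: dict word -> dict neighbor -> weight (assumed symmetric, no isolated words).
    teleport: dict word -> positive weight; normalized here into a jump distribution.
    """
    nodes = list(graph)
    total = sum(teleport[w] for w in nodes)
    prior = {w: teleport[w] / total for w in nodes}
    out_weight = {w: sum(graph[w].values()) for w in nodes}
    rank = dict(prior)
    for _ in range(iters):
        new = {}
        for w in nodes:
            # Mass flowing into w from each neighbor u, split by u's edge weights.
            incoming = sum(rank[u] * graph[u][w] / out_weight[u] for u in graph[w])
            new[w] = (1 - damping) * prior[w] + damping * incoming
        delta = sum(abs(new[w] - rank[w]) for w in nodes)
        rank = new
        if delta < tol:
            break
    return rank


def topical_pagerank(graph, word_given_topic, topic_given_doc):
    """TPR-style scoring: one PageRank run per topic, i.e. K runs in total."""
    scores = {w: 0.0 for w in graph}
    for topic_weight, teleport in zip(topic_given_doc, word_given_topic):
        ranks = personalized_pagerank(graph, teleport)
        for w in graph:
            scores[w] += topic_weight * ranks[w]
    return scores


def salience_rank(graph, topic_specificity, corpus_specificity, alpha=0.5):
    """Single PageRank run with one combined teleport vector.

    The linear mix below is a hypothetical stand-in for a word salience score;
    alpha trades topic specificity against corpus specificity.
    """
    teleport = {
        w: (1 - alpha) * topic_specificity[w] + alpha * corpus_specificity[w]
        for w in graph
    }
    return personalized_pagerank(graph, teleport)


# Toy example: three words that all co-occur with each other.
graph = {
    "a": {"b": 1.0, "c": 1.0},
    "b": {"a": 1.0, "c": 1.0},
    "c": {"a": 1.0, "b": 1.0},
}
word_scores = salience_rank(
    graph,
    topic_specificity={"a": 3.0, "b": 1.0, "c": 1.0},
    corpus_specificity={"a": 1.0, "b": 1.0, "c": 1.0},
    alpha=0.5,
)
```

In this sketch the cost difference is visible in the structure: `topical_pagerank` calls `personalized_pagerank` once per topic, while `salience_rank` calls it once regardless of the number of topics. Scoring candidate noun phrases (for example by summing the scores of their words) is left out, since the abstract does not specify that step.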
