Open library

Check the preview of 2nd version of this platform being developed by the open MLCommons taskforce on automation and reproducibility as a free, open-source and technology-agnostic on-prem platform.

TAP-DLND 1.0 : A Corpus for Document Level Novelty Detection

lib:e4ef687f19042497 (v1.0.0)

Authors: Tirthankar Ghosal,Amitra Salam,Swati Tiwari,Asif Ekbal,Pushpak Bhattacharyya
Where published: LREC 2018 5
ArXiv: 1802.06950
Document: PDF DOI

Abstract URL: http://arxiv.org/abs/1802.06950v1

Detecting novelty of an entire document is an Artificial Intelligence (AI) frontier problem that has widespread NLP applications, such as extractive document summarization, tracking development of news events, predicting impact of scholarly articles, etc. Important though the problem is, we are unaware of any benchmark document level data that correctly addresses the evaluation of automatic novelty detection techniques in a classification framework. To bridge this gap, we present here a resource for benchmarking the techniques for document level novelty detection. We create the resource via event-specific crawling of news documents across several domains in a periodic manner. We release the annotated corpus with necessary statistics and show its use with a developed system for the problem in concern.

Relevant initiatives

Related knowledge about this paper

Search on this portal

Reproduced results (crowd-benchmarking and competitions)

Artifact and reproducibility checklists

Common formats for research projects and shared artifacts

Collective Knowledge (organizing research projects based on FAIR principles)

Reproducibility initiatives

Comments

Please log in to add your comments!

If you notice any inapropriate content that should not be here, please report us as soon as possible and we will try to remove it within 48 hours!

TAP-DLND 1.0 : A Corpus for Document Level Novelty Detection

Relevant initiatives Hide

Comments Hide

Relevant initiatives

Comments