Open Access • Posted Content
Multitask Learning for Class-Imbalanced Discourse Classification.
TL;DR: This article presents an extensive analysis of sentence-level classification approaches for the News Discourse dataset, one of the largest high-level semantic discourse datasets recently published, and shows that a multitask approach can improve Micro F1-score by 7% over current state-of-the-art benchmarks, due in part to label corrections across tasks, which improve performance for underrepresented classes.
Abstract: Small, class-imbalanced datasets, common in many high-level semantic tasks like discourse analysis, present a particular challenge to current deep-learning architectures. In this work, we perform an extensive analysis of sentence-level classification approaches for the News Discourse dataset, one of the largest high-level semantic discourse datasets recently published. We show that a multitask approach can improve Micro F1-score by 7% over current state-of-the-art benchmarks, due in part to label corrections across tasks, which improve performance for underrepresented classes. We also offer a comparative review of additional techniques proposed to address resource-poor problems in NLP, and show that none of these approaches can improve classification accuracy in such a setting.
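The abstract reports Micro F1 while stressing underrepresented classes, so it may help to see how micro- and macro-averaged F1 differ on imbalanced labels. The following is a minimal sketch on toy data, not the authors' evaluation code; the label names `common` and `rare` and the helper `f1_scores` are hypothetical and chosen only for illustration.

```python
from collections import Counter


def f1_scores(y_true, y_pred):
    """Return (micro_f1, macro_f1) for single-label multiclass predictions."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class p, but it was wrong
            fn[t] += 1  # true class t was missed

    def f1(tp_, fp_, fn_):
        denom = 2 * tp_ + fp_ + fn_
        return 2 * tp_ / denom if denom else 0.0

    # Micro: pool counts over all classes, then compute one F1.
    micro = f1(sum(tp.values()), sum(fp.values()), sum(fn.values()))
    # Macro: compute F1 per class, then take the unweighted mean.
    macro = sum(f1(tp[c], fp[c], fn[c]) for c in labels) / len(labels)
    return micro, macro


# Hypothetical toy data: 8 "common" sentences, 2 "rare" ones,
# and a classifier that always predicts the majority class.
y_true = ["common"] * 8 + ["rare"] * 2
y_pred = ["common"] * 10

micro, macro = f1_scores(y_true, y_pred)
print(f"micro F1 = {micro:.3f}, macro F1 = {macro:.3f}")
```

In this toy case the micro-averaged score stays fairly high even though the rare class is never predicted, while the macro-averaged score drops sharply. On imbalanced data, per-class results can therefore be informative alongside a headline Micro F1.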