Journal Issue10.1002/ASI.V61:4
New event detection and topic tracking in Turkish
404
TL;DR: This work introduces the first large-scale TDT test collection for Turkish, and investigates the NED and TT problems in this language, and demonstrates that the confidence scores of two different similarity measures can be combined in a straightforward manner for higher effectiveness.
read more
Abstract: Topic detection and tracking (TDT) applications aim to organize the temporally ordered stories of a news stream according to the events. Two major problems in TDT are new event detection (NED) and topic tracking (TT). These problems focus on finding the first stories of new events and identifying all subsequent stories on a certain topic defined by a small number of sample stories. In this work, we introduce the first large-scale TDT test collection for Turkish, and investigate the NED and TT problems in this language. We present our test-collection-construction approach, which is inspired by the TDT research initiative. We show that in TDT for Turkish with some similarity measures, a simple word truncation stemming method can compete with a lemmatizer-based stemming approach. Our findings show that contrary to our earlier observations on Turkish information retrieval, in NED word stopping has an impact on effectiveness. We demonstrate that the confidence scores of two different similarity measures can be combined in a straightforward manner for higher effectiveness. The influence of several similarity measures on effectiveness also is investigated. We show that it is possible to deploy TT applications in Turkish that can be used in operational settings. © 2010 Wiley Periodicals, Inc.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Book
Looking for Information: A Survey of Research on Information Seeking, Needs and Behavior
Donald O. Case,Lisa M. Given +1 more
- 19 Apr 2012
TL;DR: In this paper, the authors introduce concepts relevant to Information Behavior Models, Paradigms, and Theories in the study of Information Behavior Methods for Studying Information Behavior Research Results and Reflections.
1.5K
Text Classification Algorithms: A Survey
Kamran Kowsari,Kiana Jafari Meimandi,Mojtaba Heidarysafa,Sanjana Mendu,Laura E. Barnes,Donald E. Brown +5 more
TL;DR: A brief overview of text classification algorithms is discussed in this article, where different text feature extractions, dimensionality reduction methods, existing algorithms and techniques, and evaluations methods are discussed, and the limitations of each technique and their application in real-world problems are discussed.
1.2K
Participatory design of a health informatics system for rural health practitioners and disadvantaged women
Mia Liza A. Lustria,Michelle M. Kazmer,Robert L. Glueckauf,Robert P. Hawkins,Ebrahim Randeree,Ivee B. Rosario,Casey McLaughlin,Sarah Redmond +7 more
TL;DR: Results are reported of formative research conducted as part of a larger study focused on the participatory development of an electronic reminder system for breast cancer screening and insights gained from focus groups with rural patients and clinicians are discussed.
Transactive Memory Systems 1985–2010: An Integrative Framework of Key Dimensions, Antecedents, and Consequences
Yuqing Ren,Linda Argote +1 more
TL;DR: In this article, the authors reviewed 76 papers that examined transactive memory systems and summarized the findings in an integrative framework to show the antecedents and consequences of TMS.
471
Transactive Memory Systems: Current Issues and Future Research Directions
Kyle Lewis,Benjamin Herndon +1 more
TL;DR: This essay describes issues concerning how researchers define and conceptualize TMSs, interpret the relationship between TMS measures and the TMS concept, and attend to the role of task type in TMS research.
462
References
Machine learning in automated text categorization
TL;DR: This survey discusses the main approaches to text categorization that fall within the machine learning paradigm and discusses in detail issues pertaining to three different problems, namely, document representation, classifier construction, and classifier evaluation.
A very brief measure of the Big-Five personality domains
TL;DR: In this paper, a 10-item measure of the Big-Five personality dimensions is proposed for situations where very short measures are needed, personality is not the primary topic of interest, or researchers can tolerate the somewhat diminished psychometric properties associated with very brief measures.
8.3K
The dynamics of innovation: from national systems and "Mode" 2 to a triple helix of university-industry-government relations.
Henry Etzkowitz,Loet Leydesdorff +1 more
TL;DR: In this article, the Triple Helix of university-industry-government relations is compared with alternative models for explaining the current research system in its social contexts, and the authors suggest that university research may function increasingly as a locus in the "laboratory" of knowledge-intensive network transitions.
7.8K
Exploring internal stickiness: Impediments to the transfer of best practice within the firm
TL;DR: In this article, the authors analyze the internal stickiness of knowledge transfer and test the resulting model using canonical correlation analysis of a data set consisting of 271 observations of 122 best-practice transfers in eight companies.
7.6K
•Book
Social Network Analysis: A Handbook
John Scott
- 30 Dec 1991
TL;DR: Networks and Relations The Development of Social Network Analysis Handling Relational Data Lines, Direction and Density Centrality and Centralization Components, Cores, and Cliques Positions, Roles and Clusters Dimensions and Displays Appendix Social Network Packages
7.5K
Related Papers (5)
Yiming Yang,Tom Pierce,Jaime G. Carbonell +2 more
- 01 Aug 1998
Chih-Ping Wei,Pao-Feng Wu,Yen-Hsien Lee +2 more
- 01 Jan 2004