Top 690 papers published in the topic of Categorization in 2017

Showing papers on "Categorization published in 2017"

Journal Article•10.1016/J.INFFUS.2017.02.007•

Multi-view learning overview

[...]

Jing Zhao¹, Xijiong Xie¹, Xin Xu², Shiliang Sun¹•Institutions (2)

East China Normal University¹, National University of Defense Technology²

01 Nov 2017-Information Fusion

TL;DR: This overview reviews theoretical underpinnings of multi-view learning and attempts to identify promising venues and point out some specific challenges which can hopefully promote further research in this rapidly developing field.

...read moreread less

991 citations

Journal Article•10.1371/JOURNAL.PONE.0181142•

"What is relevant in a text document?": An interpretable machine learning approach

[...]

Leila Arras¹, Franziska Horn², Grégoire Montavon², Klaus-Robert Müller², Klaus-Robert Müller³, Klaus-Robert Müller⁴, Wojciech Samek¹ - Show less +3 more•Institutions (4)

Heinrich Hertz Institute¹, Technical University of Berlin², Korea University³, Max Planck Society⁴

11 Aug 2017-PLOS ONE

TL;DR: A measure of model explanatory power is introduced and it is shown that, although the SVM and CNN models perform similarly in terms of classification accuracy, the latter exhibits a higher level of explainability which makes it more comprehensible for humans and potentially more useful for other applications.

...read moreread less

Abstract: Text documents can be described by a number of abstract concepts such as semantic category, writing style, or sentiment. Machine learning (ML) models have been trained to automatically map documents to these abstract concepts, allowing to annotate very large text collections, more than could be processed by a human in a lifetime. Besides predicting the text’s category very accurately, it is also highly desirable to understand how and why the categorization process takes place. In this paper, we demonstrate that such understanding can be achieved by tracing the classification decision back to individual words using layer-wise relevance propagation (LRP), a recently developed technique for explaining predictions of complex non-linear classifiers. We train two word-based ML models, a convolutional neural network (CNN) and a bag-of-words SVM classifier, on a topic categorization task and adapt the LRP method to decompose the predictions of these models onto words. Resulting scores indicate how much individual words contribute to the overall classification decision. This enables one to distill relevant information from text documents without an explicit semantic information extraction step. We further use the word-wise relevance scores for generating novel vector-based document representations which capture semantic information. Based on these document vectors, we introduce a measure of model explanatory power and show that, although the SVM and CNN models perform similarly in terms of classification accuracy, the latter exhibits a higher level of explainability which makes it more comprehensible for humans and potentially more useful for other applications.

...read moreread less

363 citations

Journal Article•10.1073/PNAS.1619666114•

Color naming across languages reflects color use

[...]

Edward Gibson¹, Richard Futrell¹, Julian Jara-Ettinger¹, Kyle Mahowald¹, Leon Bergen¹, Sivalogeswaran Ratnasingam², Mitchell Gibson¹, Steven Piantadosi³, Bevil R. Conway² - Show less +5 more•Institutions (3)

Massachusetts Institute of Technology¹, National Institutes of Health², University of Rochester³

03 Oct 2017-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: It is suggested that the cross-linguistic similarity in color-naming efficiency reflects colors of universal usefulness and provides an account of a principle (color use) that governs how color categories come about.

...read moreread less

Abstract: What determines how languages categorize colors? We analyzed results of the World Color Survey (WCS) of 110 languages to show that despite gross differences across languages, communication of chromatic chips is always better for warm colors (yellows/reds) than cool colors (blues/greens). We present an analysis of color statistics in a large databank of natural images curated by human observers for salient objects and show that objects tend to have warm rather than cool colors. These results suggest that the cross-linguistic similarity in color-naming efficiency reflects colors of universal usefulness and provide an account of a principle (color use) that governs how color categories come about. We show that potential methodological issues with the WCS do not corrupt information-theoretic analyses, by collecting original data using two extreme versions of the color-naming task, in three groups: the Tsimane', a remote Amazonian hunter-gatherer isolate; Bolivian-Spanish speakers; and English speakers. These data also enabled us to test another prediction of the color-usefulness hypothesis: that differences in color categorization between languages are caused by differences in overall usefulness of color to a culture. In support, we found that color naming among Tsimane' had relatively low communicative efficiency, and the Tsimane' were less likely to use color terms when describing familiar objects. Color-naming among Tsimane' was boosted when naming artificially colored objects compared with natural objects, suggesting that industrialization promotes color usefulness.

...read moreread less

300 citations

Proceedings Article•10.1109/IJCNN.2017.7966144•

Ensemble application of convolutional and recurrent neural networks for multi-label text categorization

[...]

Guibin Chen¹, Deheng Ye¹, Zhenchang Xing², Jieshan Chen³, Erik Cambria¹ - Show less +1 more•Institutions (3)

Nanyang Technological University¹, Australian National University², Sun Yat-sen University³

14 May 2017

TL;DR: This paper proposes an ensemble application of convolutional and recurrent neural networks to capture both the global and the local textual semantics and to model high-order label correlations while having a tractable computational complexity.

...read moreread less

Abstract: Text categorization, or text classification, is one of key tasks for representing the semantic information of documents. Multi-label text categorization is finer-grained approach to text categorization which consists of assigning multiple target labels to documents. It is more challenging compared to the task of multi-class text categorization due to the exponential growth of label combinations. Existing approaches to multi-label text categorization fall short to extract local semantic information and to model label correlations. In this paper, we propose an ensemble application of convolutional and recurrent neural networks to capture both the global and the local textual semantics and to model high-order label correlations while having a tractable computational complexity. Extensive experiments show that our approach achieves the state-of-the-art performance when the CNN-RNN model is trained using a large-sized dataset.

...read moreread less

296 citations

Journal Article•10.1080/1359432X.2017.1337747•

Hiding behind a mask? Cultural intelligence, knowledge hiding, and individual and team creativity

[...]

Sabina Bogilović¹, Matej Černe¹, Miha Škerlavaj²•Institutions (2)

University of Ljubljana¹, BI Norwegian Business School²

14 Jun 2017-European Journal of Work and Organizational Psychology

TL;DR: This paper proposed that cross-cultural interactions among individuals from different national backgrounds can act as a salient contingency in the relationship between knowledge hiding and creativity (individual and team) and further suggest, based on the social categorization theory (e.g., the categorization process of "us" against "them" based on national differences), that cultural intelligence enhances the likelihood of high-quality social exchanges between culturally diverse individuals and therefore remedies the otherwise negative relationship between individual knowledge hiding, and individual creativity.

...read moreread less

Abstract: Culturally diverse colleagues can be valuable sources for stimulating creativity at work, yet only if they decide to share their knowledge. Drawing on the social exchange theory, we propose that cross-cultural interactions among individuals from different national backgrounds can act as a salient contingency in the relationship between knowledge hiding and creativity (individual and team). We further suggest, based on the social categorization theory (e.g., the categorization process of “us” against “them” based on national differences), that cultural intelligence enhances the likelihood of high-quality social exchanges between culturally diverse individuals and, therefore, remedies the otherwise negative relationship between individual knowledge hiding and individual creativity. Two studies using field and experimental data offer consistent support for this argument. First, a field study of 621 employees nested among 70 teams revealed that individual knowledge hiding is negatively related to indi...

...read moreread less

262 citations

Journal Article•10.3758/S13428-016-0743-Z•

Sentiment Analysis and Social Cognition Engine (SEANCE): An automatic tool for sentiment, social cognition, and social-order analysis

[...]

Scott A. Crossley¹, Kristopher Kyle¹, Danielle S. McNamara²•Institutions (2)

Georgia State University¹, Arizona State University²

01 Jun 2017-Behavior Research Methods

TL;DR: This study introduces the Sentiment Analysis and Cognition Engine (SEANCE), a freely available text analysis tool that is easy to use, works on most operating systems, is housed on a user’s hard drive, allows for batch processing of text files, and includes negation and part-of-speech (POS) features.

...read moreread less

Abstract: This study introduces the Sentiment Analysis and Cognition Engine (SEANCE), a freely available text analysis tool that is easy to use, works on most operating systems (Windows, Mac, Linux), is housed on a user’s hard drive (as compared to being accessed via an Internet interface), allows for batch processing of text files, includes negation and part-of-speech (POS) features, and reports on thousands of lexical categories and 20 component scores related to sentiment, social cognition, and social order. In the study, we validated SEANCE by investigating whether its indices and related component scores can be used to classify positive and negative reviews in two well-known sentiment analysis test corpora. We contrasted the results of SEANCE with those from Linguistic Inquiry and Word Count (LIWC), a similar tool that is popular in sentiment analysis, but is pay-to-use and does not include negation or POS features. The results demonstrated that both the SEANCE indices and component scores outperformed LIWC on the categorization tasks.

...read moreread less

235 citations

Journal Article•10.1016/J.TICS.2017.04.004•

The Origins of Social Categorization

[...]

Zoe Liberman¹, Amanda L. Woodward², Katherine D. Kinzler³•Institutions (3)

University of California, Santa Barbara¹, University of Chicago², Cornell University³

01 Jul 2017-Trends in Cognitive Sciences

TL;DR: The foundation of the human ability to form useful social categories is in place in infancy: social categories guide the inferences infants make about the shared characteristics and social relationships of other people.

...read moreread less

213 citations

Journal Article•10.3233/978-1-61499-753-5-246•

Medical Text Classification Using Convolutional Neural Networks.

[...]

Mark Hughes¹, Irene Li¹, Spyros Kotoulas¹, Toyotaro Suzumura•Institutions (1)

IBM¹

01 Jan 2017-Studies in health technology and informatics

TL;DR: This paper used deep convolutional neural networks to represent complex features and trained the network on a dataset providing a broad categorization of health information, which outperformed several approaches widely used in natural language processing tasks by about 15%.

...read moreread less

Abstract: We present an approach to automatically classify clinical text at a sentence level. We are using deep convolutional neural networks to represent complex features. We train the network on a dataset providing a broad categorization of health information. Through a detailed evaluation, we demonstrate that our method outperforms several approaches widely used in natural language processing tasks by about 15%.

...read moreread less

173 citations

Journal Article•10.1073/PNAS.1621073114•

How language shapes the cultural inheritance of categories

[...]

Susan A. Gelman¹, Steven O. Roberts¹•Institutions (1)

University of Michigan¹

25 Jul 2017-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: It is proposed that labels and generics each assume two key principles: norms and essentialism, which permits transmission of category information with great fidelity and invites innovation by means of an open-ended, placeholder structure.

...read moreread less

Abstract: It is widely recognized that language plays a key role in the transmission of human culture, but relatively little is known about the mechanisms by which language simultaneously encourages both cultural stability and cultural innovation. This paper examines this issue by focusing on the use of language to transmit categories, focusing on two universal devices: labels (e.g., shark, woman) and generics (e.g., "sharks attack swimmers"; "women are nurturing"). We propose that labels and generics each assume two key principles: norms and essentialism. The normative assumption permits transmission of category information with great fidelity, whereas essentialism invites innovation by means of an open-ended, placeholder structure. Additionally, we sketch out how labels and generics aid in conceptual alignment and the progressive "looping" between categories and cultural practices. In this way, human language is a technology that enhances and expands the categorization capacities that we share with other animals.

...read moreread less

166 citations

Book Chapter•10.1016/BS.AESP.2016.10.001•

Intergroup Perception and Cognition: An Integrative Framework for Understanding the Causes and Consequences of Social Categorization

[...]

Kerry Kawakami¹, David M. Amodio², Kurt Hugenberg³•Institutions (3)

York University¹, New York University², Miami University³

01 Jan 2017-Advances in Experimental Social Psychology

TL;DR: The authors provide a framework to understand and synthesize the processes of person construal with the processes involved in intergroup relations, and explore the implications of the activation of these constructs for a range of social judgments including emotion identification, empathy, and intergroup behaviors.

...read moreread less

Abstract: The primary aim of this chapter is to provide a framework to understand and synthesize the processes of person construal—early perceptions that lead to initial ingroup/outgroup categorizations—with the processes involved in intergroup relations. To this end, we review research examining the initial perception and categorization of ingroup and outgroup members and its downstream consequences. We first discuss bottom-up processes in person construal based on visual features (e.g., facial prototypicality and bodily cues), and then discuss how top-down factors (e.g., beliefs, stereotypes) may influence these processes. Next, we examine how the initial categorization of targets as ingroup or outgroup members influences identification, stereotyping, and group-based evaluations, and the relations between these constructs. We also explore the implications of the activation of these constructs for a range of social judgments including emotion identification, empathy, and intergroup behaviors. Finally, we describe a variety of well established and more recent strategies to reduce intergroup bias that target the activation of category-based knowledge, including intergroup contact, approach orientations, evaluative conditioning, and perspective taking.

...read moreread less

155 citations

Posted Content•

Medical Text Classification using Convolutional Neural Networks

[...]

Mark Hughes¹, Irene Li¹, Spyros Kotoulas¹, Toyotaro Suzumura•Institutions (1)

IBM¹

22 Apr 2017-arXiv: Computation and Language

TL;DR: An approach to automatically classify clinical text at a sentence level using deep convolutional neural networks to represent complex features and outperforms several approaches widely used in natural language processing tasks by about 15%.

...read moreread less

Journal Article•10.2196/PUBLICHEALTH.7157•

What Are People Tweeting About Zika? An Exploratory Study Concerning Its Symptoms, Treatment, Transmission, and Prevention

[...]

Michele Miller¹, Tanvi Banerjee¹, RoopTeja Muppalla¹, William L. Romine¹, Amit P. Sheth¹ - Show less +1 more•Institutions (1)

Wright State University¹

19 Jun 2017-JMIR public health and surveillance

TL;DR: It is demonstrated how categories of discussion on Twitter about an epidemic can be discovered so that public health officials can understand specific societal concerns within the disease-specific categories.

...read moreread less

Abstract: Background: In order to harness what people are tweeting about Zika, there needs to be a computational framework that leverages machine learning techniques to recognize relevant Zika tweets and, further, categorize these into disease-specific categories to address specific societal concerns related to the prevention, transmission, symptoms, and treatment of Zika virus. Objective: The purpose of this study was to determine the relevancy of the tweets and what people were tweeting about the 4 disease characteristics of Zika: symptoms, transmission, prevention, and treatment. Methods: A combination of natural language processing and machine learning techniques was used to determine what people were tweeting about Zika. Specifically, a two-stage classifier system was built to find relevant tweets about Zika, and then the tweets were categorized into 4 disease categories. Tweets in each disease category were then examined using latent Dirichlet allocation (LDA) to determine the 5 main tweet topics for each disease characteristic. Results: Over 4 months, 1,234,605 tweets were collected. The number of tweets by males and females was similar (28.47% [351,453/1,234,605] and 23.02% [284,207/1,234,605], respectively). The classifier performed well on the training and test data for relevancy (F1 score=0.87 and 0.99, respectively) and disease characteristics (F1 score=0.79 and 0.90, respectively). Five topics for each category were found and discussed, with a focus on the symptoms category. Conclusions: We demonstrate how categories of discussion on Twitter about an epidemic can be discovered so that public health officials can understand specific societal concerns within the disease-specific categories. Our two-stage classifier was able to identify relevant tweets to enable more specific analysis, including the specific aspects of Zika that were being discussed as well as misinformation being expressed. Future studies can capture sentiments and opinions on epidemic outbreaks like Zika virus in real time, which will likely inform efforts to educate the public at large. [JMIR Public Health Surveill 2017;3(2):e38]

...read moreread less

Journal Article•10.1109/TCYB.2016.2526058•

Task Sensitive Feature Exploration and Learning for Multitask Graph Classification

[...]

Shirui Pan¹, Jia Wu¹, Xingquan Zhu², Guodong Long¹, Chengqi Zhang¹ - Show less +1 more•Institutions (2)

University of Technology, Sydney¹, Florida Atlantic University²

01 Mar 2017-IEEE Transactions on Systems, Man, and Cybernetics

TL;DR: Results confirm that the proposed task sensitive feature exploration and learning algorithm can be used to explicitly capture task correlations and uniqueness in the feature space, and explicitly answer what are shared between tasks and what is the uniqueness of a specific task.

...read moreread less

Abstract: Multitask learning (MTL) is commonly used for jointly optimizing multiple learning tasks. To date, all existing MTL methods have been designed for tasks with feature-vector represented instances, but cannot be applied to structure data, such as graphs. More importantly, when carrying out MTL, existing methods mainly focus on exploring overall commonality or disparity between tasks for learning, but cannot explicitly capture task relationships in the feature space, so they are unable to answer important questions, such as what exactly is shared between tasks and what is the uniqueness of one task differing from others? In this paper, we formulate a new multitask graph learning problem, and propose a task sensitive feature exploration and learning algorithm for multitask graph classification. Because graphs do not have features available, we advocate a task sensitive feature exploration and learning paradigm to jointly discover discriminative subgraph features across different tasks. In addition, a feature learning process is carried out to categorize each subgraph feature into one of three categories: 1) common feature; 2) task auxiliary feature; and 3) task specific feature, indicating whether the feature is shared by all tasks, by a subset of tasks, or by only one specific task, respectively. The feature learning and the multiple task learning are iteratively optimized to form a multitask graph classification model with a global optimization goal. Experiments on real-world functional brain analysis and chemical compound categorization demonstrate the algorithm’s performance. Results confirm that our method can be used to explicitly capture task correlations and uniqueness in the feature space, and explicitly answer what are shared between tasks and what is the uniqueness of a specific task.

...read moreread less

Journal Article•10.5465/AMJ.2015.1221•

Playing cat and mouse : contests over regulatory categorization of dietary supplements in the U.S.

[...]

Pinar Ozcan¹, Kerem Gurses²•Institutions (2)

University of Warwick¹, Libera Università Internazionale degli Studi Sociali Guido Carli²

27 Nov 2017-Academy of Management Journal

TL;DR: In this paper, the authors set out to uncover this critical prgression in regulatory categorization, which can be a matter of life and death to firms, as it sets legal limitations on the production and sales of their product.

...read moreread less

Abstract: Regulatory categorization can be a matter of life and death to firms, as it sets legal limitations on the production and sales of their product. In this paper we set out to uncover this critical pr...

...read moreread less

Journal Article•10.15388/INFORMATICA.2017.119•

Developing a Two-Dimensional Categorization System for Educational Tasks in Informatics

[...]

Valentina Dagienė¹, Sue Sentance, Gabrielė Stupurienė¹•Institutions (1)

Vilnius University¹

01 Jan 2017-Informatica (lithuanian Academy of Sciences)

TL;DR: This paper presents a new twodimensional categorization system that takes account of computational thinking skills as well as content knowledge and examples are given from recent tasks that illustrate the role that Bebras can play in the development of computationalthinking skills.

...read moreread less

Abstract: Computational thinking is an increasingly important focus in computer science or informatics curricula around the world, and ways of incorporating it into the school curricula are being sought. The Bebras contest on informatics, which originated 12 years ago and now involves around 50 countries, consists of short problemsolving tasks based on topics in informatics. Bebras tasks engender the development of computational thinking skills by incorporating abstraction, algorithmic thinking, decomposition, evaluation and generalization. Bebras tasks cover a range of informatics concepts including algorithms and data structures, programming, networking, databases and social and ethical issues. Having built up a substantial number of Bebras tasks over 12 years it is important to be able to categorise them so that they can be easily accessed by the Bebras community and teachers within schools. The categorization of tasks within Bebras is important as it ensures that tasks span a wide range of topics; there have been several categorization schemes suggested to date. In this paper we present a new twodimensional categorization system that takes account of computational thinking skills as well as content knowledge. Examples are given from recent tasks that illustrate the role that Bebras can play in the development of computational thinking skills.

...read moreread less

Journal Article•10.1017/S0305000916000568•

Linking language and categorization in infancy.

[...]

Brock Ferguson¹, Sandra R. Waxman¹•Institutions (1)

Northwestern University¹

01 May 2017-Journal of Child Language

TL;DR: Evidence is reviewed documenting the developmental origins of a precocious link between language and object categories in very young infants, which proposes that, early in life, language promotes categorization at least in part through its status as a social, communicative signal.

...read moreread less

Abstract: Language exerts a powerful influence on our concepts. We review evidence documenting the developmental origins of a precocious link between language and object categories in very young infants. This collection of studies documents a cascading process in which early links between language and cognition provide the foundation for later, more precise ones. We propose that, early in life, language promotes categorization at least in part through its status as a social, communicative signal. But over the first year, infants home in on the referential power of language and, by their second year, begin teasing apart distinct kinds of names (e.g. nouns, adjectives) and their relation to distinct kinds of concepts (e.g. object categories, properties). To complement this proposal, we also relate this evidence to several alternative accounts of language's effect on categorization, appealing to similarity ('labels-as-features'), familiarity ('auditory overshadowing'), and communicative biases ('natural pedagogy').

...read moreread less

Journal Article•10.1037/XHP0000410•

Evaluating the sources and functions of gradiency in phoneme categorization: An individual differences approach.

[...]

Efthymia C. Kapnoula¹, Matthew B. Winn², Eun Jong Kong³, Jan Edwards⁴, Bob McMurray¹ - Show less +1 more•Institutions (4)

University of Iowa¹, University of Washington², Korea Aerospace University³, University of Wisconsin-Madison⁴

13 Apr 2017-Journal of Experimental Psychology: Human Perception and Performance

TL;DR: The results provide validation for this new method of assessing phoneme categorization gradiency and offer preliminary insights into how different aspects of speech perception may be linked to each other and to more general cognitive processes.

...read moreread less

Abstract: During spoken language comprehension listeners transform continuous acoustic cues into categories (e.g., /b/ and /p/). While long-standing research suggests that phonetic categories are activated in a gradient way, there are also clear individual differences in that more gradient categorization has been linked to various communication impairments such as dyslexia and specific language impairments (Joanisse, Manis, Keating, & Seidenberg, 2000; Lopez-Zamora, Luque, Alvarez, & Cobos, 2012; Serniclaes, Van Heghe, Mousty, Carre, & Sprenger-Charolles, 2004; Werker & Tees, 1987). Crucially, most studies have used 2-alternative forced choice (2AFC) tasks to measure the sharpness of between-category boundaries. Here we propose an alternative paradigm that allows us to measure categorization gradiency in a more direct way. Furthermore, we follow an individual differences approach to (a) link this measure of gradiency to multiple cue integration, (b) explore its relationship to a set of other cognitive processes, and (c) evaluate its role in individuals' ability to perceive speech in noise. Our results provide validation for this new method of assessing phoneme categorization gradiency and offer preliminary insights into how different aspects of speech perception may be linked to each other and to more general cognitive processes. (PsycINFO Database Record

...read moreread less

Journal Article•10.1037/MET0000095•

A comparison of latent class, K-means, and K-median methods for clustering dichotomous data.

[...]

Michael J. Brusco¹, Emilie Shireman², Douglas Steinley²•Institutions (2)

Florida State University¹, University of Missouri²

01 Sep 2017-Psychological Methods

TL;DR: Simulation-based comparisons of the latent class, K-means, and K-median approaches for partitioning dichotomous data found that the 3 approaches can exhibit profound differences when applied to real data.

...read moreread less

Abstract: The problem of partitioning a collection of objects based on their measurements on a set of dichotomous variables is a well-established problem in psychological research, with applications including clinical diagnosis, educational testing, cognitive categorization, and choice analysis. Latent class analysis and K-means clustering are popular methods for partitioning objects based on dichotomous measures in the psychological literature. The K-median clustering method has recently been touted as a potentially useful tool for psychological data and might be preferable to its close neighbor, K-means, when the variable measures are dichotomous. We conducted simulation-based comparisons of the latent class, K-means, and K-median approaches for partitioning dichotomous data. Although all 3 methods proved capable of recovering cluster structure, K-median clustering yielded the best average performance, followed closely by latent class analysis. We also report results for the 3 methods within the context of an application to transitive reasoning data, in which it was found that the 3 approaches can exhibit profound differences when applied to real data. (PsycINFO Database Record

...read moreread less

Proceedings Article•10.1109/CSCLOUD.2017.15•

Machine Learning for Anomaly Detection and Categorization in Multi-Cloud Environments

[...]

Tara Salman¹, Deval Bhamare², Aiman Erbad², Raj Jain¹, Mohammed Samaka² - Show less +1 more•Institutions (2)

Washington University in St. Louis¹, Qatar University²

1 Jun 2017

TL;DR: In this article, the authors investigated both detecting and categorizing anomalies rather than just detecting, which is a common trend in the contemporary research works, and argued that such categorization can be applied to multi-cloud environments using the same machine learning techniques.

...read moreread less

Abstract: Cloud computing has been widely adopted by application service providers (ASPs) and enterprises to reduce both capital expenditures (CAPEX) and operational expenditures (OPEX). Applications and services previously running on private data centers are now being migrated to private or public clouds. Since most of the ASPs and enterprises have globally distributed user bases, their services need to be distributed across multiple clouds, spread across the globe which can achieve better performance in terms of latency, scalability and load balancing. The shift has eventually led the research community to study multi-cloud environments. However, the widespread acceptance of such environments has been hampered by major security concerns. Firewalls and traditional rule-based security protection techniques are not sufficient to protect user-data in multi-cloud scenarios. Recently, advances in machine learning techniques have attracted the attention of the research community to build intrusion detection systems (IDS) that can detect anomalies in the network traffic. Most of the research works, however, do not differentiate among different types of attacks. This is, in fact, necessary for appropriate countermeasures and defense against attacks. In this paper, we investigate both detecting and categorizing anomalies rather than just detecting, which is a common trend in the contemporary research works. We have used a popular publicly available dataset to build and test learning models for both detection and categorization of different attacks. To be precise, we have used two supervised machine learning techniques, namely linear regression (LR) and random forest (RF). We show that even if detection is perfect, categorization can be less accurate due to similarities between attacks. Our results demonstrate more than 99% detection accuracy and categorization accuracy of 93.6%, with the inability to categorize some attacks. Further, we argue that such categorization can be applied to multi-cloud environments using the same machine learning techniques.

...read moreread less

Journal Article•10.3389/FPSYG.2017.01335•

A Cognitive Modeling Approach to Strategy Formation in Dynamic Decision Making.

[...]

Sabine Prezenski¹, André Brechmann², Susann Wolff³, Nele Russwinkel¹•Institutions (3)

Technical University of Berlin¹, Leibniz Association², Leibniz Institute for Neurobiology³

04 Aug 2017-Frontiers in Psychology

TL;DR: An ACT-R model is developed for a complex rule-based category learning task where participants first needed to identify the conjunction of two rules that defined a target category and later adapt to a reversal of feedback contingencies.

...read moreread less

Abstract: Decision-making is a high-level cognitive process based on cognitive processes like perception, attention, and memory. Real-life situations require series of decisions to be made, with each decision depending on previous feedback from a potentially changing environment. To gain a better understanding of the underlying processes of dynamic decision-making, we applied the method of cognitive modeling on a complex rule-based category learning task. Here, participants first needed to identify the conjunction of two rules that defined a target category and later adapt to a reversal of feedback contingencies. We developed an ACT-R model for the core aspects of this dynamic decision-making task. An important aim of our model was that it provides a general account of how such tasks are solved and, with minor changes, is applicable to other stimulus materials. The model was implemented as a mixture of an exemplar-based and a rule-based approach which incorporates perceptual-motor and metacognitive aspects as well. The model solves the categorization task by first trying out one-feature strategies and then, as a result of repeated negative feedback, switching to two-feature strategies. Overall, this model solves the task in a similar way as participants do, including generally successful initial learning as well as reversal learning after the change of feedback contingencies. Moreover, the fact that not all participants were successful in the two learning phases is also reflected in the modeling data. However, we found a larger variance and a lower overall performance of the modeling data as compared to the human data which may relate to perceptual preferences or additional knowledge and rules applied by the participants. In a next step, these aspects could be implemented in the model for a better overall fit. In view of the large interindividual differences in decision performance between participants, additional information about the underlying cognitive processes from behavioral, psychobiological and neurophysiological data may help to optimize future applications of this model such that it can be transferred to other domains of comparable dynamic decision tasks.

...read moreread less

Journal Article•10.1111/SPC3.12313•

The double-edged sword of diversity: Toward a dual pathway model

[...]

Ashli Carter¹, Katherine W. Phillips¹•Institutions (1)

Columbia University¹

01 May 2017-Social and Personality Psychology Compass

TL;DR: This article proposed a dual pathway model of diversity's effects that integrates social categorization, similarity-attraction and information and decision-making, and found that social categorisation and reduced attraction to dissimilar others will allow individuals in diverse rather than homogeneous groups to focus more on the task, anticipate differences in task-relevant opinions and perspectives, and engage in more effortful information processing.

...read moreread less

Abstract: Williams and O'Reilly (1998) published a seminal review of diversity research that has become a classic resource for researchers and practitioners alike. In the current review, we update the theoretical record by discussing traditional views of, as well as recent developments to, the 3 prominent frameworks used to understand diversity: social categorization, similarity-attraction, and information and decision-making. Furthermore, we propose a dual pathway model of diversity's effects that integrates all 3 frameworks. In our model, both positive and detrimental effects of diversity stem from processes of social categorization. Whereas these processes disrupt group functioning when intergroup bias is activated, when bias is not activated, we propose that social categorization and reduced attraction to dissimilar others will allow individuals in diverse rather than homogeneous groups to focus more on the task, anticipate differences in task-relevant opinions and perspectives, and engage in more effortful information processing. Finding the balance is key.

...read moreread less

Journal Article•10.1523/JNEUROSCI.2496-16.2017•

Neural Representations of Observed Actions Generalize across Static and Dynamic Visual Input.

[...]

Alon Hafri¹, John C. Trueswell¹, Russell A. Epstein¹•Institutions (1)

University of Pennsylvania¹

15 Mar 2017-The Journal of Neuroscience

TL;DR: Using fMRI, several areas of parietal, occipitotemporal, and frontal cortex are identified that exhibit action category codes that are similar across viewing of dynamic videos and still photographs and provide strong evidence for the involvement of these brain regions in recognizing the way that people interact physically with objects and other people.

...read moreread less

Abstract: People interact with entities in the environment in distinct and categorizable ways (e.g., kicking is making contact with foot). We can recognize these action categories across variations in actors, objects, and settings; moreover, we can recognize them from both dynamic and static visual input. However, the neural systems that support action recognition across these perceptual differences are unclear. Here, we used multivoxel pattern analysis of fMRI data to identify brain regions that support visual action categorization in a format-independent way. Human participants were scanned while viewing eight categories of interactions (e.g., pulling) depicted in two visual formats: (1) visually controlled videos of two interacting actors and (2) visually varied photographs selected from the internet involving different actors, objects, and settings. Action category was decodable across visual formats in bilateral inferior parietal, bilateral occipitotemporal, left premotor, and left middle frontal cortex. In most of these regions, the representational similarity of action categories was consistent across subjects and visual formats, a property that can contribute to a common understanding of actions among individuals. These results suggest that the identified brain regions support action category codes that are important for action recognition and action understanding.SIGNIFICANCE STATEMENT Humans tend to interpret the observed actions of others in terms of categories that are invariant to incidental features: whether a girl pushes a boy or a button and whether we see it in real-time or in a single snapshot, it is still pushing Here, we investigated the brain systems that facilitate the visual recognition of these action categories across such differences. Using fMRI, we identified several areas of parietal, occipitotemporal, and frontal cortex that exhibit action category codes that are similar across viewing of dynamic videos and still photographs. Our results provide strong evidence for the involvement of these brain regions in recognizing the way that people interact physically with objects and other people.

...read moreread less

Journal Article•10.1016/J.JARMAC.2017.07.005•

Interleaved Presentation Benefits Science Category Learning

[...]

Luke G. Eglington¹, Sean H. K. Kang¹•Institutions (1)

Dartmouth College¹

01 Dec 2017-Journal of applied research in memory and cognition

TL;DR: In this article, the authors investigated whether the sequencing of exemplars affects learning of rule-based science categories and found that interleaved presentation can improve learning of educationally relevant categories, even when coupled with pedagogical techniques that emphasize diagnostic features of each category.

...read moreread less

Abstract: The present study investigated whether the sequencing of exemplars affects learning of rule-based science categories. We taught undergraduate students chemical categories by showing them diagrams of organic chemical compounds accompanied by their category label in either an interleaved or blocked fashion. On a categorization test of novel exemplars administered 2 days later, interleaved presentation produced superior performance than blocked presentation for categories that were visually simple (Experiment 1) and relatively more complex (Experiment 2). The advantage of interleaved presentation was still observed when the diagnostic features of each category were highlighted during study (Experiments 3 and 4), suggesting that the benefit of interleaving may not be due solely to increasing the salience of contrastive features that differentiate the categories. These findings demonstrate that interleaved presentation can improve learning of educationally relevant categories, even when coupled with pedagogical techniques that emphasize the diagnostic features of each category.

...read moreread less

Journal Article•10.1080/23273798.2016.1198819•

Categories of object concepts across languages and brains: the relevance of nominal classification systems to cognitive neuroscience

[...]

David Kemmerer¹•Institutions (1)

Purdue University¹

21 Apr 2017-Language, cognition and neuroscience

TL;DR: These analyses strongly suggest that the neural underpinnings of object concepts are influenced by both universal tendencies and cultural idiosyncrasies.

...read moreread less

Abstract: Research on how categories of object concepts are implemented in the human brain has focused primarily on the sorts of semantic structures that are found in English and a few other European languages. This paper provides a broader typological perspective by considering the multifarious categories of object concepts that are encoded by languages with nominal classification systems. In these languages, speakers must explicitly categorize objects at both basic and superordinate levels – indicating, for instance, that a particular entity is not just a pencil but an elongated thing. The following semantic parameters of nominal classification systems are discussed: animacy and related properties, shape and related properties, size, constitution, and interaction/function. For each parameter, cross-linguistically frequent and infrequent semantic distinctions are surveyed first, and then their relevance to cognitive neuroscience is considered. These analyses strongly suggest that the neural underpinnings o...

...read moreread less

Journal Article•10.1017/S0007123415000393•

A categorization theory of spatial voting: how the center divides the political space

[...]

Jørgen Bølstad, Elias Dinas

01 Oct 2017-British Journal of Political Science

TL;DR: In this article, the authors present a categorization theory of spatial voting, which postulates that voters perceive political stances through coarse classifications and their preferences are characterized by discontinuities, rewarding parties on their side of the ideological space more than existing spatial models would predict.

...read moreread less

Abstract: This article presents a categorization theory of spatial voting, which postulates that voters perceive political stances through coarse classifications. Because voters think in terms of categories defined by the ideological center, their behavior deviates from standard models of utility maximization along ideological continua. Their preferences are characterized by discontinuities, rewarding parties on their side of the ideological space more than existing spatial models would predict. While this study concurs with prior studies suggesting that voters tend to use a proximity rule, it argues that this rule mainly serves to distinguish among parties of the same side. Overall, the results suggest that voters’ party evaluations are characterized by a nontrivial identity component, generating in-group biases not captured by the existing spatial models of voting.

...read moreread less

Journal Article•10.1109/TPAMI.2016.2635138•

Visually Grounded Meaning Representations

[...]

Carina Silberer¹, Vittorio Ferrari¹, Mirella Lapata¹•Institutions (1)

University of Edinburgh¹

01 Nov 2017-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: A new model which uses stacked autoencoders to learn higher-level representations from textual and visual input is introduced which yields a better fit to behavioral data compared to baselines and related models which either rely on a single modality or do not make use of attribute-based input.

...read moreread less

Abstract: In this paper we address the problem of grounding distributional representations of lexical meaning. We introduce a new model which uses stacked autoencoders to learn higher-level representations from textual and visual input. The visual modality is encoded via vectors of attributes obtained automatically from images. We create a new large-scale taxonomy of 600 visual attributes representing more than 500 concepts and 700 K images. We use this dataset to train attribute classifiers and integrate their predictions with text-based distributional models of word meaning. We evaluate our model on its ability to simulate word similarity judgments and concept categorization. On both tasks, our model yields a better fit to behavioral data compared to baselines and related models which either rely on a single modality or do not make use of attribute-based input.

...read moreread less

Journal Article•10.1007/S40593-016-0116-X•

An Educational System for Learning Search Algorithms and Automatically Assessing Student Performance

[...]

Foteini Grivokostopoulou¹, Isidoros Perikos¹, Ioannis Hatzilygeroudis¹•Institutions (1)

University of Patras¹

01 Mar 2017-International Journal of Artificial Intelligence in Education

TL;DR: Assessment approaches integrated in the system that aim to assist tutors in assessing the performance of students, reduce their marking task workload and provide immediate and meaningful feedback to students are presented.

...read moreread less

Abstract: In this paper, first we present an educational system that assists students in learning and tutors in teaching search algorithms, an artificial intelligence topic. Learning is achieved through a wide range of learning activities. Algorithm visualizations demonstrate the operational functionality of algorithms according to the principles of active learning. So, a visualization process can stop and request from a student to specify the next step or explain the way that a decision was made by the algorithm. Similarly, interactive exercises assist students in learning to apply algorithms in a step-by-step interactive way. Students can apply an algorithm to an example case, specifying the algorithm’s steps interactively, with the system’s guidance and help, when necessary. Next, we present assessment approaches integrated in the system that aim to assist tutors in assessing the performance of students, reduce their marking task workload and provide immediate and meaningful feedback to students. Automatic assessment is achieved in four stages, which constitute a general assessment framework. First, the system calculates the similarity between the student’s and the correct answer using the edit distance metric. In the next stage, it identifies the type of the answer, based on an introduced answer categorization scheme related to completeness and accuracy of an answer, taking into account student carelessness too. Afterwards, the types of errors are identified, based on an introduced error categorization scheme. Finally, answer is automatically marked via an automated marker, based on its type, the edit distance and the type of errors made. To assess the learning effectiveness of the system an extended evaluation study was conducted in real class conditions. The experiment showed very encouraging results. Furthermore, to evaluate the performance of the assessment system, we compared the assessment mechanism against expert (human) tutors. A total of 400 students’ answers were assessed by three tutors and the results showed a very good agreement between the automatic assessment system and the tutors.

...read moreread less

Journal Article•10.1037/REV0000064•

A neural interpretation of exemplar theory.

[...]

F. Gregory Ashby¹, Luke A. Rosedahl¹•Institutions (1)

University of California, Santa Barbara¹

06 Apr 2017-Psychological Review

TL;DR: A neural interpretation of exemplar theory is proposed in which category learning is mediated by synaptic plasticity at cortical-striatal synapses and alters connectivity between striatal neurons and neurons in sensory association cortex.

...read moreread less

Abstract: Exemplar theory assumes that people categorize a novel object by comparing its similarity to the memory representations of all previous exemplars from each relevant category. Exemplar theory has been the most prominent cognitive theory of categorization for more than 30 years. Despite its considerable success in providing good quantitative fits to a wide variety of accuracy data, it has never had a detailed neurobiological interpretation. This article proposes a neural interpretation of exemplar theory in which category learning is mediated by synaptic plasticity at cortical-striatal synapses. In this model, categorization training does not create new memory representations, rather it alters connectivity between striatal neurons and neurons in sensory association cortex. The new model makes identical quantitative predictions as exemplar theory, yet it can account for many empirical phenomena that are either incompatible with or outside the scope of the cognitive version of exemplar theory. (PsycINFO Database Record

...read moreread less

Book Chapter•10.1108/S0733-558X20170000051011•

From Categories to Categorization: A Social Perspective on Market Categorization

[...]

Rodolphe Durand¹, Nina Granqvist, Anna Tyllström²•Institutions (2)

HEC Paris¹, Uppsala University²

17 Mar 2017

TL;DR: In this article, the authors advocate a rebalance toward the social process of categorization, paying more heed to the entity to be categorized, the actors involved, their acts, and the context and timing, which informs these activities.

...read moreread less

Abstract: The popularity of research into categories has grown in recent decades and shows no sign of abating. This introductory article takes stock of the research into two facets of categorization, addressing it both as a cognitive and a social process. We advocate a rebalance toward the social process of categorization, paying more heed to the entity to be categorized, the actors involved, their acts, and the context and timing, which informs these activities. We summarize the contributions to the volume in relation to these dimensions and briefly discuss avenues for future research.

...read moreread less

Proceedings Article•10.1145/3038912.3052605•

Temporal Effects on Hashtag Reuse in Twitter: A Cognitive-Inspired Hashtag Recommendation Approach

[...]

Dominik Kowald¹, Subhash Chandra Pujari¹, Elisabeth Lex¹•Institutions (1)

Graz University of Technology¹

3 Apr 2017

TL;DR: In this article, a cognitive-inspired hashtag recommendation algorithm called BLLi,s is proposed to incorporate the effect of time on individual hashtag reuse and social hashtag reuse into a predictive model.

...read moreread less

Abstract: Hashtags have become a powerful tool in social platforms such as Twitter to categorize and search for content, and to spread short messages across members of the social network. In this paper, we study temporal hashtag usage practices in Twitter with the aim of designing a cognitive-inspired hashtag recommendation algorithm we call BLLi,s. Our main idea is to incorporate the effect of time on (i) individual hashtag reuse (i.e., reusing own hashtags), and (ii) social hashtag reuse (i.e., reusing hashtags, which has been previously used by a followee) into a predictive model. For this, we turn to the Base-Level Learning (BLL) equation from the cognitive architecture ACT-R, which accounts for the time-dependent decay of item exposure in human memory. We validate BLLI,S using two crawled Twitter datasets in two evaluation scenarios. Firstly, only temporal usage patterns of past hashtag assignments are utilized and secondly, these patterns are combined with a content-based analysis of the current tweet. In both evaluation scenarios, we find not only that temporal effects play an important role for both individual and social hashtag reuse but also that our BLLI,S approach provides significantly better prediction accuracy and ranking results than current state-of-the-art hashtag recommendation methods.

...read moreread less

...

Expand