Web Search and Data Mining

Conference Tools

Papers published on a yearly basis

Papers

Proceedings Article•10.1145/1718487.1718520•

TwitterRank: finding topic-sensitive influential twitterers

[...]

Jianshu Weng¹, Ee-Peng Lim¹, Jing Jiang¹, Qi He²•Institutions (2)

Singapore Management University¹, Pennsylvania State University²

4 Feb 2010

TL;DR: Experimental results show that TwitterRank outperforms the one Twitter currently uses and other related algorithms, including the original PageRank and Topic-sensitive PageRank, which is proposed to measure the influence of users in Twitter.

...read moreread less

Abstract: This paper focuses on the problem of identifying influential users of micro-blogging services. Twitter, one of the most notable micro-blogging services, employs a social-networking model called "following", in which each user can choose who she wants to "follow" to receive tweets from without requiring the latter to give permission first. In a dataset prepared for this study, it is observed that (1) 72.4% of the users in Twitter follow more than 80% of their followers, and (2) 80.5% of the users have 80% of users they are following follow them back. Our study reveals that the presence of "reciprocity" can be explained by phenomenon of homophily. Based on this finding, TwitterRank, an extension of PageRank algorithm, is proposed to measure the influence of users in Twitter. TwitterRank measures the influence taking both the topical similarity between users and the link structure into account. Experimental results show that TwitterRank outperforms the one Twitter currently uses and other related algorithms, including the original PageRank and Topic-sensitive PageRank.

...read moreread less

2,244 citations

Proceedings Article•10.1145/1935826.1935845•

Everyone's an influencer: quantifying influence on twitter

[...]

Eytan Bakshy¹, Jake M. Hofman², Winter Mason², Duncan J. Watts²•Institutions (2)

University of Michigan¹, Yahoo!²

9 Feb 2011

TL;DR: It is concluded that word-of-mouth diffusion can only be harnessed reliably by targeting large numbers of potential influencers, thereby capturing average effects and that predictions of which particular user or URL will generate large cascades are relatively unreliable.

...read moreread less

Abstract: In this paper we investigate the attributes and relative influence of 1.6M Twitter users by tracking 74 million diffusion events that took place on the Twitter follower graph over a two month interval in 2009. Unsurprisingly, we find that the largest cascades tend to be generated by users who have been influential in the past and who have a large number of followers. We also find that URLs that were rated more interesting and/or elicited more positive feelings by workers on Mechanical Turk were more likely to spread. In spite of these intuitive results, however, we find that predictions of which particular user or URL will generate large cascades are relatively unreliable. We conclude, therefore, that word-of-mouth diffusion can only be harnessed reliably by targeting large numbers of potential influencers, thereby capturing average effects. Finally, we consider a family of hypothetical marketing strategies, defined by the relative cost of identifying versus compensating potential "influencers." We find that although under some circumstances, the most influential users are also the most cost-effective, under a wide range of plausible assumptions the most cost-effective performance can be realized using "ordinary influencers"---individuals who exert average or even less-than-average influence.

...read moreread less

2,102 citations

Proceedings Article•10.1145/2684822.2685324•

Exploring the Space of Topic Coherence Measures

[...]

Michael Röder¹, Andreas Both, Alexander Hinneburg²•Institutions (2)

Leipzig University¹, Martin Luther University of Halle-Wittenberg²

2 Feb 2015

TL;DR: This work is the first to propose a framework that allows to construct existing word based coherence measures as well as new ones by combining elementary components, and shows that new combinations of components outperform existing measures with respect to correlation to human ratings.

...read moreread less

Abstract: Quantifying the coherence of a set of statements is a long standing problem with many potential applications that has attracted researchers from different sciences. The special case of measuring coherence of topics has been recently studied to remedy the problem that topic models give no guaranty on the interpretablity of their output. Several benchmark datasets were produced that record human judgements of the interpretability of topics. We are the first to propose a framework that allows to construct existing word based coherence measures as well as new ones by combining elementary components. We conduct a systematic search of the space of coherence measures using all publicly available topic relevance data for the evaluation. Our results show that new combinations of components outperform existing measures with respect to correlation to human ratings. nFinally, we outline how our results can be transferred to further applications in the context of text mining, information retrieval and the world wide web.

...read moreread less

2,011 citations

Proceedings Article•10.1145/3159652.3159656•

Personalized Top-N Sequential Recommendation via Convolutional Sequence Embedding

[...]

Jiaxi Tang¹, Ke Wang¹•Institutions (1)

Simon Fraser University¹

2 Feb 2018

TL;DR: A Convolutional Sequence Embedding Recommendation Model »Caser» is proposed, which is to embed a sequence of recent items into an image in the time and latent spaces and learn sequential patterns as local features of the image using convolutional filters.

...read moreread less

Abstract: Top-N sequential recommendation models each user as a sequence of items interacted in the past and aims to predict top-N ranked items that a user will likely interact in a »near future». The order of interaction implies that sequential patterns play an important role where more recent items in a sequence have a larger impact on the next item. In this paper, we propose a Convolutional Sequence Embedding Recommendation Model »Caser» as a solution to address this requirement. The idea is to embed a sequence of recent items into an »image» in the time and latent spaces and learn sequential patterns as local features of the image using convolutional filters. This approach provides a unified and flexible network structure for capturing both general preferences and sequential patterns. The experiments on public data sets demonstrated that Caser consistently outperforms state-of-the-art sequential recommendation methods on a variety of common evaluation metrics.

...read moreread less

1,950 citations

Proceedings Article•10.1145/1935826.1935877•

Recommender systems with social regularization

[...]

Hao Ma¹, Dengyong Zhou², Chao Liu², Michael R. Lyu¹, Irwin King³ - Show less +1 more•Institutions (3)

The Chinese University of Hong Kong¹, Microsoft², AT&T Labs³

9 Feb 2011

TL;DR: This paper proposes a matrix factorization framework with social regularization, which can be easily extended to incorporate other contextual information, like social tags, etc, and demonstrates that the approaches outperform other state-of-the-art methods.

...read moreread less

Abstract: Although Recommender Systems have been comprehensively analyzed in the past decade, the study of social-based recommender systems just started. In this paper, aiming at providing a general method for improving recommender systems by incorporating social network information, we propose a matrix factorization framework with social regularization. The contributions of this paper are four-fold: (1) We elaborate how social network information can benefit recommender systems; (2) We interpret the differences between social-based recommender systems and trust-aware recommender systems; (3) We coin the term Social Regularization to represent the social constraints on recommender systems, and we systematically illustrate how to design a matrix factorization objective function with social regularization; and (4) The proposed method is quite general, which can be easily extended to incorporate other contextual information, like social tags, etc. The empirical analysis on two large datasets demonstrates that our approaches outperform other state-of-the-art methods.

...read moreread less

1,935 citations

...

Expand

Year	Papers
2023	128
2022	51
2021	157
2020	129
2019	122
2018	112

Conference Tools

Papers published on a yearly basis

Papers

TwitterRank: finding topic-sensitive influential twitterers

Everyone's an influencer: quantifying influence on twitter

Exploring the Space of Topic Coherence Measures

Personalized Top-N Sequential Recommendation via Convolutional Sequence Embedding

Recommender systems with social regularization

Performance Metrics