TL;DR: The evaluation of websites contained herein will show that assessment of Internet-based information is necessary and that as long as certain steps are taken to ensure the dependability of the information sources, there is no reason to doubt the validity of the Internet as a source of information, both scholarly and general.
Abstract: Using the Internet as an information source is a fairly new proposition for most researchers, but the depth and breadth of coverage on the net mark it as an important research tool. While the prospect of having volumes of data at one' s fingertips may seem like a dream come true to researchers, it must be recognized that certain precautions need to be taken. Websites should be evaluated and judged before their content is accepted for use. The evaluation of websites contained herein will show that assessment of Internet-based information is necessary. It is also a model of a method by which to perform these evaluations. Also evident from these evaluations will be that as long as certain steps are taken to ensure the dependability of the information sources, there is no reason to doubt the validity of the Internet as a source of information, both scholarly and general.
TL;DR: This paper analyzes a large dataset of passively collected traffic summaries to observe how pervasive online tracking is, and calls for an increase of the sensibility of people, researchers and regulators toward privacy in the Internet.
Abstract: During the visit to any website, the average internaut may face scripts that upload personal information to so called online trackers, invisible third party services that collect information about users and profile them. This is no news, and many works in the past tried to measure the extensiveness of this phenomenon. All of them ran active measurement campaigns via crawlers. In this paper, we observe the phenomenon from a passive angle, to naturally factor the diversity of the Internet and of its users. We analyze a large dataset of passively collected traffic summaries to observe how pervasive online tracking is. We see more than 400 tracking services being contacted by unaware users, of which the top 100 are regularly reached by more than 50 % of Internauts, with top three that are practically impossible to escape. Worse, more than 80 % of users gets in touch the first tracker within 1 second after starting navigating. And we see a lot of websites that hosts hundreds of tracking services. Conversely, those popular web extensions that may improve personal protection, e.g., DoNotTrackMe, are actually installed by a handful of users (3.5 %). The resulting picture witnesses how pervasive the phenomenon is, and calls for an increase of the sensibility of people, researchers and regulators toward privacy in the Internet.