Tracking Changes in Online Newspaper Headlines Cover Image

Tracking Changes in Online Newspaper Headlines
Tracking Changes in Online Newspaper Headlines

Author(s): David Brett
Subject(s): Applied Linguistics, Sociolinguistics, Philology, Stylistics
Published by: Editura Universitaria Craiova
Keywords: newspaper language; quantitative; print and online newspapers; scraping; corpus linguistics; headlines; updates;

Summary/Abstract: In this study online newspaper headlines were tracked over time to see whether, and to what extent, they undergo changes, and if they do, what form these take. The html of the homepage of the guardian.com was downloaded every hour for six days. Subsequently, all the headlines were extracted (n=810), along with the URLs of the pages to which they were linked (n=615). The discrepancy between these two numbers is due to the fact that some headlines were changed,some even up to 8 times. Timestamps allowed these changes to be ordered chronologically. Several types of variation were observed, including typo correction, content update, syntactic reformulation and the insertion/modification/deletion of kickers.

  • Issue Year: 1/2020
  • Issue No: XXI
  • Page Range: 139-156
  • Page Count: 18
  • Language: English