Detecting Textual Reuse in News Stories, At Scale

Tom Nicholls

Founding Editor

Larry Gross

University of Southern California

Founding Managing Editor

Arlene Luck

Editor

Silvio Waisbord

George Washington University

Managing Editor

Kady Bell-Garcia

Managing Editor, Special Sections

Chi Zhang

Webmaster

Andrew Taylor

Editorial Board

Sean Aday
George Washington University

Omar Al-Ghazzi
London School of Economics and Political Science

Ilhem Allagui
Northwestern University-Qatar

Abeer Al-Najjar
American University Of Sharjah

Meryl Alper
Northeastern University

Adriana Amaral
Universidade Federal Fluminense

Hector Amaya
University of Southern California

Melissa Miriam Aronczyk
Rutgers University

Jonathan David Aronson
University of Southern California

Karen Arriaza Ibarra
Universidad Complutense de Madrid Spain

Hanan Badr
Paris Lodron University of Salzburg

Sandra Ball-Rokeach
University of Southern California

Sarah Banet-Weiser
University of Pennsylvania/University of Southern California

Francois Bar
University of Southern California

Emma Baulch
Monash University Malaysia

Yochai Benkler
Harvard Law School

Lance Bennett
University of Washington

TJ Billard
Northwestern University

Bruce Bimber
UC Santa Barbara

Pablo Javier Boczkowski
Northwestern University

Mark Boukes
University of Amsterdam

Nicholas David Bowman
Syracuse University

danah boyd
Microsoft Research / Data & Society

Michael Brüggemann
University of Hamburg

Gustavo Cardoso
University of Lisbon

Manuel Castells

Lik Sam Chan
University of Sydney

Michael Chan
Chinese University of Hong Kong

Jaeho Cho
University of California, Davis

Lilie Chouliaraki
London School of Economics and Political Science

Renita Coleman
University of Texas

Simon Cottle
Cardiff University

Sasha Costanza-Chock
Massachusetts Institute of Technology

Nick Couldry
London School of Economics and Political Science

Robert T. Craig
University of Colorado at Boulder

Nick Cull
University of Southern California

Afonso de Albuquerque
Universidade Federal Fluminense

Michael X. Delli Carpini
University of Pennsylvania

Claes de Vreese
University of Amsterdam

Marco Deseriis
Scuola Normale Superiore

Alexander Dhoest
University of Antwerp

Susan Douglas
University of Michigan

William Dutton
Michigan State University

Stephen Duncombe
New York University

Richard Dyer
University of London

John Nguyet Erni
Hong Kong Baptist University

Lewis Allen Friedland
University of Wisconsin-Madison

Anthony Y.H. Fung
Chinese University of Hong Kong

Oscar Gandy
University of Pennsylvania

Dilip Gaonkar
Northwestern University

Myria Georgiou
London School of Economics and Political Science

Homero Gil de Zúñiga
University of Salamanca Pennsylvania State University

Ian Glenn
University of Cape Town

Sergio Godoy
Universidad Catolica de Chile

Guy J. Golan
Texas Christian University

Trudy Govier
University of Lethbridge

Mary L. Gray
Microsoft Research & Indiana University

Larry Grossberg
University of North Carolina

Manuel Alejandro Guerrero
Universidad Iberoamericana

Lei Guo
Fudan University

Dan Hallin
University of California, San Diego

James Hamilton
Stanford University

Eszter Hargittai
University of Zurich

John Hartley
Curtin University

Francois Heinderyckx
Université Libre de Bruxelles

Andreas Hepp
University of Bremen

David Hesmondhalgh
University of Leeds

Tom Hollihan
University of Southern California

Yu Hong
Zhejiang University

Kathleen Hall Jamieson
University of Pennsylvania

Henry Jenkins
University of Southern California

Min Jiang
University of North Carolina at Charlotte

Dal Yong Jin
Simon Fraser University

Steve Jones
University of Illinois-Chicago

Douglas Kellner
UCLA

Su Jung Kim
University of Southern California

Marwan M. Kraidy
Northwestern University in Qatar

Josh Kun
University of Southern California

Chin-Chuan Lee
National Chengchi University

Chul-joo Lee
Seoul National University

Francis Lee
Chinese University of Hong Kong

Justin Lewis
Cardiff University

Sonia Livingstone
London School of Economics

Robin Elizabeth Mansell
London School of Economics

Alice E. Marwick
University of North Carolina at Chapel Hill

Jorg Matthes
University of Vienna

Robert McChesney
University of Illinois, Urbana-Champaign

Christine Meltzer
Hanover University for Music, Theater and Media

Kaitlynn Mendes
Western University

Oren Meyers
University of Haifa

Toby Miller
Universidad de La Frontera

Peter R. Monge
University of Southern California

Seungahn Nah
University of Florida

Thomas Nakayama
Northeastern University

Philip Napoli
Duke University

Horace Newcomb
University of Georgia

Zhongdang Pan
University of Wisconsin - Madison

Zizi Papacharissi
University of Illinois at Chicago

Cinzia Padovani
Southern Illinois University

John Durham Peters
Yale University

Victor Pickard
University of Pennsylvania

Alejandro Piscitelli
Universidad de San Andrés

Dana Polan
New York University

Marshall Scott Poole
University of Illinois, Urbana-Champaign

Adam Powell
University of Southern California

Shawn Mathew Powers
Georgia State

Monroe Price
University of Pennsylvania

Jack Linchuan Qiu
Nanyang Technological University

Janice Radway
Northwestern University

N. Bhaskara Rao
Centre for Media Studies, New Delhi

Michael Renov
USC Cinematic Arts

Allissa V. Richardson
University of Southern California

Eric Rothenbuhler
Webster University

Michael Schudson
Columbia University

Ellen Seiter
Hong Kong Baptist University

Brian Semujju
Makerere University

James Shanahan
Indiana University

Limor Shifman
Hebrew University of Jerusalem

Aram Sinnreich
American University

Joseph Straubhaar
University of Texas at Austin

Lukasz Szulc
University of Manchester

Kjerstin Thorson
Colorado State University

Katrin Tiidenberg
Tallinn University

Florian Toepfl
University of Passau

Yariv Tsfati
University of Haifa

Joseph Turow
University of Pennsylvania

Nikki Usher
University of San Diego

Derek W. Vaillant
University of Michigan

Baldwin Van Gorp
Ku Leuven University

Jorge Vázquez-Herrero
Universidade de Santiago de Compostela

Ingrid Volkmer
University of Melbourne

Jay Wang
University of Southern California

James Webster
Northwestern University

Chris Wells
Boston University

Dmitri Williams
University of Southern California

Angela Xiao Wu
New York University

Guobin Yang
University of Pennsylvania

Dannagal G. Young
University of Delaware

Barbie Zelizer
Annenberg/ University of Pennsylvania

Juyan Zhang
University of Texas at San Antonio

Yuezhi Zhao
Simon Fraser University

Ying Zhu
College of Staten Island, CUNY

Journal Help

User

Article Tools

Indexing metadata

How to cite item

Email this article (Login required)

Email the author (Login required)

Journal Content
Browse

Font Size

Information

PUBLISHED BY:

EDITORIAL STAFF

Sofie Haytin

Josh Widera

Assistant Editors

Open Journal Systems

Current Issue

ISSN: 1932-8036

Follow @IJoC_USC

Detecting Textual Reuse in News Stories, At Scale

Tom Nicholls

Abstract

Motivated by the debate around “churnalism” and online media, this article develops, evaluates, and validates a computational method for detecting shared text between different news articles, at scale, using n-gram shingling. It differentiates between newswire copy, public relations material, source-to-source copying, and common-source and incidental overlaps. I evaluate the method, quantitatively and qualitatively, and show that it can effectively handle newswire content, copying, and other forms of reuse. Substantively, I find lower levels of news agency and press release copy reuse than is suggested by previous studies, and conclude that the news agency finding is robust, but the lack of press release copy found might reflect limitations of the method and the changing practices of journalists.

Keywords

computational methods, news production, churnalism, news agency, automated content analysis, online news

Full Text:

PDF

Username
Password
Remember me

International Journal of Communication

Founding Editor

Founding Managing Editor

Editor

Managing Editor

Managing Editor, Special Sections

Webmaster

Editorial Board

Detecting Textual Reuse in News Stories, At Scale

Abstract

Keywords

Full Text: