Wiki Workshop 2017

A forum bringing together researchers exploring all aspects of Wikipedia and other Wikimedia sites. Held at WWW 2017 in Perth, Australia, on April 4, 2017.

  • Jan. 24, 2017: In response to multiple requests, the first paper submission deadline has been extended to Tuesday, January 31, 2017 (see below for more details).
  • Jan. 19, 2017: Paper submission site now open
  • Jan. 18, 2017: First invited speakers announced—stay tuned for more!
  • Dec. 15, 2016: Wiki Workshop 2017 webpage online.

To be announced

(More speakers to be announced soon—stay tuned!)

Maarten de Rijke (University of Amsterdam)

Maarten is full professor of Information Processing and Internet in the Informatics Institute at the University of Amsterdam. He holds MSc degrees in Philosophy and Mathematics, and a PhD in Theoretical Computer Science. He worked as a postdoc at CWI, before becoming a Warwick Research Fellow at the University of Warwick, UK. He joined the University of Amsterdam in 1998, and was appointed full professor in 2004. Maarten leads the Information and Language Processing Systems group, one of the world's leading academic research groups in information retrieval. His research focus is on intelligent information access, with projects on self-learning search engines, semantic search, and social media analytics.

Ricardo Baeza-Yates (NTENT)

Ricardo has been Chief Technology Officer at NTENT, Inc., since July 17, 2016. He served as Vice President and Chief Research Scientist of Yahoo! Research Labs, where he spent more than 10 years in various R&D roles. He has more than 30 years of technology industry experience. He served as a Research Fellow for Barcelona and Santiago de Chile of Yahoo!, Inc., since January 23, 2006. He served as a Researcher of Catalonian Institution for Research and Advanced Studies (ICREA) and served as Professor of UPF. He is an expert on information retrieval and one of the top Scientists in this area. He is a co-author of Modern Information Retrieval, the most-used textbook on search, as well as several other books. He is also an ACM Fellow and an IEEE Fellow, with over 500 publications, tens of thousands of citations, multiple awards and several patents. Ricardo earned Bachelor and Masters Degree in both Computer Science and Electrical Engineering from the University of Chile and a PhD in Computer Science from the University of Waterloo in Ontario, Canada.

Ben Hachey (Hugo.AI)

Ben is a data science professional of 10+ years, with key expertise in computational linguistics, text analytics, machine learning, and information integration. He is currently Chief Data Scientist at Hugo.AI, where he leads the R&aml;D team building tools for fast, consistent and high-quality person research. He supervises PhD/Honours students at the University of Sydney, where he was previously a DECRA Research Fellow and developed the Master of Data Science.

Workshop date: April 4, 2017

If authors want paper to appear in proceedings:

  • Submission deadline: January 24, 2017 January 31, 2017 (end of day anywhere on Earth)
  • Author feedback: February 7, 2017
  • Camera-ready version due: February 14, 2017 February 23, 2017 (instructions here)

If authors do not want paper to appear in proceedings:

  • Submission deadline: February 26, 2017
  • Author feedback: March 7, 2017

Wikipedia is one of the most popular sites on the Web, a main source of knowledge for a large fraction of Internet users, and one of the very few projects that make not only their content but also many activity logs available to the public. Furthermore, other Wikimedia projects, such as Wikidata and Wikimedia Commons, have been created to share other types of knowledge with the world for free. For a variety of reasons (quality and quantity of content, reach in many languages, process of content production, availability of data, etc.) such projects have become important objects of study for researchers across many subfields of the computational and social sciences, such as social network analysis, artificial intelligence, linguistics, natural language processing, social psychology, education, anthropology, political science, human–computer interaction, and cognitive science.

The goal of this workshop is to bring together researchers exploring all aspects of Wikimedia websites such as Wikipedia, Wikidata, and Commons. With members of the Wikimedia Foundation's Research team on the organizing committee and with the experience of successful workshops in 2015 and 2016, we aim to continue facilitating a direct pathway for exchanging ideas between the organization that operates Wikimedia websites and the researchers interested in studying them.

Topics of interest include, but are not limited to

  • new technologies and initiatives to grow content, quality, diversity, and participation across Wikimedia projects
  • use of bots, algorithms, and crowdsourcing strategies to curate, source, or verify content and structured data
  • bias in content and gaps of knowledge
  • diversity of Wikimedia editors and users
  • understanding editor motivations, engagement models, and incentives
  • Wikimedia consumer motivations and their needs: readers, researchers, tool/API developers
  • innovative uses of Wikipedia and other Wikimedia projects for AI and NLP applications
  • consensus-finding and conflict resolution on editorial issues
  • participation in discussions and their dynamics
  • dynamics of content reuse across projects and the impact of policies and community norms on reuse
  • privacy
  • collaborative content creation (unstructured, semi-structured, or structured)
  • collaborative task management
  • innovative uses of Wikimedia projects' content and consumption patterns as sensors for real-world events, culture, etc.

Papers should be 1 to 8 pages long and will be published on the workshop webpage and optionally (depending on the authors' choice) in the workshop proceedings. Authors whose papers are accepted to the workshop will have the opportunity to participate in a poster session.

We explicitly encourage the submission of preliminary work in the form of extended abstracts (1 or 2 pages).

Papers should be 1 to 8 pages long. We explicitly encourage the submission of preliminary work in the form of extended abstracts (1 or 2 pages). No need to anonymize your submissions.

For submission dates, see above.

Robert West

Bob is an assistant professor of Computer Science at EPFL. His research aims to understand, predict, and enhance human behavior in social and information networks by developing techniques in data science, data mining, network analysis, machine learning, and natural language processing. He holds a PhD in computer science from Stanford University.

Leila Zia

Leila is a senior research scientist at the Wikimedia Foundation. Her current research interests are on understanding Wikipedia's readers, quantifying and addressing the gaps of knowledge in Wikipedia and Wikidata, and understanding and improving diversity in Wikipedia. She holds a PhD in management science and engineering from Stanford University.

Dario Taraborelli

Dario is a social computing researcher and the Wikimedia Foundation's Head of Research. His current interests focus on online collaboration, open science, and the measurement and discoverability of scientific knowledge. He holds a PhD in cognitive science from the École des Hautes Études en Sciences Sociales.

Jure Leskovec

Jure is an associate professor of Computer Science at Stanford University. His research focuses on mining and modeling large social and information networks, their evolution, and diffusion of information and influence over them. Problems he investigates are motivated by large scale data, the Web and online media.

Please direct your questions to wikiworkshopgooglegroupscom.