Wikimedia blog

News from the Wikimedia Foundation and about the Wikimedia movement

Technology

News and information from the Wikimedia Foundation’s Technology department (RSS feed).

Wikimedia engineering report, October 2013

Major news in October includes:

Note: We’re also providing a shorter, simpler and translatable version of this report that does not assume specialized technical knowledge.

(more…)

Open letter for free access to Wikipedia on mobile in South Africa

In November 2012, the students of Sinenjongo High School penned an open letter on Facebook, encouraging cellphone carriers to waive data charges for accessing Wikipedia so they can do their homework. In May 2013, filmmaker Charlene Music and I asked them to read their open letter on camera. Below is the video of their letter:

The cost of data is a major obstacle to accessing the free knowledge on Wikipedia for hundreds of millions of people. These students want their cellphone carriers to sign up to Wikipedia Zero, a partnership program organized by the Wikimedia Foundation to enable mobile access to Wikipedia – free of data charges – in developing countries.

We will share a longer documentary about the class as soon as it’s ready. While we are still editing it, we’re looking for:

1.) A few skilled volunteers who can help to translate captions to accompany the video above and the longer documentary. There are currently eleven official languages in South Africa alone. We need volunteers to create captions for all those languages, and as many other languages as possible.

2.) A motion graphics or digital artist who could help us design and animate a few titles, maps and statistics for the documentary. If you are interested, feel free to email me: vgrigas at wikimedia.org or get in touch with me on my talk page User:Vgrigas.

3.) If you agree with these students, please share the video above.

Victor Grigas
Visual Storyteller, Wikimedia Foundation

(more…)

Telenor Wikipedia Zero partnership will provide free access to Wikipedia on mobile in Myanmar

Wikipedia founder Jimmy Wales and Telenor CEO Jon Fredrik Baksaas at the celebration.

As we announced today, the Wikimedia Foundation and Telenor have expanded our Wikipedia Zero partnership established in early 2012 to now include Myanmar. Wikipedia founder Jimmy Wales was in Oslo today and celebrated the agreement with Telenor’s President and CEO Jon Fredrik Baksaas.

On 27 June 2013, Telenor was named one of two successful applicants for a telecommunications license in Myanmar. With new mobile competition, the country will see better network service, internet-capable phones and lower prices to drive mobile internet usage.

This is a big deal because Myanmar currently has one of the lowest mobile penetration rates in the world, at less than 10 percent – only North Korea and Eritrea have lower rates. The Myanmar government’s stated objective is to increase mobile penetration to 80 percent in the next three years (overall internet penetration is estimated at roughly one percent). Another 40 million people will get mobile service, and many of them will be introduced to the internet for the first time.

With the extension of the partnership, Telenor Myanmar’s future mobile subscribers will be able to access the vast knowledge base in Wikipedia free of data charges. And they will be able to freely contribute their voices to Wikipedia. Today some people in Myanmar use Wikipedia, primarily in English, but usage is not widespread. The local Wikimedia community is working to grow the Burmese language version to reach a wider audience.

Removing barriers to access Wikipedia for people in Myanmar is a major step toward our goal of making the sum of all human knowledge available to everyone. We’re excited to see the benefits of this new partnership unfold.

Carolynne Schloeder
Director of Mobile Programs, Wikimedia Foundation

Introducing Beta Features

The Beta Features preferences page.

We’re pleased to announce Beta Features, a way you can try out new features on Wikipedia and other Wikimedia sites before they are released for everyone.

Beta Features lets developers roll out new software in an environment where lots of users can use these features, then give feedback to help make them better.

You can think of it as a digital laboratory – where community members can preview upcoming changes and help designers and engineers make improvements based on their suggestions. (more…)

Any language allowed in Wikidata

Language Committee Logo

The Language Committee of the Wikimedia Foundation, which is in charge of developing and processing new language projects, has decided that any language should be admissible for use on Wikidata. As always, this comes with several considerations.

    • The language needs to have an ISO 639-3 code, which is a three-letter identifier for a language, used particularly in computer systems. Languages used with multiple scripts need to be configured in this way.
    • Historic languages are permitted; newly minted words are not.
    • Constructed languages are permitted.
    • Language Localisation on MediaWiki is not required for the use of Wikidata.

When users add content that does not comply with the prescribed conditions, the labels added by those users will be removed.
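
As a rough illustration of the first condition, the sketch below checks only whether a string has the shape of an ISO 639-3 identifier (three lowercase Latin letters). It does not consult the ISO 639-3 registry, so a well-formed string may still be unassigned. The function name is hypothetical and is not part of Wikidata or MediaWiki.

```python
import re

# ISO 639-3 identifiers are exactly three lowercase ASCII letters, e.g. "eng".
_ISO_639_3_SHAPE = re.compile(r"[a-z]{3}")


def looks_like_iso_639_3(code: str) -> bool:
    """Return True if `code` has the shape of an ISO 639-3 identifier.

    This is a format check only (an assumption made for illustration); it
    does not verify that the code is assigned in the ISO 639-3 registry.
    """
    return _ISO_639_3_SHAPE.fullmatch(code) is not None


print(looks_like_iso_639_3("afr"))  # three letters: True
print(looks_like_iso_639_3("en"))   # two-letter ISO 639-1 code: False
```

A real admission check would also need to look the code up in the registry and handle the script configuration mentioned above, which this sketch leaves out.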

As Wikidata moves towards a repository of useful statements, it is likely that this information will be presented in an increasing number of Wikipedias. As items in Wikidata are enriched, all infoboxes in various Wikipedias that rely on data from Wikidata will be enriched as well.

Gerard Meijssen, Language Committee

Help design Wikipedia’s next-generation discussion system

Discussions are the backbone of all Wikimedia projects. Whether it’s finding a reliable source, settling on spelling and punctuation conventions, or picking an article to feature on the main page, our community of volunteer editors makes countless decisions each day simply by talking to each other. However, the way that editors communicate today – using freeform wiki pages – is confusing and difficult for new users to grasp. Flow is a Wikimedia Foundation project to create discussion and collaboration software that improves the experience for all our users, letting them focus on creating and improving content instead of mastering the talk page format.

When comments and discussion first appeared on the Internet, they brought the promise of brilliant minds discussing the issues of the day in a thoughtful, courteous fashion. Instead, what we got was a lot of: “FIRST POST!” “Jake sucks,” “Kylla rulez”, and “aliens caused climate change!!!” The Internet world dealt with this problem in various ways: by locking down poster permissions, paying staff to moderate content, or even turning comments off entirely.

Wikipedia and its sister projects face some different challenges – while the content of the encyclopedia grows in size and quality through peer-to-peer discussion and collaboration, the fact that anyone can participate in this process is still not obvious to most people who use Wikipedia as a resource. We know that a small, homogeneous contributor pool leads to gaps in knowledge and biased content, as well as overworked and frustrated editors. There are countless potential contributors who could pitch in to help, but who are dissuaded from participating in content discussions because of intimidating software. But, like other online discussion spaces, we also need to balance openness with tools to keep discussions productive and healthy.

(more…)

The Autonym Font for Language Names

When an article on Wikipedia is available in multiple languages, we see the list of those languages in a column on the side of the page. Each language name in the list is written in the language itself, using its own script (this form of the name is known as the language’s autonym).

This also means that all the appropriate fonts are needed for the autonyms to be correctly displayed. For instance, an article like the one about the Nobel Prize is available in more than 125 languages and requires approximately 35 different fonts to display the names of all the languages in the sidebar.

Language Autonyms

Initially, this was handled by the native fonts available on the reader’s device. If a font was not present, the user would see square boxes (commonly referred to as tofu) instead of the name of a language. To work around this problem, not just for the language list, but for other sections in the content area as well, the Universal Language Selector (ULS) started to provide a set of webfonts that were loaded with the page.

While this ensured that more language names would be correctly displayed, the presence of so many fonts dramatically increased the weight of the pages, which therefore loaded much more slowly for users than before. To improve client-side performance, webfonts were set not to be used for the Interlanguage links in the sidebar anymore.

Removing webfonts from the Interlanguage links was the easy and immediate solution, but it also took us back to the sub-optimal multilingual experience that we were trying to fix in the first place. Articles may be perfectly displayed thanks to webfonts, but if a link is not displayed in the language list, many users will not be able to discover that there is a version of the article in their language.

Autonyms were not needed just for Interlanguage links. They were also required for the Language Search and Selection window of the Universal Language Selector, which allows users to find their language if they are on a wiki displaying content in a script unfamiliar to them.

Missing font or “tofu”

As a solution, the Language Engineers came up with a trimmed-down font that contains only the characters required to display the names of the languages supported in MediaWiki. It has been named the Autonym font and will be used when only the autonyms are to be displayed on the page. At just over 50KB in size, it currently provides support for nearly 95% of the 400+ supported languages. The pending issues list identifies the problems with rendering and missing glyphs for some languages. If your language is missing glyphs and you know of an openly-licensed font that can fill that void, please let us know so we can add it.
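
The idea behind a trimmed-down font can be sketched in a few lines: collect every distinct character that appears in the language names, since those are the only characters the font has to cover. The snippet below is a simplified illustration, not the Language Engineering team’s actual tooling. The function name and the small sample of autonyms are assumptions chosen for the example, and a real font subset would also have to account for script shaping and ligatures.

```python
def required_codepoints(autonyms):
    """Return the sorted Unicode code points needed to draw every autonym.

    `autonyms` maps a language code to the language's name in its own script.
    Whitespace is skipped because it needs no visible glyph.
    """
    needed = set()
    for name in autonyms.values():
        needed.update(ord(ch) for ch in name if not ch.isspace())
    return sorted(needed)


# A tiny sample of autonyms (illustrative subset, not the full MediaWiki list).
sample = {"en": "English", "de": "Deutsch", "he": "עברית", "ru": "русский"}

codepoints = required_codepoints(sample)
print(len(codepoints), "distinct characters needed for", len(sample), "names")
```

Because many names share letters, the number of characters grows much more slowly than the number of languages, which is consistent with a font this small covering hundreds of names.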

The Autonym font addresses a very specific use case. There have been requests to explore the possibility of extending the use of this font to similar language lists, like the ones found on Wikimedia Commons. Within MediaWiki, the font can be used easily through a CSS class named autonym.
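
As a minimal sketch of how that class might be applied, the helper below builds an HTML fragment for one language name. Only the class name autonym comes from this post; the choice of a span element, the lang attribute, and the function name are assumptions made for illustration.

```python
from html import escape


def autonym_span(lang_code, autonym):
    """Wrap a language name in a span that carries the `autonym` CSS class.

    The markup shape is hypothetical; both values are escaped so the
    fragment is safe to embed in a page.
    """
    return '<span class="autonym" lang="{}">{}</span>'.format(
        escape(lang_code, quote=True), escape(autonym)
    )


print(autonym_span("nl", "Nederlands"))
# <span class="autonym" lang="nl">Nederlands</span>
```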

The Autonym font has been released for free use under the SIL Open Font License, Version 1.1.

Runa Bhattacharjee, Outreach and QA coordinator, Language Engineering, Wikimedia Foundation

Airtel Wikipedia Zero partnership to pilot Wikipedia via text

Today Airtel and the Wikimedia Foundation announced a partnership to launch Wikipedia Zero, an initiative to provide free access to Wikipedia on mobile phones. This partnership with Airtel will help provide Wikipedia access to 70 million new users in sub-Saharan Africa, starting in Kenya.

One exciting aspect of this partnership is that we are reaching a group of people we’ve never been able to reach before: mobile phone customers who don’t have internet access.

Throughout most of the developing world, data-enabled smartphones are the exception, not the rule. That means billions of people currently cannot see Wikipedia on their phones. Which phones? Low-cost basic phones (usually called feature phones or candy-bar phones). Phones like this:

So the challenge is, how do we reach the billions of people in the world who aren’t on the internet?

With text messaging. Even phones like these can send and receive text messages.

So for the first time, we are testing a service to allow access to Wikipedia articles via text message. It can work with any phone, even the most basic feature phone. You don’t even need an application.

How does Wikipedia via text work? A search is started in the same way people already use their phone to check their balance or add airtime. To search for a Wikipedia article through the Airtel partnership, a subscriber simply dials *515# on their phone, and they’ll get a text message inviting them to search Wikipedia. The subscriber enters a topic (like ‘Cheetah’) in the same manner they would send a text message.
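
The flow described above can be sketched as two steps: recognising the dialed code, then answering the topic the subscriber sends back. This is a toy model under stated assumptions, not Airtel’s or Wikimedia’s implementation. Only the code *515# and the ‘Cheetah’ example come from this post; the message wording, the function names, and the stand-in lookup are hypothetical.

```python
USSD_CODE = "*515#"  # short code quoted in the post


def start_session(dialed):
    """Return an invitation text if the subscriber dialed the Wikipedia code."""
    if dialed.strip() == USSD_CODE:
        # Hypothetical wording; the real message is not given in the post.
        return "Welcome to Wikipedia. Reply with a topic to search."
    return None


def handle_topic(topic, lookup):
    """Answer a subscriber's reply using `lookup`, a hypothetical search callable."""
    query = topic.strip()
    if not query:
        return "Please send a topic, for example: Cheetah"
    result = lookup(query)
    return result if result is not None else "No article found for: " + query


# Stand-in for a real article search, used only to make the sketch runnable.
fake_articles = {"cheetah": "Cheetah: placeholder summary text."}


def fake_lookup(query):
    return fake_articles.get(query.lower())


print(start_session("*515#"))
print(handle_topic("Cheetah", fake_lookup))
```
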
(more…)

Request for proposals: New datacenter in the continental US

The Wikimedia Foundation’s Technical Operations team is seeking proposals on the provisioning of a new datacenter facility.

After working through the specifics internally, we now have a public RFP posted and ready for proposals. We invite any organization meeting the requirements outlined to submit a proposal for review. Most of the relevant details are in the document itself, but feel free to reach out to me or anyone on the Technical Operations team if you have any questions.

Please feel free to forward this link far and wide. Have colleagues, contacts or friends in the datacenter sector? Then please forward it on! :)

Below are the primary requirements, excerpted from the RFP:

Primary Requirements

  • The data center location must be in the midwestern/western continental US (i.e., Chicago westward).
  • The capacity for at least 32 enclosures initially; expansion possibilities (first right of refusal in contract on adjacent or nearby cage area) for another row of 8.
  • (more…)

Scientific multimedia files get a second life on Wikipedia

On Wikimedia projects, audio and video content has traditionally taken a back seat to text and static images (however, changes are underway). Meanwhile, more and more scholarly publications come with audio and video files, though these are typically relegated to the “supplementary material”, a legacy from the print era, rather than embedded next to the relevant text passages. A rising number of these publications are Open Access, i.e. freely available under Creative Commons licenses that allow the materials to be reused in other contexts.

Why not enrich thematically related Wikimedia pages with such multimedia files? That’s where the Open Access Media Importer (OAMI) comes in. It makes scientific video and audio clips accessible to the Wikimedia community and a broader public audience. The OAMI is an open-source program (or ‘bot’) that crawls PubMed Central — a full-text database of over 3 million biomedical research articles — and extracts multimedia files from those publications in the database that are available under Wikimedia-compatible licenses.
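
The selection step described above (keep only media files from articles under compatible licenses) can be sketched as a simple filter. This is an illustration under assumptions, not the OAMI’s actual code: the record layout, the function name, the list of file extensions, and the exact set of licenses treated as compatible are all chosen for the example.

```python
# Licenses treated as Wikimedia-compatible in this sketch (assumed list).
COMPATIBLE_LICENSES = {"CC0", "CC BY", "CC BY-SA"}

# File extensions treated as audio or video in this sketch (assumed list).
MEDIA_EXTENSIONS = (".avi", ".mov", ".mp4", ".mpg", ".ogg", ".ogv", ".wav", ".mp3")


def reusable_media(articles):
    """Yield (article_id, filename) for media files in compatibly licensed articles."""
    for article in articles:
        if article["license"] not in COMPATIBLE_LICENSES:
            continue  # e.g. non-commercial licenses are skipped
        for filename in article["supplementary_files"]:
            if filename.lower().endswith(MEDIA_EXTENSIONS):
                yield article["id"], filename


# Hypothetical records standing in for crawled article metadata.
records = [
    {"id": "article-1", "license": "CC BY",
     "supplementary_files": ["movie1.mov", "table1.xls"]},
    {"id": "article-2", "license": "CC BY-NC",
     "supplementary_files": ["clip.avi"]},
]

for article_id, filename in reusable_media(records):
    print(article_id, filename)
```

In this sample only the video from the first record passes the filter: the spreadsheet is not a media file, and the second article’s non-commercial license is not in the assumed compatible set.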

Over 700 OAMI-contributed media files are currently used in Wikipedia and other Wikimedia projects. This X-ray video of a breathing American alligator, originally published by Claessens et al. (2009) in PLOS ONE, currently illustrates the “Respiratory system” entries in the Bulgarian, Chinese, English, German, Russian, and Serbo-Croatian Wikipedias.

Such reuse-friendly terms are the key ingredient to making scholarly materials useful beyond the article in which they were originally published. However, OAMI aims to make this material even more useful by making it accessible:

  • in places where people actually look for it (Wikimedia platforms are a prime example),
  • in one coherent format (in our case Ogg Vorbis/Theora, which isn’t encumbered by patent restrictions), and
  • in a way that allows for collaborative annotation with relevant metadata. This makes it a lot easier to browse and search the media files.

(more…)