Wikipedia's Content Translation through Machine Learning (Project Symmetry)

Open
Project
Academic experience
140 hours per Student
Student
Anywhere
Intermediate level

Project scope

Categories
Software development, Machine learning, Artificial intelligence, Databases, Education
Skills
GitHub, agile methodology, machine learning, French language, English language, innovation
Details

WHO: Grey-box is a social innovation startup working to provide free Wi-Fi access to educational content in areas where internet and electricity access is problematic.

WHY: 50% of the world lives offline, meaning they haven't connected to the internet in the last 3 months. In Africa, the average cost of 1 GB of data is 7% of the average monthly salary. Even in North America, 50% of rural Canada does not have access to the minimum internet speed. We believe that this is a solvable problem.

HOW: In a nutshell, for the current project (codename: Symmetry), we want to translate content from one language edition of Wikipedia to another (for example, improve the English Wikipedia article on Molière based on content available only on the French page, and vice versa). The idea is to improve overall Wikipedia content, especially in underrepresented languages.
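As a rough illustration of the "content available only on one page" idea, the sketch below flags sentences in a source article that have no close match in a target article. It is a naive baseline and not the project's actual method: it assumes both texts have already been brought into the same language (for instance by a machine translation step that is not shown), and the function names, the 0.5 threshold, and the sample sentences are all hypothetical.

```python
import re


def tokens(sentence: str) -> set[str]:
    """Lowercase a sentence and return its set of word tokens."""
    return set(re.findall(r"\w+", sentence.lower()))


def jaccard(a: set[str], b: set[str]) -> float:
    """Jaccard similarity: shared tokens divided by all distinct tokens."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def missing_sentences(source, target, threshold=0.5):
    """Return source sentences with no sufficiently similar target sentence.

    Assumption: `source` and `target` are lists of sentences that are
    already in the same language. `threshold` is an arbitrary cut-off.
    """
    target_sets = [tokens(t) for t in target]
    missing = []
    for sentence in source:
        s = tokens(sentence)
        # Best word-overlap score against any sentence of the target article.
        best = max((jaccard(s, t) for t in target_sets), default=0.0)
        if best < threshold:
            missing.append(sentence)
    return missing


# Hypothetical sample data, for illustration only.
source_article = [
    "Molière was born in Paris in 1622.",
    "He founded the Illustre Théâtre in 1643.",
]
target_article = ["Molière was born in 1622 in Paris."]

gaps = missing_sentences(source_article, target_article)
print(gaps)
```

Word overlap is only a stand-in here; a trained model would be expected to handle paraphrases and cross-language comparison far better than this baseline.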

Grey-box is a nonprofit structured under the principle of mobile-first and remote-first. Diversity is not only encouraged, it is an integral part of the strength of the team and its projects.

Deliverables

Within the context of an agile methodology, the learner will be tasked with the following:

  • Review of possible options and of the literature
  • Proposal of a solution and a schedule
  • Data set acquisition
  • Data cleaning
  • Neural network training
  • Presentation of prototype 1
  • UX/UI improvements to the interface
  • Content creation for social media (methodology and objectives)
  • Testing with the Wikipedia community, round 1
  • Presentation of prototype 2
  • Testing with the Wikipedia community, round 2
  • Presentation of the MVP to the team
  • Content creation for social media (prototypes and implications)
  • Documentation and closing of the project's GitHub repository


More information on this project can be found here: https://www.grey-box.ca/project-symmetry/

More information on our projects can be found on Riipen: https://app.riipen.com/companies/eLkDPPVl/projects

Outside Riipen, information can be found here: https://www.grey-box.ca/projects/




Mentorship

Weekly meetings with the scrum master and the project owner

Concrete experience in social innovation projects

Access to our documentation and to our previous project reports

Supported causes
Quality education

About the company

Company
Montreal, Quebec, Canada
2 - 10 employees
Education, Non-profit, philanthropic & civil society, IT & computing, Technology, Telecommunications

Grey-box is a social innovation startup. Its main product, Uni, provides wireless access to digital resources (such as Wikipedia, Khan Academy, several MOOC-type online courses, and medical databases) in areas where access to the internet or to electricity is unreliable.

While 50% of the world's population is not connected, and the COVID-19 situation is exacerbating these inequalities, our team is working on an accessible product (portable, energy-efficient, climate-resistant and, above all, under $100 per unit) that allows anyone to connect to these resources, which are essential for their development and autonomy.
