Wikipedia's Content Translation through Machine Learning (Project Symmetry)
Project scope
Categories
Software development Machine learning Artificial intelligence Databases EducationSkills
github agile methodology machine learning french language english language innovationWHO: Grey-box is social innovation startup that is working on providing free wifi access to educational content in areas where internet and electricity access is problematic.
WHY: 50% of the world lives offline – they haven’t connected to the internet in the last 3 months. In Africa, the average cost for 1GB of data is 7% of the average monthly salary. Even in North America, 50% of rural Canada do not have access to the minimum speed for internet. We believe that this is a solvable problem.
HOW: In a nutshell, for the current project (codename: symmetry), we want to translate content from one language in Wikipedia to another (for exemple, improve Molière English Wikipedia article based on content available only on the french page - and vice versa). The idea is to improve the overall Wikipedia content, especially in underrepresented langages.
Grey-box is a nonprofit structured under the principle of mobile-first and remote-first. Diversity is not only encouraged, it is an integral part of the strength of the team and its projects
Within the context of an agile methodology, the learner will be tasked to perform the following:
- Possible options and literature review
- Proposed a solution and a schedule
- Data set acquisition
- Data Cleaning
- Neural network training
- Presentation of the prototype 1
- UX/UI improvements to the interface
- Content creation for social media (methodology and objectives)
- Testing with Wikipedia Community 1
- Presentation of the prototype 2
- Testing with Wikipedia Community 2
- Presentation of the MVP to the team
- Content creation for social media (prototypes and implications)
- Documentation and project GitHub closing
More information on this project can be found here: https://www.grey-box.ca/project-symmetry/
More projects information can be found on Riipen : https://app.riipen.com/companies/eLkDPPVl/projects
Outside Riipen, information can be found here: https://www.grey-box.ca/projects/
Weekly meetings with scrum and project owner
Concrete experience in social innovation projects
Access to our documentation and to our previous project reports
Supported causes
Quality educationAbout the company
Grey-box is a social innovation startup. Its main product, Uni, provides wireless access to digital resources (such as Wikipedia, Khan academy, several MOOC-type online courses, medical databases) in areas where access to the internet or to electricity is unreliable.
While 50% of the world's population is not connected - and the COVID-19 situation is exacerbating these inequalities - our team is working on an accessible product (portable, energy efficient, climate resistant and, above all, less than 100 $ the unit) which allows anyone to connect to these essential resources for their development and autonomy.
Grey-box is a nonprofit structured under the principle of mobile-first and remote-first. Diversity is not only encouraged, it is an integral part of the strength of the team and its projects