Professional Course

Data Creation and Collection for Artificial Intelligence via Crowdsourcing

edX, Online

Length

6 weeks

Course description

Advances in Artificial Intelligence and Machine Learning have led to technological revolutions. Yet the AI systems at the forefront of these innovations have become the center of growing concerns. These include reports of systems failing when conditions differ only slightly from those seen during training, as well as ethical and societal questions raised by their use.

Machine learning models have been criticized for lacking robustness, fairness and transparency. Such model-related problems can largely be attributed to issues with data. To learn comprehensive, fine-grained and unbiased patterns, models have to be trained on a large number of high-quality data instances whose distribution accurately represents real application scenarios. Creating such data is not only a long, laborious and expensive process, but is sometimes impossible when the data is extremely imbalanced or the distribution constantly evolves over time.

This course will introduce an important method that can be used to gather data for training machine learning models and building AI systems. Crowdsourcing offers a viable means of leveraging human intelligence at scale for data creation, enrichment and interpretation with great potential to improve the performance of AI systems and increase the wider adoption of AI in general.

By the end of this course you will be able to understand and apply crowdsourcing methods to elicit human input as a means of gathering high-quality data for machine learning. You will be able to identify biases that arise in datasets from the way they are gathered or created, and to choose task designs that optimize data quality. These skills are essential for careers in Data Science, Machine Learning, and the broader field of Artificial Intelligence.

Who should attend?

Prerequisites

Some prior experience with a programming language (e.g. Python, Java) is recommended but not required.

Training content

Week 1: Crowdsourcing for High-quality Data Collection and The ImageNet Story

Artificial Intelligence is at the center of many recent advancements across areas such as transportation and finance. One of the reasons for this is that in the past decade we have designed methods to harness human intelligence at scale.

We will introduce and discuss the crowdsourcing paradigm and the importance of high-quality data.

Topics we will cover this week:

The intuition behind crowdsourcing
The role of crowdsourcing platforms
The need for high-quality data for AI models
What is ImageNet, the gap it filled, and how it was built

Week 2: Quality Control Mechanisms for Crowdsourcing

The quality of crowdsourced human input is one of the most crucial aspects affecting the overall value of the paradigm. In this week we will discuss the challenges that make quality control difficult to guarantee.

Topics we will cover this week:

Workers' motives and behaviors
Quality control mechanisms in crowdsourcing
Incentives in crowdsourcing (like gamification)
Cognitive aspects and psychometric methods
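
As a rough illustration of one widely used quality control mechanism, the same item can be given to several workers and their answers aggregated by majority vote. The sketch below is not taken from the course material; the function name `majority_vote`, the item IDs and the labels are all hypothetical, and real deployments often weight workers by estimated reliability instead.

```python
from collections import Counter


def majority_vote(labels_by_item):
    """Aggregate redundant worker labels for each item.

    Returns {item_id: (winning_label, agreement)}, where agreement is the
    fraction of workers who chose the winning label.
    """
    results = {}
    for item_id, labels in labels_by_item.items():
        counts = Counter(labels)
        # most_common(1) gives the label with the highest count; ties are
        # resolved in favor of the label that was seen first.
        label, votes = counts.most_common(1)[0]
        results[item_id] = (label, votes / len(labels))
    return results


# Hypothetical answers from three workers per image.
answers = {
    "img_1": ["cat", "cat", "dog"],
    "img_2": ["dog", "dog", "dog"],
}
aggregated = majority_vote(answers)
```

Items with low agreement (here `img_1`) are natural candidates for extra judgments or expert review.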

Week 3: Factors Affecting Quality in Crowdsourcing

Researchers and practitioners in human computation and crowdsourcing have identified several factors that affect the quality of crowdsourced data. In this week we will discuss recent work on these factors.

Topics we will cover this week:

Tradeoff between task pricing and quality of output
The role of workers' demographics, qualifications and skills
The importance of task clarity and work environments
The concepts of task packaging, task framing and task priming

Week 4: Human Input for Data Creation and Model Evaluation in AI

In this week, we will cover the importance of data collection, annotation and engineering.

Topics we will cover this week:

The importance of data collection
Data generation
The role of crowdsourcing in advanced machine learning
Taxonomy of microtasks

Week 5: Reducing Worker Effort: Active Learning

In this week we explore the challenges of collecting large-scale data and how to overcome them.

Topics we will cover this week:

Approaches to reducing worker effort
The implications of reducing labeling effort
The key idea of active learning
Query strategies for selecting informative instances
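
To make the idea of a query strategy concrete, here is a minimal sketch of least-confidence uncertainty sampling, one common strategy in active learning: the instances the current model is least sure about are sent to workers first. This is an assumption about what such a strategy might look like, not the course's own code; `least_confident` and the probability values are hypothetical.

```python
def least_confident(probabilities, k=1):
    """Return the indices of the k instances with the lowest top-class probability.

    `probabilities` is a list of per-instance class probability lists, as a
    classifier might produce for an unlabeled pool.
    """
    confidence = [max(p) for p in probabilities]
    # Sort instance indices from least to most confident.
    ranked = sorted(range(len(probabilities)), key=lambda i: confidence[i])
    return ranked[:k]


# Hypothetical model outputs for three unlabeled instances (two classes).
pool_probs = [
    [0.95, 0.05],
    [0.55, 0.45],
    [0.70, 0.30],
]
to_label = least_confident(pool_probs, k=2)
```

Only the selected instances are sent out for labeling, which is how active learning aims to reduce worker effort.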

Week 6: Interpreting, Evaluating, and Debugging ML models

In this week, we discuss strategies for evaluating, debugging, and interpreting machine learning models.

Topics we will cover this week:

The notion of model interpretability
The role of humans in the interpretability process
Debugging ML pipelines and related challenges

Course delivery details

This course is offered through Delft University of Technology, a partner institute of edX.

4-5 hours per week

Costs

Verified Track - $149
Audit Track - Free

Certification / Credits

What you'll learn

At the end of this course you will be able to:

Examine the use of crowdsourcing for gathering data
Explain how cognitive biases and other human factors influence data quality
Describe the use of active learning in the creation of crowdsourced training data
Demonstrate the design of crowdsourcing tasks with quality control mechanisms
Discuss the evaluation of ML models with humans in the loop

edX

141 Portland Street

02139 Cambridge Massachusetts

617-440-9808

edx.business

edX For Business helps leading companies upskill their labor forces by making the world’s greatest educational resources available to learners across a wide variety of in-demand fields. edX For Business delivers high-quality corporate eLearning to train and engage your employees...
