SlideShare a Scribd company logo
Daniela Braga, PhD
CEO
daniela@definedcrowd.com
DefinedCrowd: Crowdsourcing,
Speech Data Science, AI
CrowdsourcingWeek, June 15th 2017
definedcrowd confidential 3
Reason #1: machines need high quality data to learn
definedcrowd confidential 4
definedcrowd confidential 5
definedcrowd confidential 6
definedcrowd confidential 7
Reason #2: big data opportunity
definedcrowd confidential 8
Reason #2: big data
definedcrowd confidential 9
Reason #3: paradigm shift when teaching machines
definedcrowd confidential 10
definedcrowd confidential 12
definedcrowd confidential 13
definedcrowd confidential
DEMO
definedcrowd confidential 15
The challenges of crowdsourcing NLP data
Crowd quality Data quality
• Language tests
• Job specific tests
• Real Time Audits
• Built-in language/spam
validators
• Referral system
• System of tokens
• Legal/privacy compliance
(under NDA)
Quality
gateways
Controlled
crowd
• Checking for suspicious
crowd behavior (multiple
accounts creation, peaks of
activity, specific job spam, IP
check against country of
living)
Machine
Learning
Data
quality
control
• Validation steps
• Inter-annotator
agreements
• Precision and Recall
metrics
definedcrowd confidential 16
DefinedCrowd combines the best of professional
services with SaaS companies
definedcrowd confidential 17
FOLLOWUSON
or sendme an email to daniela@definedcrowd.com
Learn more at definedcrowd.com

More Related Content

Similar to Crowdsourcing Speech Data Science and AI

Threat Modeling Using STRIDE
Threat Modeling Using STRIDEThreat Modeling Using STRIDE
Threat Modeling Using STRIDE
Girindro Pringgo Digdo
 
Turning Big Data into More Effective Customer Experiences
Turning Big Data into More Effective Customer ExperiencesTurning Big Data into More Effective Customer Experiences
Turning Big Data into More Effective Customer Experiences
NG DATA
 
Anne-Sophie Roessler, International Business Developer, Dataiku - "3 ways to ...
Anne-Sophie Roessler, International Business Developer, Dataiku - "3 ways to ...Anne-Sophie Roessler, International Business Developer, Dataiku - "3 ways to ...
Anne-Sophie Roessler, International Business Developer, Dataiku - "3 ways to ...
Dataconomy Media
 
Ensuring Data Quality in Databricks Unleashing the Power of Great Expectation...
Ensuring Data Quality in Databricks Unleashing the Power of Great Expectation...Ensuring Data Quality in Databricks Unleashing the Power of Great Expectation...
Ensuring Data Quality in Databricks Unleashing the Power of Great Expectation...
Knoldus Inc.
 
The New Trillium DQ: Big Data Insights When and Where You Need Them
The New Trillium DQ: Big Data Insights When and Where You Need ThemThe New Trillium DQ: Big Data Insights When and Where You Need Them
The New Trillium DQ: Big Data Insights When and Where You Need Them
Precisely
 
Bridging the Gap: Analyzing Data in and Below the Cloud
Bridging the Gap: Analyzing Data in and Below the CloudBridging the Gap: Analyzing Data in and Below the Cloud
Bridging the Gap: Analyzing Data in and Below the Cloud
Inside Analysis
 
Down to Business: Taking Action Quickly with Linked Data Services
Down to Business: Taking Action Quickly with Linked Data ServicesDown to Business: Taking Action Quickly with Linked Data Services
Down to Business: Taking Action Quickly with Linked Data Services
Inside Analysis
 
Generating actionable consumer insights from analytics - Telekom R&D
Generating actionable consumer insights from analytics - Telekom R&DGenerating actionable consumer insights from analytics - Telekom R&D
Generating actionable consumer insights from analytics - Telekom R&DMerlien Institute
 
Chanchal Chatterjee PARTNERS 2017 Oct24
Chanchal Chatterjee PARTNERS 2017 Oct24Chanchal Chatterjee PARTNERS 2017 Oct24
Chanchal Chatterjee PARTNERS 2017 Oct24
Chanchal Chatterjee
 
Neo4j GraphDay Seattle- Sept19- Connected data imperative
Neo4j GraphDay Seattle- Sept19- Connected data imperativeNeo4j GraphDay Seattle- Sept19- Connected data imperative
Neo4j GraphDay Seattle- Sept19- Connected data imperative
Neo4j
 
DataSpryng Overview
DataSpryng OverviewDataSpryng Overview
DataSpryng Overview
jkvr
 
Supercharging AI with Data Enrichment
Supercharging AI with Data EnrichmentSupercharging AI with Data Enrichment
Supercharging AI with Data Enrichment
Precisely
 
ADV Slides: Increasing Artificial Intelligence Success with Master Data Manag...
ADV Slides: Increasing Artificial Intelligence Success with Master Data Manag...ADV Slides: Increasing Artificial Intelligence Success with Master Data Manag...
ADV Slides: Increasing Artificial Intelligence Success with Master Data Manag...
DATAVERSITY
 
Jeffrey Ricker - "Big Data Governance"
Jeffrey Ricker - "Big Data Governance"Jeffrey Ricker - "Big Data Governance"
Jeffrey Ricker - "Big Data Governance"
Lviv Startup Club
 
Enhance Bottom-Line Efficiency with Customized Offline Data Capture Solutions
Enhance Bottom-Line Efficiency with Customized Offline Data Capture SolutionsEnhance Bottom-Line Efficiency with Customized Offline Data Capture Solutions
Enhance Bottom-Line Efficiency with Customized Offline Data Capture Solutions
Andrew Leo
 
Technical track chris calvert-1 30 pm-issa conference-calvert
Technical track chris calvert-1 30 pm-issa conference-calvertTechnical track chris calvert-1 30 pm-issa conference-calvert
Technical track chris calvert-1 30 pm-issa conference-calvert
ISSA LA
 
Data lineage
Data lineageData lineage
Data lineage
GirishLingappa
 
Big data governance
Big data governanceBig data governance
Big data governance
Jeffrey Ricker
 
Big Data Matching - How to Find Two Similar Needles in a Really Big Haystack
Big Data Matching - How to Find Two Similar Needles in a Really Big HaystackBig Data Matching - How to Find Two Similar Needles in a Really Big Haystack
Big Data Matching - How to Find Two Similar Needles in a Really Big Haystack
Precisely
 
Graph Thinking: Why it Matters
Graph Thinking: Why it MattersGraph Thinking: Why it Matters
Graph Thinking: Why it Matters
Neo4j
 

Similar to Crowdsourcing Speech Data Science and AI (20)

Threat Modeling Using STRIDE
Threat Modeling Using STRIDEThreat Modeling Using STRIDE
Threat Modeling Using STRIDE
 
Turning Big Data into More Effective Customer Experiences
Turning Big Data into More Effective Customer ExperiencesTurning Big Data into More Effective Customer Experiences
Turning Big Data into More Effective Customer Experiences
 
Anne-Sophie Roessler, International Business Developer, Dataiku - "3 ways to ...
Anne-Sophie Roessler, International Business Developer, Dataiku - "3 ways to ...Anne-Sophie Roessler, International Business Developer, Dataiku - "3 ways to ...
Anne-Sophie Roessler, International Business Developer, Dataiku - "3 ways to ...
 
Ensuring Data Quality in Databricks Unleashing the Power of Great Expectation...
Ensuring Data Quality in Databricks Unleashing the Power of Great Expectation...Ensuring Data Quality in Databricks Unleashing the Power of Great Expectation...
Ensuring Data Quality in Databricks Unleashing the Power of Great Expectation...
 
The New Trillium DQ: Big Data Insights When and Where You Need Them
The New Trillium DQ: Big Data Insights When and Where You Need ThemThe New Trillium DQ: Big Data Insights When and Where You Need Them
The New Trillium DQ: Big Data Insights When and Where You Need Them
 
Bridging the Gap: Analyzing Data in and Below the Cloud
Bridging the Gap: Analyzing Data in and Below the CloudBridging the Gap: Analyzing Data in and Below the Cloud
Bridging the Gap: Analyzing Data in and Below the Cloud
 
Down to Business: Taking Action Quickly with Linked Data Services
Down to Business: Taking Action Quickly with Linked Data ServicesDown to Business: Taking Action Quickly with Linked Data Services
Down to Business: Taking Action Quickly with Linked Data Services
 
Generating actionable consumer insights from analytics - Telekom R&D
Generating actionable consumer insights from analytics - Telekom R&DGenerating actionable consumer insights from analytics - Telekom R&D
Generating actionable consumer insights from analytics - Telekom R&D
 
Chanchal Chatterjee PARTNERS 2017 Oct24
Chanchal Chatterjee PARTNERS 2017 Oct24Chanchal Chatterjee PARTNERS 2017 Oct24
Chanchal Chatterjee PARTNERS 2017 Oct24
 
Neo4j GraphDay Seattle- Sept19- Connected data imperative
Neo4j GraphDay Seattle- Sept19- Connected data imperativeNeo4j GraphDay Seattle- Sept19- Connected data imperative
Neo4j GraphDay Seattle- Sept19- Connected data imperative
 
DataSpryng Overview
DataSpryng OverviewDataSpryng Overview
DataSpryng Overview
 
Supercharging AI with Data Enrichment
Supercharging AI with Data EnrichmentSupercharging AI with Data Enrichment
Supercharging AI with Data Enrichment
 
ADV Slides: Increasing Artificial Intelligence Success with Master Data Manag...
ADV Slides: Increasing Artificial Intelligence Success with Master Data Manag...ADV Slides: Increasing Artificial Intelligence Success with Master Data Manag...
ADV Slides: Increasing Artificial Intelligence Success with Master Data Manag...
 
Jeffrey Ricker - "Big Data Governance"
Jeffrey Ricker - "Big Data Governance"Jeffrey Ricker - "Big Data Governance"
Jeffrey Ricker - "Big Data Governance"
 
Enhance Bottom-Line Efficiency with Customized Offline Data Capture Solutions
Enhance Bottom-Line Efficiency with Customized Offline Data Capture SolutionsEnhance Bottom-Line Efficiency with Customized Offline Data Capture Solutions
Enhance Bottom-Line Efficiency with Customized Offline Data Capture Solutions
 
Technical track chris calvert-1 30 pm-issa conference-calvert
Technical track chris calvert-1 30 pm-issa conference-calvertTechnical track chris calvert-1 30 pm-issa conference-calvert
Technical track chris calvert-1 30 pm-issa conference-calvert
 
Data lineage
Data lineageData lineage
Data lineage
 
Big data governance
Big data governanceBig data governance
Big data governance
 
Big Data Matching - How to Find Two Similar Needles in a Really Big Haystack
Big Data Matching - How to Find Two Similar Needles in a Really Big HaystackBig Data Matching - How to Find Two Similar Needles in a Really Big Haystack
Big Data Matching - How to Find Two Similar Needles in a Really Big Haystack
 
Graph Thinking: Why it Matters
Graph Thinking: Why it MattersGraph Thinking: Why it Matters
Graph Thinking: Why it Matters
 

More from Crowdsourcing Week

Crowdsourcing à la sbv IMPROVER: the challenge of being your own client
Crowdsourcing à la sbv IMPROVER: the challenge of being your own clientCrowdsourcing à la sbv IMPROVER: the challenge of being your own client
Crowdsourcing à la sbv IMPROVER: the challenge of being your own client
Crowdsourcing Week
 
Transforming the Global Payments Operation
Transforming the Global Payments OperationTransforming the Global Payments Operation
Transforming the Global Payments Operation
Crowdsourcing Week
 
Crowdsourced to Outsourced: How online platforms are shaping the future of work
Crowdsourced to Outsourced: How online platforms are shaping the future of workCrowdsourced to Outsourced: How online platforms are shaping the future of work
Crowdsourced to Outsourced: How online platforms are shaping the future of work
Crowdsourcing Week
 
Malasya's Experience in Crowd Labour and Sharing Economy
Malasya's Experience in Crowd Labour and Sharing EconomyMalasya's Experience in Crowd Labour and Sharing Economy
Malasya's Experience in Crowd Labour and Sharing Economy
Crowdsourcing Week
 
LM Industries: Harnessing The Power of Crowdsourced Innovation to Build the F...
LM Industries: Harnessing The Power of Crowdsourced Innovation to Build the F...LM Industries: Harnessing The Power of Crowdsourced Innovation to Build the F...
LM Industries: Harnessing The Power of Crowdsourced Innovation to Build the F...
Crowdsourcing Week
 
Human Collective Intelligence: the future of corporate innovation
Human Collective Intelligence: the future of corporate innovationHuman Collective Intelligence: the future of corporate innovation
Human Collective Intelligence: the future of corporate innovation
Crowdsourcing Week
 
9 Ways to Ruin Your Open Innovation Challenge
9 Ways to Ruin Your Open Innovation Challenge9 Ways to Ruin Your Open Innovation Challenge
9 Ways to Ruin Your Open Innovation Challenge
Crowdsourcing Week
 
Disruptive Crowdsourcing
Disruptive CrowdsourcingDisruptive Crowdsourcing
Disruptive Crowdsourcing
Crowdsourcing Week
 
Accelerating Hardware Development: Ideation and Engineering
Accelerating Hardware Development: Ideation and EngineeringAccelerating Hardware Development: Ideation and Engineering
Accelerating Hardware Development: Ideation and Engineering
Crowdsourcing Week
 
Attracting and Retaining Top Partners with a Best-in-Class Payments Experience
Attracting and Retaining Top Partners with a Best-in-Class Payments ExperienceAttracting and Retaining Top Partners with a Best-in-Class Payments Experience
Attracting and Retaining Top Partners with a Best-in-Class Payments Experience
Crowdsourcing Week
 
Crowdsourcing Disaster Relief
Crowdsourcing Disaster ReliefCrowdsourcing Disaster Relief
Crowdsourcing Disaster Relief
Crowdsourcing Week
 
Core + Crowd: Why (and how) crowdsourcing is about to become mainstream
Core + Crowd: Why (and how) crowdsourcing is about to become mainstreamCore + Crowd: Why (and how) crowdsourcing is about to become mainstream
Core + Crowd: Why (and how) crowdsourcing is about to become mainstream
Crowdsourcing Week
 
Smart and Secure Cities and Communities
Smart and Secure Cities and Communities Smart and Secure Cities and Communities
Smart and Secure Cities and Communities
Crowdsourcing Week
 
How Successful Crowdsourcing Depends on asking 'Interesting Questions'
How Successful Crowdsourcing Depends on asking 'Interesting Questions'How Successful Crowdsourcing Depends on asking 'Interesting Questions'
How Successful Crowdsourcing Depends on asking 'Interesting Questions'
Crowdsourcing Week
 
Contestant Centered Design: creative approaches to designing competitions
Contestant Centered Design: creative approaches to designing competitionsContestant Centered Design: creative approaches to designing competitions
Contestant Centered Design: creative approaches to designing competitions
Crowdsourcing Week
 
How Crypto can Monetize Crowdsourcing
How Crypto can Monetize CrowdsourcingHow Crypto can Monetize Crowdsourcing
How Crypto can Monetize Crowdsourcing
Crowdsourcing Week
 
A New Report on the State of Open Innovation and What it Means For you
A New Report on the State of Open Innovation and What it Means For youA New Report on the State of Open Innovation and What it Means For you
A New Report on the State of Open Innovation and What it Means For you
Crowdsourcing Week
 
Expert Operating System: Business On-Demand
Expert Operating System: Business On-DemandExpert Operating System: Business On-Demand
Expert Operating System: Business On-Demand
Crowdsourcing Week
 
Crowdsourcing: Changing the Faces of Innovation at NASA
Crowdsourcing: Changing the Faces of Innovation at NASACrowdsourcing: Changing the Faces of Innovation at NASA
Crowdsourcing: Changing the Faces of Innovation at NASA
Crowdsourcing Week
 
Crowdfunding an ICO Without Getting In Trouble
Crowdfunding an ICO Without Getting In TroubleCrowdfunding an ICO Without Getting In Trouble
Crowdfunding an ICO Without Getting In Trouble
Crowdsourcing Week
 

More from Crowdsourcing Week (20)

Crowdsourcing à la sbv IMPROVER: the challenge of being your own client
Crowdsourcing à la sbv IMPROVER: the challenge of being your own clientCrowdsourcing à la sbv IMPROVER: the challenge of being your own client
Crowdsourcing à la sbv IMPROVER: the challenge of being your own client
 
Transforming the Global Payments Operation
Transforming the Global Payments OperationTransforming the Global Payments Operation
Transforming the Global Payments Operation
 
Crowdsourced to Outsourced: How online platforms are shaping the future of work
Crowdsourced to Outsourced: How online platforms are shaping the future of workCrowdsourced to Outsourced: How online platforms are shaping the future of work
Crowdsourced to Outsourced: How online platforms are shaping the future of work
 
Malasya's Experience in Crowd Labour and Sharing Economy
Malasya's Experience in Crowd Labour and Sharing EconomyMalasya's Experience in Crowd Labour and Sharing Economy
Malasya's Experience in Crowd Labour and Sharing Economy
 
LM Industries: Harnessing The Power of Crowdsourced Innovation to Build the F...
LM Industries: Harnessing The Power of Crowdsourced Innovation to Build the F...LM Industries: Harnessing The Power of Crowdsourced Innovation to Build the F...
LM Industries: Harnessing The Power of Crowdsourced Innovation to Build the F...
 
Human Collective Intelligence: the future of corporate innovation
Human Collective Intelligence: the future of corporate innovationHuman Collective Intelligence: the future of corporate innovation
Human Collective Intelligence: the future of corporate innovation
 
9 Ways to Ruin Your Open Innovation Challenge
9 Ways to Ruin Your Open Innovation Challenge9 Ways to Ruin Your Open Innovation Challenge
9 Ways to Ruin Your Open Innovation Challenge
 
Disruptive Crowdsourcing
Disruptive CrowdsourcingDisruptive Crowdsourcing
Disruptive Crowdsourcing
 
Accelerating Hardware Development: Ideation and Engineering
Accelerating Hardware Development: Ideation and EngineeringAccelerating Hardware Development: Ideation and Engineering
Accelerating Hardware Development: Ideation and Engineering
 
Attracting and Retaining Top Partners with a Best-in-Class Payments Experience
Attracting and Retaining Top Partners with a Best-in-Class Payments ExperienceAttracting and Retaining Top Partners with a Best-in-Class Payments Experience
Attracting and Retaining Top Partners with a Best-in-Class Payments Experience
 
Crowdsourcing Disaster Relief
Crowdsourcing Disaster ReliefCrowdsourcing Disaster Relief
Crowdsourcing Disaster Relief
 
Core + Crowd: Why (and how) crowdsourcing is about to become mainstream
Core + Crowd: Why (and how) crowdsourcing is about to become mainstreamCore + Crowd: Why (and how) crowdsourcing is about to become mainstream
Core + Crowd: Why (and how) crowdsourcing is about to become mainstream
 
Smart and Secure Cities and Communities
Smart and Secure Cities and Communities Smart and Secure Cities and Communities
Smart and Secure Cities and Communities
 
How Successful Crowdsourcing Depends on asking 'Interesting Questions'
How Successful Crowdsourcing Depends on asking 'Interesting Questions'How Successful Crowdsourcing Depends on asking 'Interesting Questions'
How Successful Crowdsourcing Depends on asking 'Interesting Questions'
 
Contestant Centered Design: creative approaches to designing competitions
Contestant Centered Design: creative approaches to designing competitionsContestant Centered Design: creative approaches to designing competitions
Contestant Centered Design: creative approaches to designing competitions
 
How Crypto can Monetize Crowdsourcing
How Crypto can Monetize CrowdsourcingHow Crypto can Monetize Crowdsourcing
How Crypto can Monetize Crowdsourcing
 
A New Report on the State of Open Innovation and What it Means For you
A New Report on the State of Open Innovation and What it Means For youA New Report on the State of Open Innovation and What it Means For you
A New Report on the State of Open Innovation and What it Means For you
 
Expert Operating System: Business On-Demand
Expert Operating System: Business On-DemandExpert Operating System: Business On-Demand
Expert Operating System: Business On-Demand
 
Crowdsourcing: Changing the Faces of Innovation at NASA
Crowdsourcing: Changing the Faces of Innovation at NASACrowdsourcing: Changing the Faces of Innovation at NASA
Crowdsourcing: Changing the Faces of Innovation at NASA
 
Crowdfunding an ICO Without Getting In Trouble
Crowdfunding an ICO Without Getting In TroubleCrowdfunding an ICO Without Getting In Trouble
Crowdfunding an ICO Without Getting In Trouble
 

Recently uploaded

一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
u86oixdj
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
apvysm8
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
roli9797
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
balafet
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
rwarrenll
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTESAdjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Subhajit Sahu
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
vikram sood
 
Nanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdfNanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdf
eddie19851
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 

Recently uploaded (20)

一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTESAdjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
 
Nanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdfNanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdf
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 

Crowdsourcing Speech Data Science and AI

Editor's Notes

  1. Hi, my name is Daniela Braga and I am the founder and CEO of DefinedCrowd. Some say that entrepreneurship is like jumping out of a plane without a parachute and building one on the way down.  Many people ask me why did I leave my comfortable corporate job to start this company and to live in the last 2 years a life of excitement but hardship and uncertainty. I usually say that it was because of 3 reasons.
  2. I have 17 years of experience in the field of NLP, HCI, or what is now called, AI. In the last 7 years I moved to data science roles where I was responsible to collect and structure data for the scientists to train speech and language models. I’ve worked at some point with 50 languages in parallel. And independently of the method I was using to do this – inhouse or using vendors, the challenge was always getting high quality consistent data.
  3. https://techcrunch.com/2016/03/24/microsoft-silences-its-new-a-i-bot-tay-after-twitter-users-teach-it-racism/
  4. Each day, humans create 3 stacks of Empire State Buildings of data. 90% of that data is unstructured, but machines need structured data to learn. Now, only a few data science companies are looking at the data problem at an enterprise level. We are solving the data problems.
  5. When I started my career, we would build a lot of the dialogue systems components using a system of rules. During my PhD, I built a TTS system mainly based out of rules for the Portuguese language.
  6. But with the recent advances of data science, something changed in the way we teach machines. They don’t learn with a few rules anymore. They need machine learning models and LOTS and LOTs of training data.
  7. Introducing DefinedCrowd. Our platform allows data scientists to collect, enrich and structure high quality training data at scale.
  8. We do so, by combining tools, humans-in-the-loop and machine learning models into specialized AI workflows.
  9. Anna comes to the DefinedCrowd platform and goes to the “Build” tab. Here she picks this “multimodal data” workflow. Then a workflow assistant is displayed, with numbered steps that will light up as she is configuring the settings. First, she will configure the crowd setting such as language, gender, age and country of living. Then recording setup: at home and in a quiet environment, with microphones placed at 40 inches. And she will upload the scenario instructions. Next, she will configure the video settings and disable the audio and the other signals this time. Because she wants to understand emotion correlated with vital signals, she will pick heart and respiratory rates. All looking good, the calculator tells her it’s going to cost this, so she will push the submit button. The campaign is now finished and she’s going to check the results of her collection. Here she can see things like how many hours are completed, crowd demographics, timeline, quality, etc. Finally, Anna will send this data for validation and annotation, which are different workflows that can be found in the Build tab that we’ve seen before. To review: DefinedCrowd allowed Anna to extend her data science capacity by giving her a shopping tool for sophisticated and meaningful data set.  
  10. Existent SaaS platforms, like Crowdflower and Mturk, can’t maintain quality (they are 50% less accurate) and require 6 months of iterations to perfect a process. Traditional professional services, like Appen or Isoftstone, have higher quality but have no control over the process. DefinedCrowd brings the best of both worlds: the high quality and reach of the professional services combined with the scale and control of the SaaS platforms.
  11. We’re a Seattle-based company with R&D in PT. We’ve been partnering and servicing the biggest players in AI, mainly fortune 500 companies, in the US, Japan and Europe.
  12. We’re DefinedCrowd and we’re making machines smarter! If you’d like to try our platform, go to definedcrowd.ai/disrupt or shoot me an email!