SlideShare a Scribd company logo
1 of 18
Download to read offline
Learning Context is All You Need for
Task-General Artificial Intelligence
(Shaka) Shih-Chia Chen
Founder/CEO
www.libgirl.com
Making Real AI - Series
● Task specific machine learning systems are brittle and sensitive to
■ Data distribution shifts
■ Task specification changes
● Such shifts and changes happen a lot in practical application
developments and operations
=>
■ Systems make mistakes
■ Manual tuning costs a lot
Problems of
Task-Specific AI/Machine Learning
See appendix for more information
● Task specific machine learning systems are brittle and sensitive to
■ Data distribution shifts
■ Task specification changes
● Such shifts and changes happen a lot in practical application
developments and operations
=>
■ Systems make mistakes
■ Manual tuning costs a lot
Problems of
Task-Specific AI/Machine Learning
We need
task-general AI
See appendix for more information
Thesis:
Learning context is all you need for
task-general AI
=
As long as a single machine learning model can learn to distinguish
unbounded amount of contexts and give output accordingly,
the model is a task-general AI.
In some definition, a task-general AI is also an Artificial General Intelligence
(AGI)
Task-General = Context Sensitive
Play chess
with me
Context
=
Task information
Robot
Input Observation
as
Output Action
Robot
playing chess
Clean the
table
Context
=
Task information
Robot
Input Observation
as
Output Action
Robot
cleaning table
Task-General = Context Sensitive
Complicated Context Sensitive =
Further Task-General
Clean the
table
Robot
cleaning itself
Context
=
Task information
Robot
Input Observation
as
Output Action
Imagine you
are a table to
be cleaned
Longer Historical Context =
Higher Context Sensitivity
Clean the
table
Context
=
Task information
Robot
Input Observation
as
Output Action
Imagine you
are a table to
be cleaned
Ignore my
next
sentence
Robot
cleaning table
Context Switch = Task Switch
Play chess
with me
Robot
playing chess
Context Switch = Task Switch
Play chess
with me
Robot
playing chess
Clean the
table
Robot
cleaning table
THEN
Related Works
● Task-general machine learning trials based on their
single context-sensitive models. (language tasks only)
○ OpenAI’s GPT-2 (Radford & Wu, 2019)
○ National Taiwan University’s LAMOL (Sun & Ho, 2019)
Related Works
● Learning to distinguish unbounded amount of contexts.
(long-range contextual dependencies)
(longer-historical contexts)
○ Google’s Reformer can learn the contextual relationship of
sequences up to 1 million words.
(Kitaev & Kaiser, 2020)
○ Dai & Yang (2019) proposed Transformer-XL, it can capture
sequential dependencies beyond a fixed-length context.
● Machine learning for learning context with longer spatiotemporal
dependency.
● To train a single model with complicated contextual data,
and then to demonstrate its stronger task-generality.
● Problems conquered
■ Data distribution shifts
■ Task specification changes
Let’s Do
Next
Follow our Making Real AI series
Let’s further investigate the following terminologies:
Task-specific VS. Task-general
AI VS. AGI
In the next slides.
Appendix
Dataset Shift and Software
Requirement Changes
“Machine learning systems now excel (in expectation) at tasks they
are trained for by using a combination of large datasets,
high-capacity models, and supervised learning (Krizhevsky et al.,
2012) (Sutskever et al., 2014) (Amodei et al., 2016).
Yet these systems are brittle and sensitive to slight changes in the
data distribution (Recht et al., 2018) and task specification
(Kirkpatrick et al., 2017).
Current systems are better characterized as narrow experts rather
than competent generalists.”
(Radford & Wu, 2019)
Dataset Shift and Software
Requirement Changes
● Dataset shift is present in most practical applications
(Quiñonero-Candela, 2009)
● “It is often more than 50% of the requirements are changed
before the completion of a software project.”
(Kotonya and Sommerville, 1998)
References
● Alec Radford, Jeffrey Wu, Rewon Child, David Luan, Dario Amodei, and Ilya Sutskever.
Language models are unsupervised multitask learners. OpenAI Blog, 2019.
● Sun, F.-K., Ho, C.-H., & Lee, H.-Y. (2019). LAMOL: LAnguage MOdeling for Lifelong Language
Learning. ArXiv.Org. https://arxiv.org/abs/1909.03329
● Kitaev, N., & Kaiser, Ł. (2020, January 16). Reformer: The Efficient Transformer. Google AI Blog.
https://ai.googleblog.com/2020/01/reformer-efficient-transformer.html
● Dai, Z., Yang, Z., Yang, Y., Carbonell, J., Le, Q. V., & Salakhutdinov, R. (2019). Transformer-XL:
Attentive Language Models Beyond a Fixed-Length Context. ArXiv.Org.
https://arxiv.org/abs/1901.02860
● Quiñonero-Candela, J. (2009). Dataset Shift In Machine Learning. Mit Press.
● Kotonya, G., & Sommerville, I. (1998). Requirements engineering :
processes and techniques. John Wiley & Sons.

More Related Content

Similar to Learning context is all you need for task general artificial intelligence

Deep Learning for Information Retrieval: Models, Progress, & Opportunities
Deep Learning for Information Retrieval: Models, Progress, & OpportunitiesDeep Learning for Information Retrieval: Models, Progress, & Opportunities
Deep Learning for Information Retrieval: Models, Progress, & OpportunitiesMatthew Lease
 
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...Dr. Haxel Consult
 
IET~DAVV STUDY MATERIALS SRS.docx
IET~DAVV STUDY MATERIALS SRS.docxIET~DAVV STUDY MATERIALS SRS.docx
IET~DAVV STUDY MATERIALS SRS.docxMr. Moms
 
Orchestration Graphs: Enabling Rich Learning Scenarios at Scale
Orchestration Graphs: Enabling Rich Learning Scenarios at ScaleOrchestration Graphs: Enabling Rich Learning Scenarios at Scale
Orchestration Graphs: Enabling Rich Learning Scenarios at ScaleStian Håklev
 
Introduction to Software - Coder Forge - John Mulhall
Introduction to Software - Coder Forge - John MulhallIntroduction to Software - Coder Forge - John Mulhall
Introduction to Software - Coder Forge - John MulhallJohn Mulhall
 
The Future is Big Graphs: A Community View on Graph Processing Systems
The Future is Big Graphs: A Community View on Graph Processing SystemsThe Future is Big Graphs: A Community View on Graph Processing Systems
The Future is Big Graphs: A Community View on Graph Processing SystemsNeo4j
 
ccna course 2
ccna course 2ccna course 2
ccna course 2S Sridhar
 
Lecture_01.1.pptx
Lecture_01.1.pptxLecture_01.1.pptx
Lecture_01.1.pptxRockyIslam5
 
Make Learning Big Data Work For You
Make Learning Big Data Work For YouMake Learning Big Data Work For You
Make Learning Big Data Work For YouJessie Chuang
 
Text Summarization and Conversion of Speech to Text
Text Summarization and Conversion of Speech to TextText Summarization and Conversion of Speech to Text
Text Summarization and Conversion of Speech to TextIRJET Journal
 
IRJET- QUEZARD : Question Wizard using Machine Learning and Artificial Intell...
IRJET- QUEZARD : Question Wizard using Machine Learning and Artificial Intell...IRJET- QUEZARD : Question Wizard using Machine Learning and Artificial Intell...
IRJET- QUEZARD : Question Wizard using Machine Learning and Artificial Intell...IRJET Journal
 
Apache Web Services in the Real World, an E-Science Perspective
Apache Web Services in the Real World, an E-Science PerspectiveApache Web Services in the Real World, an E-Science Perspective
Apache Web Services in the Real World, an E-Science PerspectiveSrinath Perera
 
Dialogue system②
Dialogue system②Dialogue system②
Dialogue system②Kent T
 
Fdp kavita pandey_automata
Fdp kavita pandey_automataFdp kavita pandey_automata
Fdp kavita pandey_automataKpandey
 
H2O World - Intro to Data Science with Erin Ledell
H2O World - Intro to Data Science with Erin LedellH2O World - Intro to Data Science with Erin Ledell
H2O World - Intro to Data Science with Erin LedellSri Ambati
 
1066_multitask_prompted_training_en.pdf
1066_multitask_prompted_training_en.pdf1066_multitask_prompted_training_en.pdf
1066_multitask_prompted_training_en.pdfssusere320ca
 

Similar to Learning context is all you need for task general artificial intelligence (20)

Deep Learning for Information Retrieval: Models, Progress, & Opportunities
Deep Learning for Information Retrieval: Models, Progress, & OpportunitiesDeep Learning for Information Retrieval: Models, Progress, & Opportunities
Deep Learning for Information Retrieval: Models, Progress, & Opportunities
 
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
 
GPSS interactive learning environment
GPSS interactive learning environmentGPSS interactive learning environment
GPSS interactive learning environment
 
IET~DAVV STUDY MATERIALS SRS.docx
IET~DAVV STUDY MATERIALS SRS.docxIET~DAVV STUDY MATERIALS SRS.docx
IET~DAVV STUDY MATERIALS SRS.docx
 
Orchestration Graphs: Enabling Rich Learning Scenarios at Scale
Orchestration Graphs: Enabling Rich Learning Scenarios at ScaleOrchestration Graphs: Enabling Rich Learning Scenarios at Scale
Orchestration Graphs: Enabling Rich Learning Scenarios at Scale
 
Introduction to Software - Coder Forge - John Mulhall
Introduction to Software - Coder Forge - John MulhallIntroduction to Software - Coder Forge - John Mulhall
Introduction to Software - Coder Forge - John Mulhall
 
GPSS interactive learning environment
GPSS interactive learning environmentGPSS interactive learning environment
GPSS interactive learning environment
 
The Future is Big Graphs: A Community View on Graph Processing Systems
The Future is Big Graphs: A Community View on Graph Processing SystemsThe Future is Big Graphs: A Community View on Graph Processing Systems
The Future is Big Graphs: A Community View on Graph Processing Systems
 
24 Reasons Why Variability Models Are Not Yet Universal (24RWVMANYU)
24 Reasons Why Variability Models Are Not Yet Universal (24RWVMANYU)24 Reasons Why Variability Models Are Not Yet Universal (24RWVMANYU)
24 Reasons Why Variability Models Are Not Yet Universal (24RWVMANYU)
 
ccna course 2
ccna course 2ccna course 2
ccna course 2
 
Lecture_01.1.pptx
Lecture_01.1.pptxLecture_01.1.pptx
Lecture_01.1.pptx
 
Make Learning Big Data Work For You
Make Learning Big Data Work For YouMake Learning Big Data Work For You
Make Learning Big Data Work For You
 
Data Structures & Algorithms
Data Structures & AlgorithmsData Structures & Algorithms
Data Structures & Algorithms
 
Text Summarization and Conversion of Speech to Text
Text Summarization and Conversion of Speech to TextText Summarization and Conversion of Speech to Text
Text Summarization and Conversion of Speech to Text
 
IRJET- QUEZARD : Question Wizard using Machine Learning and Artificial Intell...
IRJET- QUEZARD : Question Wizard using Machine Learning and Artificial Intell...IRJET- QUEZARD : Question Wizard using Machine Learning and Artificial Intell...
IRJET- QUEZARD : Question Wizard using Machine Learning and Artificial Intell...
 
Apache Web Services in the Real World, an E-Science Perspective
Apache Web Services in the Real World, an E-Science PerspectiveApache Web Services in the Real World, an E-Science Perspective
Apache Web Services in the Real World, an E-Science Perspective
 
Dialogue system②
Dialogue system②Dialogue system②
Dialogue system②
 
Fdp kavita pandey_automata
Fdp kavita pandey_automataFdp kavita pandey_automata
Fdp kavita pandey_automata
 
H2O World - Intro to Data Science with Erin Ledell
H2O World - Intro to Data Science with Erin LedellH2O World - Intro to Data Science with Erin Ledell
H2O World - Intro to Data Science with Erin Ledell
 
1066_multitask_prompted_training_en.pdf
1066_multitask_prompted_training_en.pdf1066_multitask_prompted_training_en.pdf
1066_multitask_prompted_training_en.pdf
 

Recently uploaded

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 

Recently uploaded (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 

Learning context is all you need for task general artificial intelligence

  • 1. Learning Context is All You Need for Task-General Artificial Intelligence (Shaka) Shih-Chia Chen Founder/CEO www.libgirl.com Making Real AI - Series
  • 2. ● Task specific machine learning systems are brittle and sensitive to ■ Data distribution shifts ■ Task specification changes ● Such shifts and changes happen a lot in practical application developments and operations => ■ Systems make mistakes ■ Manual tuning costs a lot Problems of Task-Specific AI/Machine Learning See appendix for more information
  • 3. ● Task specific machine learning systems are brittle and sensitive to ■ Data distribution shifts ■ Task specification changes ● Such shifts and changes happen a lot in practical application developments and operations => ■ Systems make mistakes ■ Manual tuning costs a lot Problems of Task-Specific AI/Machine Learning We need task-general AI See appendix for more information
  • 4. Thesis: Learning context is all you need for task-general AI = As long as a single machine learning model can learn to distinguish unbounded amount of contexts and give output accordingly, the model is a task-general AI. In some definition, a task-general AI is also an Artificial General Intelligence (AGI)
  • 5. Task-General = Context Sensitive Play chess with me Context = Task information Robot Input Observation as Output Action Robot playing chess
  • 6. Clean the table Context = Task information Robot Input Observation as Output Action Robot cleaning table Task-General = Context Sensitive
  • 7. Complicated Context Sensitive = Further Task-General Clean the table Robot cleaning itself Context = Task information Robot Input Observation as Output Action Imagine you are a table to be cleaned
  • 8. Longer Historical Context = Higher Context Sensitivity Clean the table Context = Task information Robot Input Observation as Output Action Imagine you are a table to be cleaned Ignore my next sentence Robot cleaning table
  • 9. Context Switch = Task Switch Play chess with me Robot playing chess
  • 10. Context Switch = Task Switch Play chess with me Robot playing chess Clean the table Robot cleaning table THEN
  • 11. Related Works ● Task-general machine learning trials based on their single context-sensitive models. (language tasks only) ○ OpenAI’s GPT-2 (Radford & Wu, 2019) ○ National Taiwan University’s LAMOL (Sun & Ho, 2019)
  • 12. Related Works ● Learning to distinguish unbounded amount of contexts. (long-range contextual dependencies) (longer-historical contexts) ○ Google’s Reformer can learn the contextual relationship of sequences up to 1 million words. (Kitaev & Kaiser, 2020) ○ Dai & Yang (2019) proposed Transformer-XL, it can capture sequential dependencies beyond a fixed-length context.
  • 13. ● Machine learning for learning context with longer spatiotemporal dependency. ● To train a single model with complicated contextual data, and then to demonstrate its stronger task-generality. ● Problems conquered ■ Data distribution shifts ■ Task specification changes Let’s Do
  • 14. Next Follow our Making Real AI series Let’s further investigate the following terminologies: Task-specific VS. Task-general AI VS. AGI In the next slides.
  • 16. Dataset Shift and Software Requirement Changes “Machine learning systems now excel (in expectation) at tasks they are trained for by using a combination of large datasets, high-capacity models, and supervised learning (Krizhevsky et al., 2012) (Sutskever et al., 2014) (Amodei et al., 2016). Yet these systems are brittle and sensitive to slight changes in the data distribution (Recht et al., 2018) and task specification (Kirkpatrick et al., 2017). Current systems are better characterized as narrow experts rather than competent generalists.” (Radford & Wu, 2019)
  • 17. Dataset Shift and Software Requirement Changes ● Dataset shift is present in most practical applications (Quiñonero-Candela, 2009) ● “It is often more than 50% of the requirements are changed before the completion of a software project.” (Kotonya and Sommerville, 1998)
  • 18. References ● Alec Radford, Jeffrey Wu, Rewon Child, David Luan, Dario Amodei, and Ilya Sutskever. Language models are unsupervised multitask learners. OpenAI Blog, 2019. ● Sun, F.-K., Ho, C.-H., & Lee, H.-Y. (2019). LAMOL: LAnguage MOdeling for Lifelong Language Learning. ArXiv.Org. https://arxiv.org/abs/1909.03329 ● Kitaev, N., & Kaiser, Ł. (2020, January 16). Reformer: The Efficient Transformer. Google AI Blog. https://ai.googleblog.com/2020/01/reformer-efficient-transformer.html ● Dai, Z., Yang, Z., Yang, Y., Carbonell, J., Le, Q. V., & Salakhutdinov, R. (2019). Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context. ArXiv.Org. https://arxiv.org/abs/1901.02860 ● Quiñonero-Candela, J. (2009). Dataset Shift In Machine Learning. Mit Press. ● Kotonya, G., & Sommerville, I. (1998). Requirements engineering : processes and techniques. John Wiley & Sons.