SlideShare a Scribd company logo
1 of 20
GoogleIO @ Java Meetup Ahmedabad
Build next generation apps with eyes and ears using
Google Chrome
25th
Jun, 2014
Dhruv Gohil
From Ishi systems
Treat your users as human!
UX - 101
Tools should fit human not the other way around!
Now we have books on UX!
• With computers, we lost 'human touch' in
interaction
• Touch screens brought it back, but frankly its not
the natural way to do 'things'
Let's get back to basics..
• I have to tell you something.. I will speak and you
will listen.
• So why it has to be different for computers?!
Enter the new era!
• Now browsers can speak to you.
• And browsers can hear you too!
http://updates.html5rocks.com/2013/01/Voice-Driven-Web-Apps-Introduction-to-the-Web-Speech-API
Demo time!
https://www.google.com/intl/en/chrome/demos/speech.html
Do it yourself!
You talk too much...
● It's time to listen..
● Yes! That's all we need!
Off-line mode too!
http://syl22-00.github.io/pocketsphinx.js/
This is how translation should work!
http://www.moreawesomeweb.com/demos/speech_translate.html
Command mode!
● I am married.... ammm.. Can I command my computer to
do things?
https://www.talater.com/annyang/ https://github.com/TalAter/annyang
But can you see me?
http://wolframhempel.github.io/photobooth-js/
Ok but how its useful?
http://revealjs.herokuapp.com/
Time to demo!, again.! :-)
https://github.com/willy-vvu/reveal.js
What was that?
● Magic of 'Motion detection algorithms'
● Over simplified How To:
– Capture video [getUserMedia()]
– Draw frames [ html5 <canvas> ]
– Run Algorithm to detect change between frames [ .js]
● If you are deeply interested
– Read : http://labs.littlemissrobot.com/behind-the-mirror/
http://www.adobe.com/devnet/html5/articles/javascript-motion-detection.html
More!
● https://github.com/foo123/HAAR.js#live-examples
– Interactive Face Detection
– Many Faces Detection
– Interactive Mouth Detection
● http://www.soundstep.com/blog/experiments/jstracking/
– Motion tracking
● http://www.soundstep.com/blog/experiments/jsdetection/
– The Magical Xylophone
But Why chrome?
Its time to...
WebSpeech API fundametals
● Audio capture
● Request authorization
● Audio processing pip line
– Nodes (FFT , Gain, Filter...)
– http://chimera.labs.oreilly.com/books/1234000001552/
● Join to destination (Speakers)
● + Visualizations
GetUserMedia API fundamentals
● Audio + Video capture
● Request authorization
● Capture
● Draw (Hidden)
● Analyze + Process
● Re-Draw
And finally...
लगता है बर्फर िगर रही है!
https://github.com/IonicaBizau/WebcamSnowhttp://ionicabizau.net/WebcamSnow/

More Related Content

Similar to Build next generation apps with eyes and ears using Google Chrome

You Can UX Too: Avoiding the Programmer's User Interface @ NEPHP 2013
You Can UX Too: Avoiding the Programmer's User Interface @ NEPHP 2013You Can UX Too: Avoiding the Programmer's User Interface @ NEPHP 2013
You Can UX Too: Avoiding the Programmer's User Interface @ NEPHP 2013
Eryn O'Neil
 

Similar to Build next generation apps with eyes and ears using Google Chrome (20)

Google hangouts ideas
Google hangouts ideas Google hangouts ideas
Google hangouts ideas
 
Android Design Patterns
Android Design PatternsAndroid Design Patterns
Android Design Patterns
 
Paperless worksheets, a paradox or an opportunity? (by Anthony Hill)
Paperless worksheets, a paradox or an opportunity? (by Anthony Hill)Paperless worksheets, a paradox or an opportunity? (by Anthony Hill)
Paperless worksheets, a paradox or an opportunity? (by Anthony Hill)
 
Breaking the Fourth Wall in Software
Breaking the Fourth Wall in SoftwareBreaking the Fourth Wall in Software
Breaking the Fourth Wall in Software
 
You Can UX Too: Avoiding the Programmer's User Interface @ NEPHP 2013
You Can UX Too: Avoiding the Programmer's User Interface @ NEPHP 2013You Can UX Too: Avoiding the Programmer's User Interface @ NEPHP 2013
You Can UX Too: Avoiding the Programmer's User Interface @ NEPHP 2013
 
iRobot
iRobotiRobot
iRobot
 
IT4BC - Mobile Learning User Experience (UX) Design: Creating a Mobile Course...
IT4BC - Mobile Learning User Experience (UX) Design: Creating a Mobile Course...IT4BC - Mobile Learning User Experience (UX) Design: Creating a Mobile Course...
IT4BC - Mobile Learning User Experience (UX) Design: Creating a Mobile Course...
 
The english language crystal ball: the past present and future of technology ...
The english language crystal ball: the past present and future of technology ...The english language crystal ball: the past present and future of technology ...
The english language crystal ball: the past present and future of technology ...
 
Diseño inclusivo
Diseño inclusivoDiseño inclusivo
Diseño inclusivo
 
Making Effective Prototypes
Making Effective PrototypesMaking Effective Prototypes
Making Effective Prototypes
 
Screenless Browsing - Audio Sword
Screenless Browsing - Audio SwordScreenless Browsing - Audio Sword
Screenless Browsing - Audio Sword
 
How Usable Are You
How Usable Are YouHow Usable Are You
How Usable Are You
 
Conversational experience by Systango
Conversational experience by SystangoConversational experience by Systango
Conversational experience by Systango
 
Designing for mobile. A UX perspective for developers
Designing for mobile. A UX perspective for developersDesigning for mobile. A UX perspective for developers
Designing for mobile. A UX perspective for developers
 
Holo
HoloHolo
Holo
 
Holo
HoloHolo
Holo
 
7 alternative technology in the classroom presentation tools
7 alternative technology in the classroom presentation tools7 alternative technology in the classroom presentation tools
7 alternative technology in the classroom presentation tools
 
Collaboration Tools for the Arts
Collaboration Tools for the ArtsCollaboration Tools for the Arts
Collaboration Tools for the Arts
 
ETUG T.E.L.L.: Designing a Multi-device Moodle Course Site: A Case Study
ETUG T.E.L.L.: Designing a Multi-device Moodle Course Site: A Case StudyETUG T.E.L.L.: Designing a Multi-device Moodle Course Site: A Case Study
ETUG T.E.L.L.: Designing a Multi-device Moodle Course Site: A Case Study
 
The rise of Conversational User Interfaces
The rise of Conversational User Interfaces The rise of Conversational User Interfaces
The rise of Conversational User Interfaces
 

Build next generation apps with eyes and ears using Google Chrome

  • 1. GoogleIO @ Java Meetup Ahmedabad Build next generation apps with eyes and ears using Google Chrome 25th Jun, 2014 Dhruv Gohil From Ishi systems
  • 2. Treat your users as human! UX - 101 Tools should fit human not the other way around!
  • 3. Now we have books on UX! • With computers, we lost 'human touch' in interaction • Touch screens brought it back, but frankly its not the natural way to do 'things'
  • 4. Let's get back to basics.. • I have to tell you something.. I will speak and you will listen. • So why it has to be different for computers?!
  • 5. Enter the new era! • Now browsers can speak to you. • And browsers can hear you too! http://updates.html5rocks.com/2013/01/Voice-Driven-Web-Apps-Introduction-to-the-Web-Speech-API
  • 8. You talk too much... ● It's time to listen.. ● Yes! That's all we need!
  • 10. This is how translation should work! http://www.moreawesomeweb.com/demos/speech_translate.html
  • 11. Command mode! ● I am married.... ammm.. Can I command my computer to do things? https://www.talater.com/annyang/ https://github.com/TalAter/annyang
  • 12. But can you see me? http://wolframhempel.github.io/photobooth-js/
  • 13. Ok but how its useful? http://revealjs.herokuapp.com/ Time to demo!, again.! :-) https://github.com/willy-vvu/reveal.js
  • 14. What was that? ● Magic of 'Motion detection algorithms' ● Over simplified How To: – Capture video [getUserMedia()] – Draw frames [ html5 <canvas> ] – Run Algorithm to detect change between frames [ .js] ● If you are deeply interested – Read : http://labs.littlemissrobot.com/behind-the-mirror/ http://www.adobe.com/devnet/html5/articles/javascript-motion-detection.html
  • 15. More! ● https://github.com/foo123/HAAR.js#live-examples – Interactive Face Detection – Many Faces Detection – Interactive Mouth Detection ● http://www.soundstep.com/blog/experiments/jstracking/ – Motion tracking ● http://www.soundstep.com/blog/experiments/jsdetection/ – The Magical Xylophone
  • 18. WebSpeech API fundametals ● Audio capture ● Request authorization ● Audio processing pip line – Nodes (FFT , Gain, Filter...) – http://chimera.labs.oreilly.com/books/1234000001552/ ● Join to destination (Speakers) ● + Visualizations
  • 19. GetUserMedia API fundamentals ● Audio + Video capture ● Request authorization ● Capture ● Draw (Hidden) ● Analyze + Process ● Re-Draw
  • 20. And finally... लगता है बर्फर िगर रही है! https://github.com/IonicaBizau/WebcamSnowhttp://ionicabizau.net/WebcamSnow/