- The document discusses the potential for a spoken version of Google that allows users to access information via voice commands and returns results in audio form using speech synthesis.
- Currently, Google can only understand text and return text-based results, but advances in speech recognition and synthesis mean machines may soon be able to listen to and understand all spoken content online as well as the text.
- A spoken version of Google could enable users to retrieve any information from the huge quantities of multimedia content on the internet via voice from any device, making the world's knowledge universally accessible.
Google Wave Platform. Exploring the Settings for Personalized LearningJaved Alam
The Google Wave is a communication, collaboration and rich media document creation platform that is in focus of educators and researchers, because of its capable to be harnessed for educational purposes. The paper discusses characteristics of Google Wave those facilitate creation of personalized learning spaces accessible to students anywhere and anytime.
Developing online listening exercises for natural EnglishVance Stevens
In our context, we can render the unclear audio to text by listening to it, parsing it mentally, saying it back into the SR engine, and then creating text manipulation exercises from it that force students to attend to certain details in the text / speech. We have created Hot Potatoes exercises where the audio is embedded in the exercise and the students can play the audio, complete the exercise, and get a score. This explains how and why we do it.
This presentation shows how to enhance language competencies with the use of internet. Practice links are provided for beginners so that they can rehearse listening, reading, lexical, and accuracy skills.
Google Wave Platform. Exploring the Settings for Personalized LearningJaved Alam
The Google Wave is a communication, collaboration and rich media document creation platform that is in focus of educators and researchers, because of its capable to be harnessed for educational purposes. The paper discusses characteristics of Google Wave those facilitate creation of personalized learning spaces accessible to students anywhere and anytime.
Developing online listening exercises for natural EnglishVance Stevens
In our context, we can render the unclear audio to text by listening to it, parsing it mentally, saying it back into the SR engine, and then creating text manipulation exercises from it that force students to attend to certain details in the text / speech. We have created Hot Potatoes exercises where the audio is embedded in the exercise and the students can play the audio, complete the exercise, and get a score. This explains how and why we do it.
This presentation shows how to enhance language competencies with the use of internet. Practice links are provided for beginners so that they can rehearse listening, reading, lexical, and accuracy skills.
How to Make a Field invisible in Odoo 17Celine George
It is possible to hide or invisible some fields in odoo. Commonly using “invisible” attribute in the field definition to invisible the fields. This slide will show how to make a field invisible in odoo 17.
The French Revolution, which began in 1789, was a period of radical social and political upheaval in France. It marked the decline of absolute monarchies, the rise of secular and democratic republics, and the eventual rise of Napoleon Bonaparte. This revolutionary period is crucial in understanding the transition from feudalism to modernity in Europe.
For more information, visit-www.vavaclasses.com
Palestine last event orientationfvgnh .pptxRaedMohamed3
An EFL lesson about the current events in Palestine. It is intended to be for intermediate students who wish to increase their listening skills through a short lesson in power point.
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdfTechSoup
In this webinar you will learn how your organization can access TechSoup's wide variety of product discount and donation programs. From hardware to software, we'll give you a tour of the tools available to help your nonprofit with productivity, collaboration, financial management, donor tracking, security, and more.
Embracing GenAI - A Strategic ImperativePeter Windle
Artificial Intelligence (AI) technologies such as Generative AI, Image Generators and Large Language Models have had a dramatic impact on teaching, learning and assessment over the past 18 months. The most immediate threat AI posed was to Academic Integrity with Higher Education Institutes (HEIs) focusing their efforts on combating the use of GenAI in assessment. Guidelines were developed for staff and students, policies put in place too. Innovative educators have forged paths in the use of Generative AI for teaching, learning and assessments leading to pockets of transformation springing up across HEIs, often with little or no top-down guidance, support or direction.
This Gasta posits a strategic approach to integrating AI into HEIs to prepare staff, students and the curriculum for an evolving world and workplace. We will highlight the advantages of working with these technologies beyond the realm of teaching, learning and assessment by considering prompt engineering skills, industry impact, curriculum changes, and the need for staff upskilling. In contrast, not engaging strategically with Generative AI poses risks, including falling behind peers, missed opportunities and failing to ensure our graduates remain employable. The rapid evolution of AI technologies necessitates a proactive and strategic approach if we are to remain relevant.
Model Attribute Check Company Auto PropertyCeline George
In Odoo, the multi-company feature allows you to manage multiple companies within a single Odoo database instance. Each company can have its own configurations while still sharing common resources such as products, customers, and suppliers.
Instructions for Submissions thorugh G- Classroom.pptxJheel Barad
This presentation provides a briefing on how to upload submissions and documents in Google Classroom. It was prepared as part of an orientation for new Sainik School in-service teacher trainees. As a training officer, my goal is to ensure that you are comfortable and proficient with this essential tool for managing assignments and fostering student engagement.
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...Levi Shapiro
Letter from the Congress of the United States regarding Anti-Semitism sent June 3rd to MIT President Sally Kornbluth, MIT Corp Chair, Mark Gorenberg
Dear Dr. Kornbluth and Mr. Gorenberg,
The US House of Representatives is deeply concerned by ongoing and pervasive acts of antisemitic
harassment and intimidation at the Massachusetts Institute of Technology (MIT). Failing to act decisively to ensure a safe learning environment for all students would be a grave dereliction of your responsibilities as President of MIT and Chair of the MIT Corporation.
This Congress will not stand idly by and allow an environment hostile to Jewish students to persist. The House believes that your institution is in violation of Title VI of the Civil Rights Act, and the inability or
unwillingness to rectify this violation through action requires accountability.
Postsecondary education is a unique opportunity for students to learn and have their ideas and beliefs challenged. However, universities receiving hundreds of millions of federal funds annually have denied
students that opportunity and have been hijacked to become venues for the promotion of terrorism, antisemitic harassment and intimidation, unlawful encampments, and in some cases, assaults and riots.
The House of Representatives will not countenance the use of federal funds to indoctrinate students into hateful, antisemitic, anti-American supporters of terrorism. Investigations into campus antisemitism by the Committee on Education and the Workforce and the Committee on Ways and Means have been expanded into a Congress-wide probe across all relevant jurisdictions to address this national crisis. The undersigned Committees will conduct oversight into the use of federal funds at MIT and its learning environment under authorities granted to each Committee.
• The Committee on Education and the Workforce has been investigating your institution since December 7, 2023. The Committee has broad jurisdiction over postsecondary education, including its compliance with Title VI of the Civil Rights Act, campus safety concerns over disruptions to the learning environment, and the awarding of federal student aid under the Higher Education Act.
• The Committee on Oversight and Accountability is investigating the sources of funding and other support flowing to groups espousing pro-Hamas propaganda and engaged in antisemitic harassment and intimidation of students. The Committee on Oversight and Accountability is the principal oversight committee of the US House of Representatives and has broad authority to investigate “any matter” at “any time” under House Rule X.
• The Committee on Ways and Means has been investigating several universities since November 15, 2023, when the Committee held a hearing entitled From Ivory Towers to Dark Corners: Investigating the Nexus Between Antisemitism, Tax-Exempt Universities, and Terror Financing. The Committee followed the hearing with letters to those institutions on January 10, 202
1. Towards A Spoken Version of Google
─ making the global knowledge
accessible by voice
Lin-shan Lee
National Taiwan University
2. • Google, Amazon, Facebook, YouTube offer most of our daily life
information
• Physical knowledge archives (e.g. libraries, museums) useful in the
past are developing their virtualized versions
• Online courses, distant learning, electronic books, etc. become the
most efficient learning aids
Internet is the Only Largest Archive for Global
Human Knowledge
Internet
Real-time
Information
– weather, traffic
– flight schedule
– stock price
– sports scores
Special Services
– Google
– Facebook
–YouTube
– Amazon
Knowledge
Archieves
– digital libraries
– virtual museums
Intelligent Working
Environment
– intelligent agents
– teleconferencing
– distant learning
– electric commerce
Private Services
– personal assistants
– business databases
– home appliances
– network
entertainments
3. • Have been An Important Part of our Daily Life
Google and the Internet
• Finding any desired information (in text) over the Internet
relevant to the user instructions (in text): Information
Retrieval
• Google is able to match and find the relevance between
segments of text
Internet
Google
……………
…………..
…….
server
server
User instructions
(in text)
Documents/Information
(in text)
(in text)
(in text)
4. Google and the Internet
– for multimedia information (e.g. YouTube, online courses, including
video and audio) to be retrieved by Google, the content needs to include
text descriptions
• Google can only read text information
• Have been An Important Part of our Daily Life
Internet
……………
…………..
…….
server
server
User instructions
(in text)
Documents/Information
(in text)
Google
5. Google and the Internet
• The Whole Video (a lecture or a news story) can be retrieved,
but Not the Exact Sentence
– the text descriptions is for a video or a lecture
• Have been An Important Part of our Daily Life
Internet
……………
…………..
…….
server
server
User instructions
(in text)
Documents/Information
(in text)
Google
6. Google and the Internet
• The Audio Information (voice in a video or a lecture) in
Multimedia very often tells what is going on in the
Multimedia
• Can be used as the Key for Information Retrieval
• Have been An Important Part of our Daily Life
Internet
……………
…………..
…….
server
server
User instructions
(in text)
Documents/Information
(in text)
Google
7. Speech Technologies Advancing Very Fast Today
• Speech Recognition / Synthesis Technologies
• Machines can Listen to Voice or Read Text
• All Roles of Text can be Realized by Voice
Speech Recognition
(machine)
Speech Synthesis
(machine)
text
voice
How are you today?
Good morning!
Spoken language written language
8. Machines
listen to
the audio
part
Google
voice
information
Multimedia
Content
Internet
voice
input/
output
text
information
• Information Retrieval as Google does can be performed based on Voice
• User-Content Interaction can be Accomplished by Spoken and Multi-
modal Dialogues (including other Modes of Interaction, e.g. those with
fingers)
• Text Information found can be transformed to Voice
Google
(text-based)
Text
Content
Information
Retrieval
(voice-based)
Speech Synthesis
Spoken and
Multi-modal
Dialogue
Spoken Version of Google
Google may have a Spoken Version
9. Information Retrieval (text/voice-based)
• Both the User Instructions and Network Content can be in
form of Voice
Voice Instructions
US/China Trade Policy ?
Text Instructions
d1
Text Information
d2
d3
d1
d2
d3
Voice Information
President Donald Trump…
Google
/voice
10. Voice Interface is Convenient for All Different Kinds of
User Terminals
Internet
Text Content
Multimedia
Content
• Smart phones, Hand-held Devices, Wearable Devices (Watches,
Glasses, etc.), Notebooks, Vehicular Electronics, Home
Appliances …
• Network Access at Any Time, from Anywhere
• Small in Size, Light in Weight, Ubiquitous, Invisible…
• Voice is the Only Interface Convenient for ALL User Terminals at
Any Time, from Anywhere, and To the Point in one Utterance
11. Google and the Internet
• Have been An Important Part of our Daily Life
Internet
Google
……………
…………..
…….
server
server
User instructions
(in text)
Documents/Information
(in text)
• Finding any desired information (in text) over the Internet
relevant to the user instructions (in text): Information
Retrieval
• Google is able to match and find the relevance between
segments of text
12. Google can have a Spoken Version
• Finding any desired information (in voice) over the Internet
relevant to the user instructions (in voice): Information
Retrieval
• Spoken version of Google is able to match and find the
relevance between segments of voice
• Have been An Important Part of our Daily Life
Internet
Google
……………
…………..
…….
server
server
User instructions
(in voice)
Documents/Information
(in voice)
13. Google can have a Spoken Version
• Can Locate the Exact Time the Desired Information Appears
in the Multimedia
• No Need for Text Descriptions
• Have been An Important Part of our Daily Life
Internet
Google
……………
…………..
…….
server
server
User instructions
(in voice)
Documents/Information
(in voice)
14. Retrieval of Public TV News of Taiwan (公視新聞
搜尋)
• 2004
• Locate the Exact Relevant Utterances (Spoken Sentences)
without Text Descriptions
15. Retrieval of Public TV News of Taiwan (公視新聞
搜尋)
• 2004
• Locate the Exact Relevant Utterances (Spoken Sentences)
without Text Descriptions
16. User-Content Interaction for Spoken Content
Retrieval
• Problems
– User-content interaction always important even for text content
– Unlike text content, spoken content not easily summarized on screen,
thus retrieved results difficult to scan and select
User
Query
Spoken
Archives
Retrieved Results Spoken Version
of Google
17. User-Content Interaction for Spoken Content
Retrieval
• Possible Approaches
– Automatic summary/title generation and keyword extraction from
spoken content
– Topic structure for spoken content
– Multi-modal dialogue with improved interaction
Keyword/
Titles/Summaries
User
Query
Multi-modal
Dialogue
Spoken
Archives
Retrieved Results Spoken Version
of Google
User
Interface Topic
Structure
Spoken version of Google
19. User-Content Interaction for Spoken Content
Retrieval
• Possible Approaches
– Automatic summary/title generation and keyword extraction from
spoken content
– Topic structure for spoken content
– Multi-modal dialogue with improved interaction
Keyword/
Titles/Summaries
User
Query
Multi-modal
Dialogue
Spoken
Archives
Retrieved Results Spoken Version
of Google
User
Interface Topic
Structure
Spoken version of Google
20. • Example 1: retrieved results Grouped by Topics and
organized in a Two-dimensional Tree Structure
– each group of retrieved segments labeled by a set of keywords (topic)
– each group expanded into a map in the next layer
Topic Structure (1/2)
22. • Usually very long time required (e.g. 45 hrs) to learn from a
complete online course
– difficult for very busy people to learn pieces of knowledge
• Possible to retrieve a segment of lecture with a browser
– knowledge transfer in a course is usually sequential
– not easy to understand a lecture without listening to the previous
lectures
– not easy to find out background or related knowledge
• Possible Approaches
– consecutive audio/video for each slide taken as a segment of
multimedia (a page of spoken slide)
– each spoken slide labeled by the keywords (topic) extracted from the
audio
– relationships between keywords of the course represented by a graph
Online Courses
23. • Example 2: Keyword Graph
– each spoken slide labeled by a set of keywords (topics)
– relationships between keywords represented by a graph
Topic Structure (2/2)
-----
-----
-----
-----
---------
---------
---------
---
-------
-------
-------
----
spoken
slides
keyword
graph
Acoustic
Modeling
Viterbi
search
HMM
Language
Modeling
Perplexity
29. • To Which Degree can Machines Understand the Spoken
Content ?
• Machines to take TOEFL Listening Comprehension Test (an
English test for students whose native language is not English)
• An Example Problem:
A Question: “ What is a possible origin of Venus’ clouds? ”
Answer options:
(A) gases released as a result of volcanic activity
(C) bursts of radio energy from the plane's surface
(D) strong winds that blow dust into the atmosphere
Recorded voice: (5min long)
Machine Comprehension of Spoken Content
(B) chemical reactions caused by high surface temperatures
30. (1) (2) (3) (4) (5)
by some hand-crafted rules without
listening to the recorded voice
Best Machine Performance: 50.0%
choosing the shortest answer
Machine Comprehension of Spoken Content
(%)
• Machine Performance 2017
Accuracy
32. What can Machines do for Human ?
• Machines can listen to and understand the entire multimedia
knowledge archive over the Internet
– extracting desired information for each individual user
300hrs of videos
uploaded per min
(2015.01)
Roughly 2000 online
courses on Coursera
(2016.04)
• Nobody can go through so much multimedia
information, but Machines can
• Multimedia Content exponentially
increasing over the Internet
– best archive of global human knowledge is here
– desired information deeply buried under huge quantities
of unrelated information
33. An Example: Personalized Courses
• Machines generate desired personalized courses for each
individual user
I wish to learn some knowledge
about the masterpieces of
Wolfgang Amadeus Mozart
I am an engineer. I know nothing
about music
I can spend 3 hrs to learn
user
Thank you for your request. This is
the 3-hr personalized course for
you.
Information
from Internet
34. Spoken Version of Google
• Speech Recognition and Synthesis Technologies Make
Machines Capable of Listening to and Speaking Human Voice
– the best the machines can do may be as good as the human
• Machines can Handle Huge Quantities of Information
– much more efficient than human
• Google Reads All Text over the Internet
– similarly machines can listen to all voices over the Internet
• Internet is the Only Largest Archive for Global Human
Knowledge
– voice can be the key to that archive