An introduction to the biology and neurophysiology of human speech. The target audience is researchers and engineers working on speech recognition technology.
The presentation focuses on cerebral asymmetries in structural, functional and molecular levels regarding production and comprehension of language faculty. It also briefs about the role of different language areas and sex differences in language.
Content includes basic physiology of language an speech.
along with description of brain areas involved as well as basic knowledge of different types aphasia.
The presentation focuses on cerebral asymmetries in structural, functional and molecular levels regarding production and comprehension of language faculty. It also briefs about the role of different language areas and sex differences in language.
Content includes basic physiology of language an speech.
along with description of brain areas involved as well as basic knowledge of different types aphasia.
Speech disorders
1. Central Mechanisms:
Depending on the integration of the higher brain centers for symbolization (speech centers), mainly in the dominant hemisphere.
Lesion leads to Dysphasia or Aphasia.
2. Peripheral Mechanisms:
A. Articulation:
Lesion leads to Dysarthria or Anarthria.
B. Phonation:
Lesion leads to Dysphonia or Aphonia.
All Charateristics of Non FLuent Aphasia
Difficulty communicating orally
Difficulty with written words
Other names:
Motor aphasia
Anterior aphasia
Broca's aphasia
Types of Non-Fluent Aphasia
Broca’s Aphasia
Transcortical motor Aphasia
Mixed Transcortical Aphasia
Global Aphasia
Due to damage of the CNS or PNS or both. There is some involvement of the basic motor processes used in speech and this results in a movement disorder...
Speech disorders
1. Central Mechanisms:
Depending on the integration of the higher brain centers for symbolization (speech centers), mainly in the dominant hemisphere.
Lesion leads to Dysphasia or Aphasia.
2. Peripheral Mechanisms:
A. Articulation:
Lesion leads to Dysarthria or Anarthria.
B. Phonation:
Lesion leads to Dysphonia or Aphonia.
All Charateristics of Non FLuent Aphasia
Difficulty communicating orally
Difficulty with written words
Other names:
Motor aphasia
Anterior aphasia
Broca's aphasia
Types of Non-Fluent Aphasia
Broca’s Aphasia
Transcortical motor Aphasia
Mixed Transcortical Aphasia
Global Aphasia
Due to damage of the CNS or PNS or both. There is some involvement of the basic motor processes used in speech and this results in a movement disorder...
Speech perception is defined as the process by which a perceiver tries to identify the talkers underlying language patterns on the basis of speech sounds and movements. The ultimate goal of speech perception is to determine the meaning and intent behind the spoken message.
-Arthur Boothroyd (1998)
In many everyday situations, we find ourselves listening to speech-often trying to understand the speech of one particular person even as other conversions, radio broadcasts, and public address announcements create a troublesome speech background. How do we understand the speech of other people? How do we select one voice particularly from a crowd of conversing persons? By what processes do we take in the perishable acoustic signal of speech and quickly reach decision about who said it, what was said and how it was said? All of these decisions must be made before the speaker produces the next utterance. These are some of the questions that the study of speech perception attempts to answer.
Auditory perception of speech is a process of interpreting the instructions imprinted on the acoustic wave by the speaker over a time span.
Auditory perception of speech per se deals mainly with the temporal management of information from the input (Berlin 1969).
• Speech is a continuous, unsegmented event. The organs of speech glide from one target position to the next, generating transitional information in the process.
• The characteristics of the acoustic stimulus for any given phoneme are considerably influenced by its neighbors i.e., its phonetic context. Coarticulation results from overlapping of the articulatory constituents of one sound with the next.
The perception of any sound can be considered in terms of either
a) The manner of articulation used in its production
b) The resultant acoustic event.
McKay (1956) described two approaches for an explanation of how linguistic value is determined from a speech signal. They are
1) Active
2) Passive
The passive system is envisaged as a filtered system functioning to identify and combine information so as to restructure the pattern. These theories are termed ‘Non mediated’ theories.
The active models are viewed as comparator systems in which input pattern are compared to an internally generated pattern. These models/theories are referred to as ‘mediated’ theories.
20141030 ntustme computer_programmingandbeyond_shareTing-Shuo Yo
A short introduction to what programming can do, with a special focus on the field of big data and internet of things. The audience is undergraduate students taking the first programming class, so the aim is to give a general big picture instead of thorough details.
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
Keynote at DIGIT West Expo, Glasgow on 29 May 2024.
Cheryl Hung, ochery.com
Sr Director, Infrastructure Ecosystem, Arm.
The key trends across hardware, cloud and open-source; exploring how these areas are likely to mature and develop over the short and long-term, and then considering how organisations can position themselves to adapt and thrive.
Generating a custom Ruby SDK for your web service or Rails API using Smithyg2nightmarescribd
Have you ever wanted a Ruby client API to communicate with your web service? Smithy is a protocol-agnostic language for defining services and SDKs. Smithy Ruby is an implementation of Smithy that generates a Ruby SDK using a Smithy model. In this talk, we will explore Smithy and Smithy Ruby to learn how to generate custom feature-rich SDKs that can communicate with any web service, such as a Rails JSON API.
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Tobias Schneck
As AI technology is pushing into IT I was wondering myself, as an “infrastructure container kubernetes guy”, how get this fancy AI technology get managed from an infrastructure operational view? Is it possible to apply our lovely cloud native principals as well? What benefit’s both technologies could bring to each other?
Let me take this questions and provide you a short journey through existing deployment models and use cases for AI software. On practical examples, we discuss what cloud/on-premise strategy we may need for applying it to our own infrastructure to get it to work from an enterprise perspective. I want to give an overview about infrastructure requirements and technologies, what could be beneficial or limiting your AI use cases in an enterprise environment. An interactive Demo will give you some insides, what approaches I got already working for real.
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
Essentials of Automations: Optimizing FME Workflows with ParametersSafe Software
Are you looking to streamline your workflows and boost your projects’ efficiency? Do you find yourself searching for ways to add flexibility and control over your FME workflows? If so, you’re in the right place.
Join us for an insightful dive into the world of FME parameters, a critical element in optimizing workflow efficiency. This webinar marks the beginning of our three-part “Essentials of Automation” series. This first webinar is designed to equip you with the knowledge and skills to utilize parameters effectively: enhancing the flexibility, maintainability, and user control of your FME projects.
Here’s what you’ll gain:
- Essentials of FME Parameters: Understand the pivotal role of parameters, including Reader/Writer, Transformer, User, and FME Flow categories. Discover how they are the key to unlocking automation and optimization within your workflows.
- Practical Applications in FME Form: Delve into key user parameter types including choice, connections, and file URLs. Allow users to control how a workflow runs, making your workflows more reusable. Learn to import values and deliver the best user experience for your workflows while enhancing accuracy.
- Optimization Strategies in FME Flow: Explore the creation and strategic deployment of parameters in FME Flow, including the use of deployment and geometry parameters, to maximize workflow efficiency.
- Pro Tips for Success: Gain insights on parameterizing connections and leveraging new features like Conditional Visibility for clarity and simplicity.
We’ll wrap up with a glimpse into future webinars, followed by a Q&A session to address your specific questions surrounding this topic.
Don’t miss this opportunity to elevate your FME expertise and drive your projects to new heights of efficiency.
Elevating Tactical DDD Patterns Through Object CalisthenicsDorra BARTAGUIZ
After immersing yourself in the blue book and its red counterpart, attending DDD-focused conferences, and applying tactical patterns, you're left with a crucial question: How do I ensure my design is effective? Tactical patterns within Domain-Driven Design (DDD) serve as guiding principles for creating clear and manageable domain models. However, achieving success with these patterns requires additional guidance. Interestingly, we've observed that a set of constraints initially designed for training purposes remarkably aligns with effective pattern implementation, offering a more ‘mechanical’ approach. Let's explore together how Object Calisthenics can elevate the design of your tactical DDD patterns, offering concrete help for those venturing into DDD for the first time!
JMeter webinar - integration with InfluxDB and GrafanaRTTS
Watch this recorded webinar about real-time monitoring of application performance. See how to integrate Apache JMeter, the open-source leader in performance testing, with InfluxDB, the open-source time-series database, and Grafana, the open-source analytics and visualization application.
In this webinar, we will review the benefits of leveraging InfluxDB and Grafana when executing load tests and demonstrate how these tools are used to visualize performance metrics.
Length: 30 minutes
Session Overview
-------------------------------------------
During this webinar, we will cover the following topics while demonstrating the integrations of JMeter, InfluxDB and Grafana:
- What out-of-the-box solutions are available for real-time monitoring JMeter tests?
- What are the benefits of integrating InfluxDB and Grafana into the load testing stack?
- Which features are provided by Grafana?
- Demonstration of InfluxDB and Grafana using a practice web application
To view the webinar recording, go to:
https://www.rttsweb.com/jmeter-integration-webinar
2. References
Audition, the body senses, and the chemical senses.
Physiology of behavior, 6th Ed, 1998, pp. 185-223.
by Carlson N. R.
Human communication.
Physiology of behavior, 6th Ed, 1998, pp. 477-508.
by Carlson, N. R.
FUNCTIONAL MRI OF LANGUAGE: New Approaches to Understanding the
Cortical Organization of Semantic Processing
Annu. Rev. Neurosci., (2002), pp. 151-188.
by Bookheimer, S.
Lateralization of auditory language functions: A dynamic dual pathway model
Brain and Language, 89 (2004) 267–276
by Friederici, A.D. and Alter, K.
3. Outline
● Auditory apparatus
● MFCC
● Lesion study
● Neuroimaging
● Dynamic dual channel model
● Can we design ASR systems by mimicking
organic systems?
12. MFCC
● Mel Frequency Cepstral Coefficient
– Take the Fourier transform of a signal
– Map the log amplitudes of the spectrum obtained
above onto the mel scale, using triangular
overlapping windows.
– Take the Discrete Cosine Transform of the list of
mel log-amplitudes, as if it were a signal.
– The MFCCs are the amplitudes of the resulting
spectrum.
13. From the ears to the brain
● Ear
– Spectral signals.
– Fourier transform done by neural circuits.
● Brain
– Two pathways in two hemisphere
– Left: semantics and syntactics
– Right: prosody
14. Brain Mechanisms for Language
● From lesion study to neuroimaging
● Localization of functions
● Lateralization
● Speech Production and Comprehension
● Prosody
15. Lesion Studies
● Aphasia
– Difficulty in producing or comprehending speech
caused by brain damage.
● Broca's aphasia
– agrammatism
– anomia
● Wernicke's aphasia
– poor speech comprehension
16. Broca's Aphasia
● Agrammatism:
– difficulty in understanding / using grammar
● Anomia:
– difficulty in finding the appropriate word to describe
an object, action, or attribute.
● Apraxia of speech:
– impairment in the ability to program movements of
the tongue, lips, and throat required to produce the
proper sequence of speech sounds.
17. Broca's Aphasia Example
● "Yes ... Monday ... Dad, and Dad ... hospital,
and ... Wednesday, Wednesday, nine o'clock
and ... Thursday, ten o'clock ... doctors, two,
two ... doctors and ... teeth, yah."
● 是...阿...星期一...阿...父親及父親....阿...醫院...及
阿...星期三...星期三九點... 以及 ,喔...星期四...十
點, 阿,醫生...兩個...醫生...及阿...牙齒...對的。
19. Wernicke's Aphasia
● Poor speech comprehension:
–
● Fluent but meaningless speech:
–
● Pure word deafness:
– The ability to hear, to speak, and to read and write
without being able to comprehend the meaning of
speech.
20. Wernicke's Aphasia Example
● Examiner: What kind of work have you done?
● Patient: We, the kids, all of us, and I, we were working for a long time
in the ... you know ... it's the kind of space, I mean place rear to the
spedawn ...
● Examiner: Excuse me, but I wanted to know what work you have
been doing.
● Patient: If you had said that, we had said that, poomer, near the
fortunate, porpunate, tamppoo, all around the fourth of martz. Oh, I
get all confused.
25. Semantic Conditions
● Same
– The lawyer questioned the witness.
– The attorney questioned the witness.
● Different
– The man was attacked by the doberman.
– The man was attacked by the pitbull.
26. Syntactic Conditions
● Same
– The policeman arrested the thief.
– The thief was arrested by the policeman.
● Different
– The teacher was outsmarted by the student.
– The teacher outsmarted the student.
27. Summary by Bookheimer, 2002
● The role of the left inferior frontal lobe in semantic
processing and dissociations from other frontal lobe
language functions.
● The organization of categories of objects and
concepts in the temporal lobe.
● The role of the right hemisphere in comprehending
contextual and figurative meaning.
28. Overview by Ahrens, 2007
● Past
– Functional localization (brain damage)
● Present
– Narrower localization + discussion of overlap and
integration (neuro-imaging techniques)
● Future
– Language as a brain function (integrate knowledge
about timing, context, and individual differences)
29. The Three Myths
● Myth 1: Broca’s area deals with syntax/production
– Fact: Semantics and phonology cluster in different areas of
the IFG; syntax seems to be distributed throughout the IFG.
– Fact: IFG is activated during non-language tasks.
● Myth 2: Wernicke’s area deals with
semantics/comprehension
– Fact: There are functional subdivisions for language in
posterial temporal area.
30. The Three Myths
● Myth 3: The right hemisphere is not used when
processing language
– Fact: The right hemisphere is called upon for many
integrative language processes.
> Figurative Language and Metaphor
> Linguistic Context
> Prosody
32. Dynamic Dual Pathway Model
● Spoken language comprehension requires the
coordination of different subprocesses in time.
● Segmental information:
– phonemes, syntactic elements and lexical-semantic
elements.
● Suprasegmental information:
– accentuation and intonational phrases, i.e., prosody.
33. Localization of Different Subsystems
● Segmental information:
– syntactic and semantic information are primarily
processed in a left hemispheric temporo-frontal
pathway including separate circuits for syntactic and
semantic information
● Suprasegmental information:
– sentence level prosody is processed in a right
hemispheric temporo-frontal pathway.
35. Can we design ASR systems
by imitating the brain?
● An open question
– Is it possible? Is it more effective?
● Complexity
– Basic computation power of a neuron: 60 hz
– 10^8 of input, 10^10 in the brain, each with >8000
connections
● Training time
– How long would it take for a human being to
understand language?