SlideShare a Scribd company logo
1 of 14
Download to read offline
Digital Transformation
with AI and Process Automation
Harendra Singh
AI & AUTOMATION CONSULTANT
harendra@lueinanalytics.com
LUEIN ANALYTICS
About
A thought leader and machine learning enthusiast with great passion for technology, creativity, challenging goals &
social entrepreneurship. Consulted more than 20+ startups in the areas of Natural Language Processing, Text/Image/
Video/Audio Processing, Deep Learning, Data Mining, Machine Learning, Unstructured Document Parsing, Statistics
or related field
Core expertise in :
1. AI and Automation product development
2. AI team hiring and team building for clients
3. AI training and team skill development
4. Remote AI product development and consulting
5. Generating new revenue stream for client and also helping client to save a lot of cost with
automated solutions.
Optimized the logistic packing algorithm
Select the best and most efficient box for a shipment
● Dependent factors - shipper types, dimensions of items,
rotations, packing more than one items at a time, shipping
costs, operation time, and experience for both our clients
and their customers.
● Ensemble model with combination of below algorithms:
○ First Fit Descending, pack the biggest products
first in the smallest space we can.
○ Knapsack problem, given a set of items, each with
a weight and a value, determine the number of each
item to include in a collection so that the total
weight is less than or equal to a given limit and the
total value is as large as possible.
○ Bin packing problem, objects of different volumes
must be packed into a finite number of bins or
containers each of volume V in a way that minimizes
the number of bins used.
Auto sectioned sample PDF output
PDF content extraction starts with manually zoning all the sections
( like title, images, sub-titles, page number, paragraphs, etc ) in a
PDF. Then it goes for section by section content extraction
With image processing and convolution neural network, we have
automated the entire manual zoning process.
We treated the text problem into an image problem and solved
the manual zoning problem effectively.
Auto PDF section identification and extraction
AI for video hiring ( AUDIO / SPEECH / VIDEO / IMAGE )
AI for video interview and video resume from saved or
real time interview streaming
● Video content parser
● Video content analytics
● Calculate candidates confidence level
● Calculate discussion interest level
● Calculate subject understanding level
● Calculate communications levels
● Calculate solution approach strategy
● Calculate calculate response time to answer
● Check whether candidate copying during interview.
● Candidate nervousness (with facial micro expressions)
● Conduct a expert less interview process.
Text summarization and generation (under process)
We are currently working on sophisticated Neural
Network algorithm combined with supporting framework
and infrastructure, we running multiple researches to
solve below Publishing use cases in the space of Natural
Language Generation:
1. Automatically create an initial article for author
from a topic
Create unique content from scratch, simulating a
human writer. You choose the topic and length, and the
algorithm will create your textual content.
2. NLG Financial Services
Automatically generate high quality personalised risk
analysis, financial, compliance and other reports in
writing in seconds..
3. Academic books summarization
Automatically generate titles, rewrite articles and
content summarisation for academic books.
Content generation
Machine scans an image and generate an alternate
description for the same.
● Significant amount of scientific research has been
carried out and still under process, to solve one of the
toughest problem of image alternate text generation
(also called as image caption generation).
● Machine takes an images as an input and returns a
short or long description about the input image.
● Neural network based algorithm image classification,
object detection, computer vision processing, along with
natural language generation has been used to solve this
problem .
● Image captioning domain is quite vast, we have
achieved initial success in medical equipment images,
maths graphs/reports and complex object detection.
1. Extract faces and names from school yearbook scanned image
dataset (collected from US, UK and few more European countries
school year books since past 45 years) and create facial database for
the client automatically.
2. Detect face from individual face images and also from group
scanned yearbook image files.
3. Extract Names and alias names from the same scanned yearbook
image file.
4. Using natural language processing and custom business rule to
associate faces with their names and feed it into the database.
Face detection, extraction and auto tagging
Sample input image file
Tone and actor
labeled srt file
Extract dialogue
and create subtitle
file
Extract
speech
features
Translate srt file
Process audio
for tone and
features
Embed tone feature in
translated audio
Speech trained
model
Srt file
Input Video
Video auto subtitleing.
Video emotion extraction (emotionML).
Audio feature extraction.
Tone analysis.
Subtitling translation to another language.
Use text2speech library to convert translated text to another language speech/dialogue text.
Embed original tone intensity features from original audio to new audio, also embed video emotion features
to make the final audio output look more realistic.
Embed new audio to original video file to generate dubbed video file.
Use minor manual modulation (if needed) to sync the final dialogue speed and timings
Video auto-dubbing (under development)
Use tone features
Embed new
audio to video
with dialog time
Dubbed Video
Tone analysis injection
text2speech
Generate alternate text for an image
Neural Network based generative model for captioning/
describing an image automatically in natural language.
Machine learning model trained with supervised 20k plus
manually labeled images.
The process consist of three core components :
1. CNN encoder
A pre-trained CNN is used to encode an image to its
features and also pre-encoded each image to its feature set
for high performance and speed.
2. Word embedding
Used pre-trained word embedding model and also explicitly
trained and embedding model that takes a word and
outputs an embedding vector.
3. CNN decoder
It takes the image vector and partial captions at the current
timestep and input and generated the next most probable
word as output.
Research and work on multimedia ( AUDIO / SPEECH / VIDEO / IMAGE )
1. VIDEO PROCESSING
Automatic subtitling for TV series videos.
Video search within for tagging opening credits and closing
credits.
Video dubbing- translating and embedding speech features in new
audio (under research).
Real time emotion and expression identification for complex
behavior identification.
2. SPEECH PROCESSING
Audio Signal Classification, speech detection, audio tone analysis,
dialogue recognition, speech-to-text and text-to-speech
conversion.
act text with layout from scanned documents.
3. IMAGE PROCESSING WITH NEURAL NETWORK
Object detection, face detection & recognition, image similarity
check and image upsampling / downsampling
4. CONTENT EXTRACTION FROM IMAGE
​Text area detection.
Automatic layout analysis.
Object detection & labelling.
Content extraction from scanned financial forms.
5. IMAGE CAPTIONING
Object detection, brand detection, event detection and
automatically generate alternate-text for an image.
6. SMART OCR
Text/image/table detection & extraction. OCR
orientation detection & correction. Extract text with
layout from scanned documents.
Worked on many multimedia AI use cases. Listing all the recent individual research and client use cases as below :
1. Adaptability is more vital to success than ever.
2. Smart machines and artificial intelligence (AI) are taking off in a big way.
3. Growing importance of the user experience.
4. Innovating rapidly and AI is being used across organisations.
5. Organisations are embracing remote AI workforces.
6. AI growth is sustainable and should not Be Ignored
WHY COMPANIES NEED AI DIGITAL TRANSFORMATION
PRIOR INDIVIDUAL CONSULTING EXPERIENCE WITH
And more..
Thank you

More Related Content

What's hot

Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition TechnologySeminar Links
 
Noise Adaptive Training for Robust Automatic Speech Recognition
Noise Adaptive Training for Robust Automatic Speech RecognitionNoise Adaptive Training for Robust Automatic Speech Recognition
Noise Adaptive Training for Robust Automatic Speech Recognitionأحلام انصارى
 
Speech recognition challenges
Speech recognition challengesSpeech recognition challenges
Speech recognition challengesAlexandru Chica
 
Voice recognition security systems
Voice recognition security systemsVoice recognition security systems
Voice recognition security systemsSandeep Kumar
 
Speech recognition techniques
Speech recognition techniquesSpeech recognition techniques
Speech recognition techniquessonukumar142
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognitionCharu Joshi
 
TEXT-SPEECH PPT.pptx
TEXT-SPEECH PPT.pptxTEXT-SPEECH PPT.pptx
TEXT-SPEECH PPT.pptxNsaroj kumar
 
SPEECH RECOGNITION USING NEURAL NETWORK
SPEECH RECOGNITION USING NEURAL NETWORK SPEECH RECOGNITION USING NEURAL NETWORK
SPEECH RECOGNITION USING NEURAL NETWORK Kamonasish Hore
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition TechnologyAamir-sheriff
 
Speech recognition system seminar
Speech recognition system seminarSpeech recognition system seminar
Speech recognition system seminarDiptimaya Sarangi
 
Minor Project Presentation 1
Minor Project Presentation 1Minor Project Presentation 1
Minor Project Presentation 1Pratishtha Ram
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech RecognitionAhmed Moawad
 
Voice recognition system
Voice recognition systemVoice recognition system
Voice recognition systemavinash raibole
 
Voice Recognition
Voice RecognitionVoice Recognition
Voice RecognitionAmrita More
 
Deep Learning For Speech Recognition
Deep Learning For Speech RecognitionDeep Learning For Speech Recognition
Deep Learning For Speech Recognitionananth
 
Automatic speech recognition system using deep learning
Automatic speech recognition system using deep learningAutomatic speech recognition system using deep learning
Automatic speech recognition system using deep learningAnkan Dutta
 

What's hot (20)

Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
Noise Adaptive Training for Robust Automatic Speech Recognition
Noise Adaptive Training for Robust Automatic Speech RecognitionNoise Adaptive Training for Robust Automatic Speech Recognition
Noise Adaptive Training for Robust Automatic Speech Recognition
 
Speech recognition challenges
Speech recognition challengesSpeech recognition challenges
Speech recognition challenges
 
Voice recognition security systems
Voice recognition security systemsVoice recognition security systems
Voice recognition security systems
 
Speech recognition techniques
Speech recognition techniquesSpeech recognition techniques
Speech recognition techniques
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognition
 
TEXT-SPEECH PPT.pptx
TEXT-SPEECH PPT.pptxTEXT-SPEECH PPT.pptx
TEXT-SPEECH PPT.pptx
 
SPEECH RECOGNITION USING NEURAL NETWORK
SPEECH RECOGNITION USING NEURAL NETWORK SPEECH RECOGNITION USING NEURAL NETWORK
SPEECH RECOGNITION USING NEURAL NETWORK
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
Speech recognition system seminar
Speech recognition system seminarSpeech recognition system seminar
Speech recognition system seminar
 
Automatic Speech Recognion
Automatic Speech RecognionAutomatic Speech Recognion
Automatic Speech Recognion
 
Minor Project Presentation 1
Minor Project Presentation 1Minor Project Presentation 1
Minor Project Presentation 1
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Voice recognition system
Voice recognition systemVoice recognition system
Voice recognition system
 
C5 giruba beulah
C5 giruba beulahC5 giruba beulah
C5 giruba beulah
 
Voice Recognition
Voice RecognitionVoice Recognition
Voice Recognition
 
Chatbot_Presentation
Chatbot_PresentationChatbot_Presentation
Chatbot_Presentation
 
Deep Learning For Speech Recognition
Deep Learning For Speech RecognitionDeep Learning For Speech Recognition
Deep Learning For Speech Recognition
 
Sais svcc
Sais svccSais svcc
Sais svcc
 
Automatic speech recognition system using deep learning
Automatic speech recognition system using deep learningAutomatic speech recognition system using deep learning
Automatic speech recognition system using deep learning
 

Similar to Prior AI consulting use cases

Rosinski ibm ai overview with several examples of projects in the media and l...
Rosinski ibm ai overview with several examples of projects in the media and l...Rosinski ibm ai overview with several examples of projects in the media and l...
Rosinski ibm ai overview with several examples of projects in the media and l...FIAT/IFTA
 
Harendra singh ai_consulting_portfolio_2021
Harendra singh ai_consulting_portfolio_2021Harendra singh ai_consulting_portfolio_2021
Harendra singh ai_consulting_portfolio_2021Harendra Singh
 
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)IRJET Journal
 
Agile Mumbai 2022 - Rohit Handa | Combining Human and Artificial Intelligence...
Agile Mumbai 2022 - Rohit Handa | Combining Human and Artificial Intelligence...Agile Mumbai 2022 - Rohit Handa | Combining Human and Artificial Intelligence...
Agile Mumbai 2022 - Rohit Handa | Combining Human and Artificial Intelligence...AgileNetwork
 
How to prepare a perfect video abstract for your research paper – Pubrica.pdf
How to prepare a perfect video abstract for your research paper – Pubrica.pdfHow to prepare a perfect video abstract for your research paper – Pubrica.pdf
How to prepare a perfect video abstract for your research paper – Pubrica.pdfPubrica
 
How to prepare a perfect video abstract for your research paper – Pubrica.pptx
How to prepare a perfect video abstract for your research paper – Pubrica.pptxHow to prepare a perfect video abstract for your research paper – Pubrica.pptx
How to prepare a perfect video abstract for your research paper – Pubrica.pptxPubrica
 
IRJET- Voice to Code Editor using Speech Recognition
IRJET- Voice to Code Editor using Speech RecognitionIRJET- Voice to Code Editor using Speech Recognition
IRJET- Voice to Code Editor using Speech RecognitionIRJET Journal
 
Automatic multi-modal metadata annotation based on trained cognitive solution...
Automatic multi-modal metadata annotation based on trained cognitive solution...Automatic multi-modal metadata annotation based on trained cognitive solution...
Automatic multi-modal metadata annotation based on trained cognitive solution...FIAT/IFTA
 
Google Analytics Konferenz 2018_Machine Learning / AI mit Google_Lukman Ramse...
Google Analytics Konferenz 2018_Machine Learning / AI mit Google_Lukman Ramse...Google Analytics Konferenz 2018_Machine Learning / AI mit Google_Lukman Ramse...
Google Analytics Konferenz 2018_Machine Learning / AI mit Google_Lukman Ramse...e-dialog GmbH
 
Using AI to transcribe qualitative data: Personal reflections of an experienc...
Using AI to transcribe qualitative data: Personal reflections of an experienc...Using AI to transcribe qualitative data: Personal reflections of an experienc...
Using AI to transcribe qualitative data: Personal reflections of an experienc...John Wren
 
Integra Micro Software Services (P) Ltd. - Imaging Expertise
Integra Micro Software Services (P) Ltd. - Imaging ExpertiseIntegra Micro Software Services (P) Ltd. - Imaging Expertise
Integra Micro Software Services (P) Ltd. - Imaging Expertisesreesinbox
 
Animation Framework with Internationalization (I18n)
Animation Framework with Internationalization (I18n)Animation Framework with Internationalization (I18n)
Animation Framework with Internationalization (I18n)IRJET Journal
 
Netex learningMaker | Authoring tool for HTML5 e-learning content [EN]
Netex learningMaker | Authoring tool for HTML5 e-learning content [EN]Netex learningMaker | Authoring tool for HTML5 e-learning content [EN]
Netex learningMaker | Authoring tool for HTML5 e-learning content [EN]Netex Learning
 
IRJET- Survey Paper: Image Reader for Blind Person
IRJET- Survey Paper: Image Reader for Blind PersonIRJET- Survey Paper: Image Reader for Blind Person
IRJET- Survey Paper: Image Reader for Blind PersonIRJET Journal
 
ai courses
ai coursesai courses
ai coursesramyav17
 
ARTIFICIAL INTELLIGENCE TRAINING
ARTIFICIAL INTELLIGENCE TRAININGARTIFICIAL INTELLIGENCE TRAINING
ARTIFICIAL INTELLIGENCE TRAININGramyav17
 

Similar to Prior AI consulting use cases (20)

Rosinski ibm ai overview with several examples of projects in the media and l...
Rosinski ibm ai overview with several examples of projects in the media and l...Rosinski ibm ai overview with several examples of projects in the media and l...
Rosinski ibm ai overview with several examples of projects in the media and l...
 
Harendra singh ai_consulting_portfolio_2021
Harendra singh ai_consulting_portfolio_2021Harendra singh ai_consulting_portfolio_2021
Harendra singh ai_consulting_portfolio_2021
 
Python Open CV
Python Open CVPython Open CV
Python Open CV
 
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
 
Agile Mumbai 2022 - Rohit Handa | Combining Human and Artificial Intelligence...
Agile Mumbai 2022 - Rohit Handa | Combining Human and Artificial Intelligence...Agile Mumbai 2022 - Rohit Handa | Combining Human and Artificial Intelligence...
Agile Mumbai 2022 - Rohit Handa | Combining Human and Artificial Intelligence...
 
How to prepare a perfect video abstract for your research paper – Pubrica.pdf
How to prepare a perfect video abstract for your research paper – Pubrica.pdfHow to prepare a perfect video abstract for your research paper – Pubrica.pdf
How to prepare a perfect video abstract for your research paper – Pubrica.pdf
 
How to prepare a perfect video abstract for your research paper – Pubrica.pptx
How to prepare a perfect video abstract for your research paper – Pubrica.pptxHow to prepare a perfect video abstract for your research paper – Pubrica.pptx
How to prepare a perfect video abstract for your research paper – Pubrica.pptx
 
IRJET- Voice to Code Editor using Speech Recognition
IRJET- Voice to Code Editor using Speech RecognitionIRJET- Voice to Code Editor using Speech Recognition
IRJET- Voice to Code Editor using Speech Recognition
 
Automatic multi-modal metadata annotation based on trained cognitive solution...
Automatic multi-modal metadata annotation based on trained cognitive solution...Automatic multi-modal metadata annotation based on trained cognitive solution...
Automatic multi-modal metadata annotation based on trained cognitive solution...
 
Intelligent ChatBot
Intelligent ChatBotIntelligent ChatBot
Intelligent ChatBot
 
Google Analytics Konferenz 2018_Machine Learning / AI mit Google_Lukman Ramse...
Google Analytics Konferenz 2018_Machine Learning / AI mit Google_Lukman Ramse...Google Analytics Konferenz 2018_Machine Learning / AI mit Google_Lukman Ramse...
Google Analytics Konferenz 2018_Machine Learning / AI mit Google_Lukman Ramse...
 
CV machine learning freelancer
CV machine learning freelancerCV machine learning freelancer
CV machine learning freelancer
 
Using AI to transcribe qualitative data: Personal reflections of an experienc...
Using AI to transcribe qualitative data: Personal reflections of an experienc...Using AI to transcribe qualitative data: Personal reflections of an experienc...
Using AI to transcribe qualitative data: Personal reflections of an experienc...
 
Integra Micro Software Services (P) Ltd. - Imaging Expertise
Integra Micro Software Services (P) Ltd. - Imaging ExpertiseIntegra Micro Software Services (P) Ltd. - Imaging Expertise
Integra Micro Software Services (P) Ltd. - Imaging Expertise
 
Ai/ML services
Ai/ML servicesAi/ML services
Ai/ML services
 
Animation Framework with Internationalization (I18n)
Animation Framework with Internationalization (I18n)Animation Framework with Internationalization (I18n)
Animation Framework with Internationalization (I18n)
 
Netex learningMaker | Authoring tool for HTML5 e-learning content [EN]
Netex learningMaker | Authoring tool for HTML5 e-learning content [EN]Netex learningMaker | Authoring tool for HTML5 e-learning content [EN]
Netex learningMaker | Authoring tool for HTML5 e-learning content [EN]
 
IRJET- Survey Paper: Image Reader for Blind Person
IRJET- Survey Paper: Image Reader for Blind PersonIRJET- Survey Paper: Image Reader for Blind Person
IRJET- Survey Paper: Image Reader for Blind Person
 
ai courses
ai coursesai courses
ai courses
 
ARTIFICIAL INTELLIGENCE TRAINING
ARTIFICIAL INTELLIGENCE TRAININGARTIFICIAL INTELLIGENCE TRAINING
ARTIFICIAL INTELLIGENCE TRAINING
 

Recently uploaded

Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 

Recently uploaded (20)

Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 

Prior AI consulting use cases

  • 1. Digital Transformation with AI and Process Automation Harendra Singh AI & AUTOMATION CONSULTANT harendra@lueinanalytics.com LUEIN ANALYTICS
  • 2. About A thought leader and machine learning enthusiast with great passion for technology, creativity, challenging goals & social entrepreneurship. Consulted more than 20+ startups in the areas of Natural Language Processing, Text/Image/ Video/Audio Processing, Deep Learning, Data Mining, Machine Learning, Unstructured Document Parsing, Statistics or related field Core expertise in : 1. AI and Automation product development 2. AI team hiring and team building for clients 3. AI training and team skill development 4. Remote AI product development and consulting 5. Generating new revenue stream for client and also helping client to save a lot of cost with automated solutions.
  • 3. Optimized the logistic packing algorithm Select the best and most efficient box for a shipment ● Dependent factors - shipper types, dimensions of items, rotations, packing more than one items at a time, shipping costs, operation time, and experience for both our clients and their customers. ● Ensemble model with combination of below algorithms: ○ First Fit Descending, pack the biggest products first in the smallest space we can. ○ Knapsack problem, given a set of items, each with a weight and a value, determine the number of each item to include in a collection so that the total weight is less than or equal to a given limit and the total value is as large as possible. ○ Bin packing problem, objects of different volumes must be packed into a finite number of bins or containers each of volume V in a way that minimizes the number of bins used.
  • 4. Auto sectioned sample PDF output PDF content extraction starts with manually zoning all the sections ( like title, images, sub-titles, page number, paragraphs, etc ) in a PDF. Then it goes for section by section content extraction With image processing and convolution neural network, we have automated the entire manual zoning process. We treated the text problem into an image problem and solved the manual zoning problem effectively. Auto PDF section identification and extraction
  • 5. AI for video hiring ( AUDIO / SPEECH / VIDEO / IMAGE ) AI for video interview and video resume from saved or real time interview streaming ● Video content parser ● Video content analytics ● Calculate candidates confidence level ● Calculate discussion interest level ● Calculate subject understanding level ● Calculate communications levels ● Calculate solution approach strategy ● Calculate calculate response time to answer ● Check whether candidate copying during interview. ● Candidate nervousness (with facial micro expressions) ● Conduct a expert less interview process.
  • 6. Text summarization and generation (under process) We are currently working on sophisticated Neural Network algorithm combined with supporting framework and infrastructure, we running multiple researches to solve below Publishing use cases in the space of Natural Language Generation: 1. Automatically create an initial article for author from a topic Create unique content from scratch, simulating a human writer. You choose the topic and length, and the algorithm will create your textual content. 2. NLG Financial Services Automatically generate high quality personalised risk analysis, financial, compliance and other reports in writing in seconds.. 3. Academic books summarization Automatically generate titles, rewrite articles and content summarisation for academic books.
  • 7. Content generation Machine scans an image and generate an alternate description for the same. ● Significant amount of scientific research has been carried out and still under process, to solve one of the toughest problem of image alternate text generation (also called as image caption generation). ● Machine takes an images as an input and returns a short or long description about the input image. ● Neural network based algorithm image classification, object detection, computer vision processing, along with natural language generation has been used to solve this problem . ● Image captioning domain is quite vast, we have achieved initial success in medical equipment images, maths graphs/reports and complex object detection.
  • 8. 1. Extract faces and names from school yearbook scanned image dataset (collected from US, UK and few more European countries school year books since past 45 years) and create facial database for the client automatically. 2. Detect face from individual face images and also from group scanned yearbook image files. 3. Extract Names and alias names from the same scanned yearbook image file. 4. Using natural language processing and custom business rule to associate faces with their names and feed it into the database. Face detection, extraction and auto tagging Sample input image file
  • 9. Tone and actor labeled srt file Extract dialogue and create subtitle file Extract speech features Translate srt file Process audio for tone and features Embed tone feature in translated audio Speech trained model Srt file Input Video Video auto subtitleing. Video emotion extraction (emotionML). Audio feature extraction. Tone analysis. Subtitling translation to another language. Use text2speech library to convert translated text to another language speech/dialogue text. Embed original tone intensity features from original audio to new audio, also embed video emotion features to make the final audio output look more realistic. Embed new audio to original video file to generate dubbed video file. Use minor manual modulation (if needed) to sync the final dialogue speed and timings Video auto-dubbing (under development) Use tone features Embed new audio to video with dialog time Dubbed Video Tone analysis injection text2speech
  • 10. Generate alternate text for an image Neural Network based generative model for captioning/ describing an image automatically in natural language. Machine learning model trained with supervised 20k plus manually labeled images. The process consist of three core components : 1. CNN encoder A pre-trained CNN is used to encode an image to its features and also pre-encoded each image to its feature set for high performance and speed. 2. Word embedding Used pre-trained word embedding model and also explicitly trained and embedding model that takes a word and outputs an embedding vector. 3. CNN decoder It takes the image vector and partial captions at the current timestep and input and generated the next most probable word as output.
  • 11. Research and work on multimedia ( AUDIO / SPEECH / VIDEO / IMAGE ) 1. VIDEO PROCESSING Automatic subtitling for TV series videos. Video search within for tagging opening credits and closing credits. Video dubbing- translating and embedding speech features in new audio (under research). Real time emotion and expression identification for complex behavior identification. 2. SPEECH PROCESSING Audio Signal Classification, speech detection, audio tone analysis, dialogue recognition, speech-to-text and text-to-speech conversion. act text with layout from scanned documents. 3. IMAGE PROCESSING WITH NEURAL NETWORK Object detection, face detection & recognition, image similarity check and image upsampling / downsampling 4. CONTENT EXTRACTION FROM IMAGE ​Text area detection. Automatic layout analysis. Object detection & labelling. Content extraction from scanned financial forms. 5. IMAGE CAPTIONING Object detection, brand detection, event detection and automatically generate alternate-text for an image. 6. SMART OCR Text/image/table detection & extraction. OCR orientation detection & correction. Extract text with layout from scanned documents. Worked on many multimedia AI use cases. Listing all the recent individual research and client use cases as below :
  • 12. 1. Adaptability is more vital to success than ever. 2. Smart machines and artificial intelligence (AI) are taking off in a big way. 3. Growing importance of the user experience. 4. Innovating rapidly and AI is being used across organisations. 5. Organisations are embracing remote AI workforces. 6. AI growth is sustainable and should not Be Ignored WHY COMPANIES NEED AI DIGITAL TRANSFORMATION
  • 13. PRIOR INDIVIDUAL CONSULTING EXPERIENCE WITH And more..