• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
Practical Natural Language Processing
 

Practical Natural Language Processing

on

  • 756 views

 

Statistics

Views

Total Views
756
Views on SlideShare
756
Embed Views
0

Actions

Likes
5
Downloads
0
Comments
0

0 Embeds 0

No embeds

Accessibility

Categories

Upload Details

Uploaded via as Adobe PDF

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

    Practical Natural Language Processing Practical Natural Language Processing Presentation Transcript

    • Practical Natural Language ProcessingFrom Theory to Industrial ApplicationsJaganadh Ghttp://jaganadhg.injaganadhg@gmail.comCentral University of KeralaKasargod22ndFeb 2013Jaganadh G Practical Natural Language Processing
    • About me !!Working in Natural Language Processing, MachineLearning, Data Mining etc...Passionate about Free and Open source :-)When gets free time teaches Python, Speaks about FOSSand blogs athttp://jaganadhg.inI am a computational linguist / Linguist and Indologist,Book reviewerSoftware Engineer by ProfessionJaganadh G Practical Natural Language Processing
    • Question ??Have you ever used any Natural Language Processing basedtools/services?Jaganadh G Practical Natural Language Processing
    • Question ??Have you ever used any Natural Language Processing basedtools/services?Jaganadh G Practical Natural Language Processing
    • Question ??Have you ever used any Natural Language Processing basedtools/services?Jaganadh G Practical Natural Language Processing
    • What is Natural Language Processing (NLP) ?Aim : To build intelligent systems that can interact withhuman beings as like human beingsJaganadh G Practical Natural Language Processing
    • What is Natural Language Processing (NLP) ?Aim : To build intelligent systems that can interact withhuman beings as like human beingsJaganadh G Practical Natural Language Processing
    • What is Natural Language Processing (NLP) ?Aim : To build intelligent systems that can interact withhuman beings as like human beingsA sub-field of Artificial Intelligence (AI)Jaganadh G Practical Natural Language Processing
    • What is Natural Language Processing (NLP) ?Aim : To build intelligent systems that can interact withhuman beings as like human beingsA sub-field of Artificial Intelligence (AI)Inter-disciplinary subject (Language + Linguistics +Statistics + Computer Science + .. )Natural LanguageRefers to the language spoken by people, e.g.English,Japanese, Tamil, Malayalam as opposed to artificiallanguages, like C++, Java, etc.Jaganadh G Practical Natural Language Processing
    • DefinitionNatural Language ProcessingNatural Language Processing is a theoretically motivated rangeof computational techniques for analyzing and representingnaturally occurring texts/speech at one or more levels oflinguistic analysis for the purpose of achieving human-likelanguage processing for a range of tasks or applications.NLP was considered as an academic discipline beforesome 10 to 20 years.Now concepts from NLP is applied in variety ofComputing Platforms and ServicesJaganadh G Practical Natural Language Processing
    • Practical NLP ?ProblemPicture Courtesy: http://twitpic.com/1y21qm/fullJaganadh G Practical Natural Language Processing
    • Practical NLP ?ProblemBefore going to some theory can we have some funnypractical problems to solve ?Picture Courtesy: http://twitpic.com/1y21qm/fullJaganadh G Practical Natural Language Processing
    • Practical NLP ?ProblemBefore going to some theory can we have some funnypractical problems to solve ?Picture Courtesy: http://twitpic.com/1y21qm/fullJaganadh G Practical Natural Language Processing
    • Practical NLPProblemJaganadh G Practical Natural Language Processing
    • Practical NLPProblemTweet-a-Toddy receives thousands of tweets per dayJaganadh G Practical Natural Language Processing
    • Practical NLPProblemTweet-a-Toddy receives thousands of tweets per dayTweets requesting home deliveryJaganadh G Practical Natural Language Processing
    • Practical NLPProblemTweet-a-Toddy receives thousands of tweets per dayTweets requesting home deliveryTweets about quality of productsJaganadh G Practical Natural Language Processing
    • Practical NLPProblemTweet-a-Toddy receives thousands of tweets per dayTweets requesting home deliveryTweets about quality of productsTweets related to enquirersJaganadh G Practical Natural Language Processing
    • Practical NLPProblemTweet-a-Toddy receives thousands of tweets per dayTweets requesting home deliveryTweets about quality of productsTweets related to enquirersThey requires following things to be automatedJaganadh G Practical Natural Language Processing
    • Practical NLPProblemTweet-a-Toddy receives thousands of tweets per dayTweets requesting home deliveryTweets about quality of productsTweets related to enquirersThey requires following things to be automatedIdentify tweet categoryJaganadh G Practical Natural Language Processing
    • Practical NLPProblemTweet-a-Toddy receives thousands of tweets per dayTweets requesting home deliveryTweets about quality of productsTweets related to enquirersThey requires following things to be automatedIdentify tweet categoryProcess home-delivery requestJaganadh G Practical Natural Language Processing
    • Practical NLPProblemTweet-a-Toddy receives thousands of tweets per dayTweets requesting home deliveryTweets about quality of productsTweets related to enquirersThey requires following things to be automatedIdentify tweet categoryProcess home-delivery requestEvaluate quality related tweetsJaganadh G Practical Natural Language Processing
    • Practical NLPProblemTweet-a-Toddy receives thousands of tweets per dayTweets requesting home deliveryTweets about quality of productsTweets related to enquirersThey requires following things to be automatedIdentify tweet categoryProcess home-delivery requestEvaluate quality related tweetsHow?How to find a solution for Tweet-a-ToddyJaganadh G Practical Natural Language Processing
    • Solution??Any SolutionsJaganadh G Practical Natural Language Processing
    • Solution??Any SolutionsSome thoughtsJaganadh G Practical Natural Language Processing
    • Solution??Any SolutionsSome thoughtsText ClassificationJaganadh G Practical Natural Language Processing
    • Solution??Any SolutionsSome thoughtsText ClassificationEntity IdentificationJaganadh G Practical Natural Language Processing
    • Solution??Any SolutionsSome thoughtsText ClassificationEntity IdentificationInformation ExtractionJaganadh G Practical Natural Language Processing
    • Solution??Any SolutionsSome thoughtsText ClassificationEntity IdentificationInformation ExtractionSentiment AnalysisJaganadh G Practical Natural Language Processing
    • Solution??Any SolutionsSome thoughtsText ClassificationEntity IdentificationInformation ExtractionSentiment AnalysisParsing, gammer ...Jaganadh G Practical Natural Language Processing
    • Solution??Any SolutionsSome thoughtsText ClassificationEntity IdentificationInformation ExtractionSentiment AnalysisParsing, gammer ...Regex (Regular Expressions)Jaganadh G Practical Natural Language Processing
    • Another Practical QuestionEverybody might have used spell checker available in wordprocessing systems like OpenOffice.org or Microsoft Word Anyguess on how to develop a spell checker system ?SolutionsJaganadh G Practical Natural Language Processing
    • Another Practical QuestionEverybody might have used spell checker available in wordprocessing systems like OpenOffice.org or Microsoft Word Anyguess on how to develop a spell checker system ?SolutionsWord ListJaganadh G Practical Natural Language Processing
    • Another Practical QuestionEverybody might have used spell checker available in wordprocessing systems like OpenOffice.org or Microsoft Word Anyguess on how to develop a spell checker system ?SolutionsWord ListStructure of wordsJaganadh G Practical Natural Language Processing
    • Another Practical QuestionEverybody might have used spell checker available in wordprocessing systems like OpenOffice.org or Microsoft Word Anyguess on how to develop a spell checker system ?SolutionsWord ListStructure of wordsDynamic Programming (Edit Distance)Jaganadh G Practical Natural Language Processing
    • Another Practical Question ...Context Sensitive Spell-checkingIdentifying and suggesting spelling of words based on contextHow ??Jaganadh G Practical Natural Language Processing
    • Another Practical Question ...Context Sensitive Spell-checkingIdentifying and suggesting spelling of words based on contextHow ??SolutionsJaganadh G Practical Natural Language Processing
    • Another Practical Question ...Context Sensitive Spell-checkingIdentifying and suggesting spelling of words based on contextHow ??SolutionsStatistical ModelsJaganadh G Practical Natural Language Processing
    • Another Practical Question ...Context Sensitive Spell-checkingIdentifying and suggesting spelling of words based on contextHow ??SolutionsStatistical ModelsWord category based suggestionsJaganadh G Practical Natural Language Processing
    • Can Machines Translate ??Answer !!!Jaganadh G Practical Natural Language Processing
    • Why NLP ?Because ”Information is Power !!!”Jaganadh G Practical Natural Language Processing
    • Why NLP ?Because ”Information is Power !!!”Picture Courtesy: http://soundsgood.in/wikipediafat print book/Jaganadh G Practical Natural Language Processing
    • Why NLP ?Because ”Information is Power !!!”Every day wast amount of text and speech data is beingproducedPicture Courtesy: http://soundsgood.in/wikipediafat print book/Jaganadh G Practical Natural Language Processing
    • Why NLP ?Because ”Information is Power !!!”Every day wast amount of text and speech data is beingproducedInternet == at least 40 Million pagesPicture Courtesy: http://soundsgood.in/wikipediafat print book/Jaganadh G Practical Natural Language Processing
    • Why NLP ?Because ”Information is Power !!!”Every day wast amount of text and speech data is beingproducedInternet == at least 40 Million pagesPicture Courtesy: http://soundsgood.in/wikipediafat print book/Jaganadh G Practical Natural Language Processing
    • HistoryJaganadh G Practical Natural Language Processing
    • HistorySecond World War !!!Jaganadh G Practical Natural Language Processing
    • HistorySecond World War !!!Machine TranslationJaganadh G Practical Natural Language Processing
    • HistorySecond World War !!!Machine TranslationNow :Jaganadh G Practical Natural Language Processing
    • HistorySecond World War !!!Machine TranslationNow :Most promising imperfect technologyJaganadh G Practical Natural Language Processing
    • HistorySecond World War !!!Machine TranslationNow :Most promising imperfect technologyMoves from Lab to Industry to LaymanJaganadh G Practical Natural Language Processing
    • NLP Really Hard to Achieve?NLP delas with human languagesHuman Language is dynamic and mysterious !!!Jaganadh G Practical Natural Language Processing
    • NLP Really Hard to Achieve?NLP delas with human languagesHuman Language is dynamic and mysterious !!!Communication in Human LanguageJaganadh G Practical Natural Language Processing
    • NLP Really Hard to Achieve?Levels of Knowledge encoding in Language DataJaganadh G Practical Natural Language Processing
    • Tasks in NLPBroad AreasJaganadh G Practical Natural Language Processing
    • Tasks in NLPBroad AreasText ProcessingJaganadh G Practical Natural Language Processing
    • Tasks in NLPBroad AreasText ProcessingSpeech ProcessingJaganadh G Practical Natural Language Processing
    • Major tasks in Text ProcessingJaganadh G Practical Natural Language Processing
    • Major tasks in Text ProcessingWord Level AnalysisJaganadh G Practical Natural Language Processing
    • Major tasks in Text ProcessingWord Level AnalysisMorphological SynthesisJaganadh G Practical Natural Language Processing
    • Major tasks in Text ProcessingWord Level AnalysisMorphological SynthesisPart of Speech TaggingJaganadh G Practical Natural Language Processing
    • Major tasks in Text ProcessingWord Level AnalysisMorphological SynthesisPart of Speech TaggingStemmingJaganadh G Practical Natural Language Processing
    • Major tasks in Text ProcessingWord Level AnalysisMorphological SynthesisPart of Speech TaggingStemmingLemmatizationJaganadh G Practical Natural Language Processing
    • Major tasks in Text ProcessingWord Level AnalysisMorphological SynthesisPart of Speech TaggingStemmingLemmatizationSentence Level Analysis - Syntactical ParsingJaganadh G Practical Natural Language Processing
    • Major tasks in Text ProcessingWord Level AnalysisMorphological SynthesisPart of Speech TaggingStemmingLemmatizationSentence Level Analysis - Syntactical ParsingDiscourse Analysis - Semantic ProcessingJaganadh G Practical Natural Language Processing
    • MorphologyThe branch of linguistics that studies word structures.Jaganadh G Practical Natural Language Processing
    • MorphologyThe branch of linguistics that studies word structures.To a computer program a word is : ???Jaganadh G Practical Natural Language Processing
    • MorphologyThe branch of linguistics that studies word structures.To a computer program a word is : ???Morphological analysis can be explained as: the process ofanalyzing words to identify its constituentsJaganadh G Practical Natural Language Processing
    • MorphologyThe branch of linguistics that studies word structures.To a computer program a word is : ???Morphological analysis can be explained as: the process ofanalyzing words to identify its constituentsComputational Analysis of MorphologyMorphological AnalysisJaganadh G Practical Natural Language Processing
    • MorphologyThe branch of linguistics that studies word structures.To a computer program a word is : ???Morphological analysis can be explained as: the process ofanalyzing words to identify its constituentsComputational Analysis of MorphologyMorphological AnalysisJaganadh G Practical Natural Language Processing
    • MorphologyThe branch of linguistics that studies word structures.To a computer program a word is : ???Morphological analysis can be explained as: the process ofanalyzing words to identify its constituentsComputational Analysis of MorphologyMorphological AnalysisMorphological GenerationJaganadh G Practical Natural Language Processing
    • MorphologyThe branch of linguistics that studies word structures.To a computer program a word is : ???Morphological analysis can be explained as: the process ofanalyzing words to identify its constituentsComputational Analysis of MorphologyMorphological AnalysisMorphological GenerationStemmingJaganadh G Practical Natural Language Processing
    • MorphologyThe branch of linguistics that studies word structures.To a computer program a word is : ???Morphological analysis can be explained as: the process ofanalyzing words to identify its constituentsComputational Analysis of MorphologyMorphological AnalysisMorphological GenerationStemmingLemmatizationJaganadh G Practical Natural Language Processing
    • Practical Question from MorphologyApproximate number of word forms that can be derived fromthe word”maram”Jaganadh G Practical Natural Language Processing
    • Parts of Speech TaggingPOS tagging is the process of marking up the words in a text(corpus) as corresponding to a particular part of speech, basedon both its definition, as well as its context.Ram goes to school.Ram/NNP goes/VBZ to/TO school/NN ./.Jaganadh G Practical Natural Language Processing
    • Parts of Speech TaggingPOS tagging is the process of marking up the words in a text(corpus) as corresponding to a particular part of speech, basedon both its definition, as well as its context.Ram goes to school.Ram/NNP goes/VBZ to/TO school/NN ./.Words are ambiguous !!!!e.g. book, cricket, bankJaganadh G Practical Natural Language Processing
    • Syntactical ParsingParsingIn computer science and linguistics, parsing, or, more formally,syntactic analysis, is the process of analyzing a text, made of asequence of tokens (for example, words), to determine itsgrammatical structure with respect to a given (more or less)formal grammar.Jaganadh G Practical Natural Language Processing
    • Syntactical ParsingParsingIn computer science and linguistics, parsing, or, more formally,syntactic analysis, is the process of analyzing a text, made of asequence of tokens (for example, words), to determine itsgrammatical structure with respect to a given (more or less)formal grammar.Sentences are ambiguous !!!!Jaganadh G Practical Natural Language Processing
    • SemanticsStudy of meaning ans its structureJaganadh G Practical Natural Language Processing
    • SemanticsStudy of meaning ans its structureWord meaning is ambiguous !!!!E.g. marriageJaganadh G Practical Natural Language Processing
    • Where can I apply this techniques?Machine Translation SystemsJaganadh G Practical Natural Language Processing
    • Where can I apply this techniques?Machine Translation SystemsSearch EngineJaganadh G Practical Natural Language Processing
    • Where can I apply this techniques?Machine Translation SystemsSearch EngineSpell-checkerJaganadh G Practical Natural Language Processing
    • Where can I apply this techniques?Machine Translation SystemsSearch EngineSpell-checkerGrammar CheckerJaganadh G Practical Natural Language Processing
    • Where can I apply this techniques?Machine Translation SystemsSearch EngineSpell-checkerGrammar Checker..........Jaganadh G Practical Natural Language Processing
    • Other Interesting TasksNamed Entity IdentificationJaganadh G Practical Natural Language Processing
    • Other Interesting TasksNamed Entity IdentificationInformation ExtractionJaganadh G Practical Natural Language Processing
    • Other Interesting TasksNamed Entity IdentificationInformation ExtractionInformation RetrievalJaganadh G Practical Natural Language Processing
    • Other Interesting TasksNamed Entity IdentificationInformation ExtractionInformation RetrievalText Classification and ClusteringJaganadh G Practical Natural Language Processing
    • Speech ProcessingTwo Major AreasText to SpeechSpeech RecognitionJaganadh G Practical Natural Language Processing
    • Speech ProcessingTwo Major AreasText to SpeechSpeech RecognitionPractical ApplicationsIVRTechnology for Visually Challenged PeopleMobile PhonesSpeech Enabled WebVehicle Mounted GPS NavigatorJaganadh G Practical Natural Language Processing
    • Commerical NLP ApplicationsWhat Industry LooksJaganadh G Practical Natural Language Processing
    • Commerical NLP ApplicationsWhat Industry LooksComponents of Word ProcessorsJaganadh G Practical Natural Language Processing
    • Commerical NLP ApplicationsWhat Industry LooksComponents of Word ProcessorsMachine Translation SystemsJaganadh G Practical Natural Language Processing
    • Commerical NLP ApplicationsWhat Industry LooksComponents of Word ProcessorsMachine Translation SystemsCustom Search SystemsJaganadh G Practical Natural Language Processing
    • Commerical NLP ApplicationsWhat Industry LooksComponents of Word ProcessorsMachine Translation SystemsCustom Search SystemsInformation ExtractionJaganadh G Practical Natural Language Processing
    • Commerical NLP ApplicationsWhat Industry LooksComponents of Word ProcessorsMachine Translation SystemsCustom Search SystemsInformation ExtractionEntity IdentificationJaganadh G Practical Natural Language Processing
    • Commerical NLP ApplicationsWhat Industry LooksComponents of Word ProcessorsMachine Translation SystemsCustom Search SystemsInformation ExtractionEntity IdentificationText SummarizationJaganadh G Practical Natural Language Processing
    • Commerical NLP ApplicationsWhat Industry LooksComponents of Word ProcessorsMachine Translation SystemsCustom Search SystemsInformation ExtractionEntity IdentificationText SummarizationSpeech SystemsJaganadh G Practical Natural Language Processing
    • Commerical NLP ApplicationsWhat Industry LooksComponents of Word ProcessorsMachine Translation SystemsCustom Search SystemsInformation ExtractionEntity IdentificationText SummarizationSpeech SystemsQuestion Answering SystemsJaganadh G Practical Natural Language Processing
    • Future of NLPFuture!!!Semantics oriented technologiesJaganadh G Practical Natural Language Processing
    • NLP in other domainsBio-MedicalLegalForensic ScienceAdvertisementEducationPoliticsE-governanceBusiness DevelopmentMarketingand where ever we use language !!!Jaganadh G Practical Natural Language Processing
    • Natural Language Processing in IndiaAcademic InstitutionsIIT Kanpur, Kharagpur, BombayIIIT hydrabadIISc BangaloreAU-KBC ChennaiAmritha University Ettimadai, CoimbatoreIIITMK, TrivandrumCentral University, HydrabadJNU, DelhiTamil University, ThanjoreJaganadh G Practical Natural Language Processing
    • Natural Language Processing in IndiaIndustryMicrosoftYahoo!AOL365Media Pvt. Ltd.Inside ViewThaazzaAIAIO LabsJaganadh G Practical Natural Language Processing
    • Questions ??Jaganadh G Practical Natural Language Processing
    • ReferencesDaniel Jurafsky,James H. Martin, SPEECH andLANGUAGE PROCESSING, 2ndEdition.U.S. Tiwary, Tanveer Siddiqui , Natural LanguageProcessing and Information RetrievalJaganadh G Practical Natural Language Processing
    • FinallyJaganadh G Practical Natural Language Processing
    • Questions ??Jaganadh G Practical Natural Language Processing
    • ReferencesDaniel Jurafsky,James H. Martin, SPEECH andLANGUAGE PROCESSING, 2ndEdition.U.S. Tiwary, Tanveer Siddiqui , Natural LanguageProcessing and Information RetrievalJaganadh G Practical Natural Language Processing
    • FinallyJaganadh G Practical Natural Language Processing