SlideShare a Scribd company logo
1 of 7
Download to read offline
INTERNATIONALComputer EngineeringCOMPUTER ENGINEERING
  International Journal of
                              JOURNAL OF and Technology (IJCET), ISSN 0976-
  6367(Print), ISSN 0976 – 6375(Online) Volume 4, Issue 1, January- February (2013), © IAEME
                             & TECHNOLOGY (IJCET)
ISSN 0976 – 6367(Print)
ISSN 0976 – 6375(Online)
Volume 4, Issue 1, January- February (2013), pp. 124-130
                                                                              IJCET
© IAEME: www.iaeme.com/ijcet.asp
Journal Impact Factor (2012): 3.9580 (Calculated by GISI)                  ©IAEME
www.jifactor.com




     THE MACHINE LEARNING METHOD REGARDING EFFICIENT
             SOFT COMPUTING AND ICT USING SVM

                        1
                           TARUN DHAR DIWAN, 2UPASANA SINHA
                                   ASSISTANT PROFESSOR
                                   DEPT. OF ENGINEERING
                      1
                        Dr.C.V.RAMAN UNIVERSITY, BILASPUR (INDIA)
                  2
                    J K INSTITUTE OF ENGINEERING, BILASPUR (INDIA)
                       1
                        taruncsit@gmail.com, 2upasana.sihna@gmail.com

  ABSTRACT

          Soft Computing and Information Communication Technology (ICT) both are main
  part in information technology and computer science because today all documentations are
  computerized to store necessary information and database into English language, so natural
  language processing concept is come from artificial intelligence (A.I) branch of computer
  science and information technology as part of soft computing and ICT (information
  communication technology). Here English language may be used as a major part of machine
  learning method ,where sentence boundary detection is a great challenge in current time of
  NLP(natural language processing).So SVM(support vector machine) can be used to solve the
  problem of sentence boundary detection which indicates the machine learning method
  regarding efficient soft computing and ICT. Hence Computer system may be learned to avoid
  ambiguity of dot (.) periods in detecting sentence boundary.

  I. INTRODUCTION

          Basically sentence boundary detection is essential feature of recognizing the correct
  sentence meaning in text area and natural language processing, because without valid
  meaning based documentation has no any information extraction features and communication
  is generated in efficient successful trend. SVM (support vector machine) may be used to
  extract the feature of sentence boundaries where Support Vector Machine (SVM) is a
  machine learning model and which solved the problem of sentence boundary detection by
  calculation of precision, recall, f feature value [1]. Feature extraction f value is evaluated on

                                                124
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-
6367(Print), ISSN 0976 – 6375(Online) Volume 4, Issue 1, January- February (2013), © IAEME

the basis of precision, recall value calculation.Sentence boundary detection (SBD) means
detecting end of sentence in given or used text.
Various type of punctuation marks contain multiple feature at one time which gives sentence
boundary ambiguity in text , This type of error of ambiguity can be find by calculating
precision, recall values and feature values of support vector machine and in such a way
sentence boundary is detected , where ambiguity is come due to multiple behavior of one
punctuation mark or symbols in text at one time, then precision and recall values are
calculated to provide better information on what kinds of errors were made by ambiguous
punctuation mark. Basically SVM (support vector machine) firstly calculate ambiguity of
each segment in input English text by calculating precision value, here segment is one part of
input text but not lead to exact sentence boundary [2]. Now we calculate recall value to find
exact sentence boundary and f is calculated at last to extract features of each segment of input
text.

II. RELATED WORK

        Basically here sentence boundary detection is done on the basis of dependency
analysis of each segment of text. Segment of text is not necessary to occur as a boundary of
segment so there dependency analysis of previous and next segment of text is done in three
type of sentence which is given bellow:

   1. Open dependency based sentence.
   2. Closed dependency based sentence.
   3. Without dependency based sentence i.e. Independent sentence.

In above three type of sentence based ambiguous text has a lot of need of detecting sentence
boundary detection and after analysis of this three type of sentence in this paper by using
suitable example, it is search that more feature of text segment gives more ambiguity with
minimum SBD (Sentence Boundary Detection) [1,3]..

III. CHALLENGES/ PROBLEMS OF SENTENCE BOUNDARY DETECTION

       In sentence boundary detection following problems are come as challenges to detect
end of sentence in creating recognize correct meaning of any text of English language during
soft computing or information communication technology (ICT) [4].

   1. Recognize tokens by removing white space and special characters from English text.
   2. Resolving ambiguous separators in numbers to text.
   3. Resolving ambiguous abbreviation in text.
   4. Then resolving end of sentences (EOS) (?).
   5. Morphology analysis on a corpus database in a form of statistical analysis and
      word/letter analysis which are part of NLP.
   6. Correct syntactic and semantic meaning of each word by using grammar i.e.
      morphology and lexical analysis.
   7. Avoiding ambiguity in punctuation marks.

                                              125
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-
6367(Print), ISSN 0976 – 6375(Online) Volume 4, Issue 1, January- February (2013), © IAEME

IV. OBJECTIVE/GOAL OF PAPER

        Finding the solution of sentence boundary detection challenge as avoiding ambiguity
of dot (.) punctuation mark. It is done by calculating the error rate where error rate is related
to each segment of text which is evaluated to check dependency analysis on the basis of dot
(.) Punctuation as either it is a abbreviation, termination or name into English language based
text.

V. SIGNIFICANT OF PAPER

        In this paper ambiguity is avoided of dot(.) periods into each segment of English
language based text as to detect sentence boundary where f feature value is detected from
SVM as when any word is presence it is counted as 1 otherwise as 0 value and this 0 or 1
value is assign finally assign into vector of SVMs.

VI. PROPOSED METHODOLOGY

        SVM methodology is support the creation of document vector from the features
extraction value of each segment or line of text. Each new word seen by the SVM module is
internally assigned to a different coordinate in the vector. The value of each coordinate is
zero when the word is absent from a document otherwise one, basically it is a binary
arrangement of text segment vector in memory[5][6].So maximum feature f value is created
with maximum presence of one word at one time into one segment of text, which represent
high maximum ambiguity i.e. maximum precision value, and maximum ambiguity of any
word or string into one segment or line of text ,decreases the sentence break or boundary
detection behavior as a less(minimum)recall value. This is clear by using bellow example of
input, output text along precision, recall, f feature extraction value. To apply methodology,
calculation is done to calculate true positives, false positives, true negatives, and false
negatives values whose description is bellow given. These values are used in recall, precision,
f feature extraction value calculation formula[7][8]..

   1. True Positives: Those who test positive for a condition and are positive (i.e., have the
      condition).
   2. False Positives: Those who test positive, but are negative (i.e., do not have the
      condition).
   3. True Negatives: Those who test negative and are negative,
   4. False Negatives: Those who test negative, but are positive.

Now Precision is the ratio between the number of candidate tokens that have been correctly
assigned to a class and the number of all candidates that have been assigned to this class
[9][10].
Precision value=true positives / (true positives + false positives) ------- (1)

Recall is defined as the proportion of all candidates truly belonging to a certain class that
have also been assigned to that class by the evaluated system.
Recall value=true positives / (true positives + false negatives) -------- (2)
                                              126
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-
6367(Print), ISSN 0976 – 6375(Online) Volume 4, Issue 1, January- February (2013), © IAEME

Finally, the so-called F measure is the harmonic mean of precision and recall (van
Rijsbergen 1979).

f measure value =2* precision * recall / (precision + recall)    ---------(3)

VII. ANALYSIS

        Here if any sentence is closed with proper syntactic and semantic analysis where
start and end both are related to only one sentence then this type of sentence is come into
without dependency or independent type of sentence, but if end is correct as syntactic and
semantic way but starting is depend on previous segment of text then this type of sentience is
known as closed dependency based sentence and if both starting and ending of any segment
of text are incomplete the this type of segment is known as open dependency based sentence.

Therefore sentence boundary is introduced as three types on the basis of above description of
analysis i.e.
   1. Strong boundary(independent sentence i.e. without dependency based sentence)
   2. Weak boundary(open dependency based sentence)
   3. Absolute boundary(closed dependency based sentence)

Finding Dependency Structure of Each Sentence to Used Input Text

Example of Input English Text
I am doing Ph.D. Ph.D. is easy. Mr. and Mrs.
Sharma are well. They are Mr. and Mrs. Arun is
singing a song. Our college name is S.E.C.
Mr. V. Dixit is a best guide.

Expected Output after Used Methodology to Input English Text
I am doing Ph.D.
Ph.D. is easy.
Mr. and Mrs. Sharma are well.
They are Mr. and Mrs.
Arun is singing a song.
Our college name is S.E.C.
Mr. V. Dixit is a best guide.

Finding Four Type of Sentence Dependency Structure
1) I am doing Ph.D. Ph.D. is easy. Mr. and Mrs.
(Open dependency based sentence)
2) Sharma are well. They are Mr. and Mrs. Arun is
(Open dependency based sentence)
3) playing a game. Our college name is S.E.C.
(Closed dependency based sentence)
4) Mr. V. Dixit is a best guide.
 (Without dependency based sentence i.e. independent sentence)
                                              127
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-
6367(Print), ISSN 0976 – 6375(Online) Volume 4, Issue 1, January- February (2013), © IAEME

Here Precision to each statement.
10/10+7=1.4=10/17=0.588
10/10+3=3.3=10/13=0.769
8/8+6=1.3=8/14=4/7=0.571
7/7+3=2.3=7/10=0.7

Here Recall to each statement.
10/10+2=5=10/12=5/6=0.8333
10/10+2=5=10/12=5/6=0.8333
8/8+1=8=8/9=0.888
7/7+1=7/8=0.875
&

So f measure values-

(2*.588*.833)/ (.588+.833)=.979/1.421=0.688
(2*.769*.833)/ (.769+.833)=1.602
(2*(.571*.888)/ (.571+.888)= 1.014/1.459=0.694
(2*(.875*0.7)/ (.875+0.7)=1.225/1.575=0.777

 Here input English text has a evaluation of four type of sentences i.e. first two type
are open dependency based sentence, third closed dependency based sentence and
finally last fourth type of sentence is without dependency based sentence and above
calculation proof that max precision gives max ambiguity regarding to abbreviation,
punctuation, separator so sentence boundary has a need of evaluating ambiguity and
then eliminating it to gain exact sentence boundary to any input text.

Sentence boundary maximally detected to open dependency structure based text
because features are maximum with various individual sentence boundaries.

VIII. RESULT

Here f is feature measure i.e. lexical analysis, parsing or morphological analysis.
Recall is exact sentence boundary detection value calculation.
Precision is ambiguity calculation of each segment of used text.
Hence feature extraction value f is max 1.6% in open dependency based sentence to
maximum ambiguity of 0.8% precision value and corresponding exact sentence
boundary recall value is minimum i.e. 0.83%.




                                           128
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-
6367(Print), ISSN 0976 – 6375(Online) Volume 4, Issue 1, January- February (2013), © IAEME

                                       Table1
                          Sentence Boundary Detection Result
                            Table Obtained by Using SVMs




IX. CONCLUSION

        Maximum precision value with maximum f feature values generates the minimum
recall value where precision is ambiguity during detecting sentence boundary, f feature
values is made by ambiguous behavior of punctuation marks. This precision and f feature
values are creates a minimum recall value as less sentence boundary of text, i.e. weak
boundary extracts various features of text segment with less sentence boundary detection.

X. FUTURE WORK

        Machine learning method of SVM must be implemented to compiler design as
creating software of sentence boundary detection in all type of ambiguity of punctuation
marks and symbols to English, Hindi Text, it will come into automatic sentence boundary
detection sentence task and by there meaningful individual sentence are easy to recognize.

REFERENCES

[1] Tarun Dhar Diwan, "SVM (Support Vector Machine) Learning to Detect English
Sentence Boundary Regarding Dot (.) Punctuation marks" International Conference on
Emerging Trends in Soft Computing and ICT, pages78-82, ISBN: GURU GHASIDAS
VISHWAVIDYALAYA, Bilaspur (CG),INDIA



                                           129
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-
6367(Print), ISSN 0976 – 6375(Online) Volume 4, Issue 1, January- February (2013), © IAEME

[2] Tarun Dhar Diwan,Shweta dubey "Supporting English-Hindi Parallel Corpus using word
alignment " International Journal of Computer Application, Volume49, Number 6, pages 16-
19, july 2012, ISSN 0975-8887.
[3] Tarun Dhar Diwan "Knowledge of Word Alignment Position in English-Hindi Sentences
"CiiT - International Journal of Artificial Intelligent Systems and Machine Learning, 2012,
ISSN: 0974–9543, DOI: AIML082012001
[4] Tarun Dhar Diwan " Word Alignment to Encourage Outsized English-Hindi Parallel
Corpus "CiiT - International Journal of Fuzzy Logic, 2012, ISSN 0974 – 9608, DOI:
AIML082012005
[5] Tarun Dhar Diwan " Alignment of English-Hindi Sentences "CiiT - International Journal
of Fuzzy Logic, 2012, ISSN: 0974–9543, DOI: AIML082012006
[6] Tarun Dhar Diwan " Sentence Ending in English Language "CiiT - International Journal
of Artificial Intelligent Systems and Machine Learning, September 2012, ISSN: 0974–9543,
DOI: AIML092012006
[7] Tarun Dhar Diwan " Development of English-Hindi Parallel Corpus using Sentence
Alignments " CiiT - International Journal of Artificial Intelligent Systems and Machine
Learning, September 2012, ISSN: 0974–9543, DOI: AIML092012007
 [8]Spector, A. Z. 1989 Achieving application requirements. In Distributed Systems,
S.Mullender, Ed. Acm Press Frontier Series, ACM Press, New York, NY, 19-33.
[9]Kiyotaka Uchimoto, Satoshi Sekine, and Hitoshi Isahara.1999. Japanese Dependency
Structure Analysis Based on Maximum Entropy Models. In Proceedings of the EACL, pages
196–203.
[10]Kiyotaka Uchimoto, Masaki Murata, Satoshi Sekine, and HitoshiIsahara. 2000.
Dependency Model Using Posterior Context. In Proceedings of the IWPT, pages 321–322.




                                           130

More Related Content

What's hot

A MODEL TO CONVERT WAVE–FORM-TEXT TO LINEAR-FORM-TEXT FOR BETTER READABILITY ...
A MODEL TO CONVERT WAVE–FORM-TEXT TO LINEAR-FORM-TEXT FOR BETTER READABILITY ...A MODEL TO CONVERT WAVE–FORM-TEXT TO LINEAR-FORM-TEXT FOR BETTER READABILITY ...
A MODEL TO CONVERT WAVE–FORM-TEXT TO LINEAR-FORM-TEXT FOR BETTER READABILITY ...
ijma
 

What's hot (17)

text summarization using amr
text summarization using amrtext summarization using amr
text summarization using amr
 
DETECTION OF JARGON WORDS IN A TEXT USING SEMI-SUPERVISED LEARNING
DETECTION OF JARGON WORDS IN A TEXT USING SEMI-SUPERVISED LEARNINGDETECTION OF JARGON WORDS IN A TEXT USING SEMI-SUPERVISED LEARNING
DETECTION OF JARGON WORDS IN A TEXT USING SEMI-SUPERVISED LEARNING
 
DETECTION OF JARGON WORDS IN A TEXT USING SEMI-SUPERVISED LEARNING
DETECTION OF JARGON WORDS IN A TEXT USING SEMI-SUPERVISED LEARNINGDETECTION OF JARGON WORDS IN A TEXT USING SEMI-SUPERVISED LEARNING
DETECTION OF JARGON WORDS IN A TEXT USING SEMI-SUPERVISED LEARNING
 
Unsupervised Quality Estimation Model for English to German Translation and I...
Unsupervised Quality Estimation Model for English to German Translation and I...Unsupervised Quality Estimation Model for English to German Translation and I...
Unsupervised Quality Estimation Model for English to German Translation and I...
 
A MODEL TO CONVERT WAVE–FORM-TEXT TO LINEAR-FORM-TEXT FOR BETTER READABILITY ...
A MODEL TO CONVERT WAVE–FORM-TEXT TO LINEAR-FORM-TEXT FOR BETTER READABILITY ...A MODEL TO CONVERT WAVE–FORM-TEXT TO LINEAR-FORM-TEXT FOR BETTER READABILITY ...
A MODEL TO CONVERT WAVE–FORM-TEXT TO LINEAR-FORM-TEXT FOR BETTER READABILITY ...
 
Mca2030 object oriented programming – c++
Mca2030  object oriented programming – c++Mca2030  object oriented programming – c++
Mca2030 object oriented programming – c++
 
Mca2030 object oriented programming – c++
Mca2030  object oriented programming – c++Mca2030  object oriented programming – c++
Mca2030 object oriented programming – c++
 
C Sharp Nagina (1)
C Sharp Nagina (1)C Sharp Nagina (1)
C Sharp Nagina (1)
 
Off line system for the recognition of handwritten arabic character
Off line system for the recognition of handwritten arabic characterOff line system for the recognition of handwritten arabic character
Off line system for the recognition of handwritten arabic character
 
Basic calculus
Basic calculusBasic calculus
Basic calculus
 
Overview of C Language
Overview of C LanguageOverview of C Language
Overview of C Language
 
An Efficient Segmentation Technique for Machine Printed Devanagiri Script: Bo...
An Efficient Segmentation Technique for Machine Printed Devanagiri Script: Bo...An Efficient Segmentation Technique for Machine Printed Devanagiri Script: Bo...
An Efficient Segmentation Technique for Machine Printed Devanagiri Script: Bo...
 
ma52006id386
ma52006id386ma52006id386
ma52006id386
 
Automatic Summarization in Chinese Product Reviews
Automatic Summarization in Chinese Product ReviewsAutomatic Summarization in Chinese Product Reviews
Automatic Summarization in Chinese Product Reviews
 
Text Summarization
Text SummarizationText Summarization
Text Summarization
 
Ee java lab assignment 3
Ee java lab assignment 3Ee java lab assignment 3
Ee java lab assignment 3
 
Top C Language Interview Questions and Answer
Top C Language Interview Questions and AnswerTop C Language Interview Questions and Answer
Top C Language Interview Questions and Answer
 

Viewers also liked

Stabilization of weak pavement subgrades using cement kiln dust 2
Stabilization of weak pavement subgrades using cement kiln dust 2Stabilization of weak pavement subgrades using cement kiln dust 2
Stabilization of weak pavement subgrades using cement kiln dust 2
IAEME Publication
 
Supplier development practices and current trends a review of literature
Supplier development practices and current trends a review of literatureSupplier development practices and current trends a review of literature
Supplier development practices and current trends a review of literature
IAEME Publication
 
Quantification of energy losses and performance improvement in dx cooling by ...
Quantification of energy losses and performance improvement in dx cooling by ...Quantification of energy losses and performance improvement in dx cooling by ...
Quantification of energy losses and performance improvement in dx cooling by ...
IAEME Publication
 
Tribological testing regime for establishing ficiency of zddp in presence of ...
Tribological testing regime for establishing ficiency of zddp in presence of ...Tribological testing regime for establishing ficiency of zddp in presence of ...
Tribological testing regime for establishing ficiency of zddp in presence of ...
IAEME Publication
 
A review of anomaly based intrusions detection in multi tier web applications
A review of anomaly based intrusions detection in multi tier web applicationsA review of anomaly based intrusions detection in multi tier web applications
A review of anomaly based intrusions detection in multi tier web applications
IAEME Publication
 
Comparison and analysis of combining techniques for spatial multiplexing spac...
Comparison and analysis of combining techniques for spatial multiplexing spac...Comparison and analysis of combining techniques for spatial multiplexing spac...
Comparison and analysis of combining techniques for spatial multiplexing spac...
IAEME Publication
 
Claytronics the building block of new virtual world
Claytronics the building block of new virtual worldClaytronics the building block of new virtual world
Claytronics the building block of new virtual world
IAEME Publication
 
Mobile safety systems for automobiles
Mobile safety systems for automobilesMobile safety systems for automobiles
Mobile safety systems for automobiles
IAEME Publication
 
Design and experimental investigations of pressure
Design and experimental investigations of pressureDesign and experimental investigations of pressure
Design and experimental investigations of pressure
IAEME Publication
 
Innovative hr practices in software organization
Innovative hr practices in software organizationInnovative hr practices in software organization
Innovative hr practices in software organization
IAEME Publication
 
An emprirical investigation into factors affecting service quality among indi...
An emprirical investigation into factors affecting service quality among indi...An emprirical investigation into factors affecting service quality among indi...
An emprirical investigation into factors affecting service quality among indi...
IAEME Publication
 
A study on the entrepreneurial intention among students
A study on the entrepreneurial intention among students A study on the entrepreneurial intention among students
A study on the entrepreneurial intention among students
IAEME Publication
 

Viewers also liked (20)

Stabilization of weak pavement subgrades using cement kiln dust 2
Stabilization of weak pavement subgrades using cement kiln dust 2Stabilization of weak pavement subgrades using cement kiln dust 2
Stabilization of weak pavement subgrades using cement kiln dust 2
 
Supplier development practices and current trends a review of literature
Supplier development practices and current trends a review of literatureSupplier development practices and current trends a review of literature
Supplier development practices and current trends a review of literature
 
Quantification of energy losses and performance improvement in dx cooling by ...
Quantification of energy losses and performance improvement in dx cooling by ...Quantification of energy losses and performance improvement in dx cooling by ...
Quantification of energy losses and performance improvement in dx cooling by ...
 
Tribological testing regime for establishing ficiency of zddp in presence of ...
Tribological testing regime for establishing ficiency of zddp in presence of ...Tribological testing regime for establishing ficiency of zddp in presence of ...
Tribological testing regime for establishing ficiency of zddp in presence of ...
 
A review of anomaly based intrusions detection in multi tier web applications
A review of anomaly based intrusions detection in multi tier web applicationsA review of anomaly based intrusions detection in multi tier web applications
A review of anomaly based intrusions detection in multi tier web applications
 
Comparison and analysis of combining techniques for spatial multiplexing spac...
Comparison and analysis of combining techniques for spatial multiplexing spac...Comparison and analysis of combining techniques for spatial multiplexing spac...
Comparison and analysis of combining techniques for spatial multiplexing spac...
 
Claytronics the building block of new virtual world
Claytronics the building block of new virtual worldClaytronics the building block of new virtual world
Claytronics the building block of new virtual world
 
Mobile safety systems for automobiles
Mobile safety systems for automobilesMobile safety systems for automobiles
Mobile safety systems for automobiles
 
Design and experimental investigations of pressure
Design and experimental investigations of pressureDesign and experimental investigations of pressure
Design and experimental investigations of pressure
 
Innovative hr practices in software organization
Innovative hr practices in software organizationInnovative hr practices in software organization
Innovative hr practices in software organization
 
An emprirical investigation into factors affecting service quality among indi...
An emprirical investigation into factors affecting service quality among indi...An emprirical investigation into factors affecting service quality among indi...
An emprirical investigation into factors affecting service quality among indi...
 
A study on the entrepreneurial intention among students
A study on the entrepreneurial intention among students A study on the entrepreneurial intention among students
A study on the entrepreneurial intention among students
 
Expresso132
Expresso132Expresso132
Expresso132
 
Pnacionales pp
Pnacionales ppPnacionales pp
Pnacionales pp
 
colegio de bachilleres plantel cancun tres
colegio de bachilleres plantel cancun trescolegio de bachilleres plantel cancun tres
colegio de bachilleres plantel cancun tres
 
Perfuracoes
PerfuracoesPerfuracoes
Perfuracoes
 
Presentación2
Presentación2Presentación2
Presentación2
 
Danny videoconferencia
Danny videoconferenciaDanny videoconferencia
Danny videoconferencia
 
Portfolio
PortfolioPortfolio
Portfolio
 
vgL&R
vgL&RvgL&R
vgL&R
 

Similar to The machine learning method regarding efficient soft computing and ict using svm

Automation tool for evaluation of the quality of nlp based
Automation tool for evaluation of the quality of nlp basedAutomation tool for evaluation of the quality of nlp based
Automation tool for evaluation of the quality of nlp based
IAEME Publication
 
A domain specific automatic text summarization using fuzzy logic
A domain specific automatic text summarization using fuzzy logicA domain specific automatic text summarization using fuzzy logic
A domain specific automatic text summarization using fuzzy logic
IAEME Publication
 
English to punjabi machine translation system using hybrid approach of word s
English to punjabi machine translation system using hybrid approach of word sEnglish to punjabi machine translation system using hybrid approach of word s
English to punjabi machine translation system using hybrid approach of word s
IAEME Publication
 
Efficient text compression using special character replacement
Efficient text compression using special character replacementEfficient text compression using special character replacement
Efficient text compression using special character replacement
iaemedu
 
Cohesive Software Design
Cohesive Software DesignCohesive Software Design
Cohesive Software Design
ijtsrd
 
Intelligent hands free speech based sms
Intelligent hands free speech based smsIntelligent hands free speech based sms
Intelligent hands free speech based sms
Kamal Spring
 

Similar to The machine learning method regarding efficient soft computing and ict using svm (20)

40120130406014 2
40120130406014 240120130406014 2
40120130406014 2
 
Multi Document Text Summarization using Backpropagation Network
Multi Document Text Summarization using Backpropagation NetworkMulti Document Text Summarization using Backpropagation Network
Multi Document Text Summarization using Backpropagation Network
 
Automation tool for evaluation of the quality of nlp based
Automation tool for evaluation of the quality of nlp basedAutomation tool for evaluation of the quality of nlp based
Automation tool for evaluation of the quality of nlp based
 
A domain specific automatic text summarization using fuzzy logic
A domain specific automatic text summarization using fuzzy logicA domain specific automatic text summarization using fuzzy logic
A domain specific automatic text summarization using fuzzy logic
 
English to punjabi machine translation system using hybrid approach of word s
English to punjabi machine translation system using hybrid approach of word sEnglish to punjabi machine translation system using hybrid approach of word s
English to punjabi machine translation system using hybrid approach of word s
 
Compression tech
Compression techCompression tech
Compression tech
 
Efficient text compression using special character replacement
Efficient text compression using special character replacementEfficient text compression using special character replacement
Efficient text compression using special character replacement
 
Advancements in Hindi-English Neural Machine Translation: Leveraging LSTM wit...
Advancements in Hindi-English Neural Machine Translation: Leveraging LSTM wit...Advancements in Hindi-English Neural Machine Translation: Leveraging LSTM wit...
Advancements in Hindi-English Neural Machine Translation: Leveraging LSTM wit...
 
Improvement of Text Summarization using Fuzzy Logic Based Method
Improvement of Text Summarization using Fuzzy Logic Based  MethodImprovement of Text Summarization using Fuzzy Logic Based  Method
Improvement of Text Summarization using Fuzzy Logic Based Method
 
High level speaker specific features modeling in automatic speaker recognitio...
High level speaker specific features modeling in automatic speaker recognitio...High level speaker specific features modeling in automatic speaker recognitio...
High level speaker specific features modeling in automatic speaker recognitio...
 
Tools and techniques of code coverage testing
Tools and techniques of code coverage testingTools and techniques of code coverage testing
Tools and techniques of code coverage testing
 
Classification of Machine Translation Outputs Using NB Classifier and SVM for...
Classification of Machine Translation Outputs Using NB Classifier and SVM for...Classification of Machine Translation Outputs Using NB Classifier and SVM for...
Classification of Machine Translation Outputs Using NB Classifier and SVM for...
 
Malayalam Word Sense Disambiguation using Machine Learning Approach
Malayalam Word Sense Disambiguation using Machine Learning ApproachMalayalam Word Sense Disambiguation using Machine Learning Approach
Malayalam Word Sense Disambiguation using Machine Learning Approach
 
Natural Language Interface for Java Programming: Survey
Natural Language Interface for Java Programming: SurveyNatural Language Interface for Java Programming: Survey
Natural Language Interface for Java Programming: Survey
 
Cohesive Software Design
Cohesive Software DesignCohesive Software Design
Cohesive Software Design
 
Isolated word recognition using lpc & vector quantization
Isolated word recognition using lpc & vector quantizationIsolated word recognition using lpc & vector quantization
Isolated word recognition using lpc & vector quantization
 
Isolated word recognition using lpc & vector quantization
Isolated word recognition using lpc & vector quantizationIsolated word recognition using lpc & vector quantization
Isolated word recognition using lpc & vector quantization
 
Detection of multiword from a wordnet is complex
Detection of multiword from a wordnet is complexDetection of multiword from a wordnet is complex
Detection of multiword from a wordnet is complex
 
Intelligent hands free speech based sms
Intelligent hands free speech based smsIntelligent hands free speech based sms
Intelligent hands free speech based sms
 
Intelligent hands free speech based sms
Intelligent hands free speech based smsIntelligent hands free speech based sms
Intelligent hands free speech based sms
 

More from IAEME Publication

A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURSA STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
IAEME Publication
 
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURSBROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
IAEME Publication
 
GANDHI ON NON-VIOLENT POLICE
GANDHI ON NON-VIOLENT POLICEGANDHI ON NON-VIOLENT POLICE
GANDHI ON NON-VIOLENT POLICE
IAEME Publication
 
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
IAEME Publication
 
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
IAEME Publication
 
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
IAEME Publication
 
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
IAEME Publication
 
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
IAEME Publication
 
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
IAEME Publication
 
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
IAEME Publication
 

More from IAEME Publication (20)

IAEME_Publication_Call_for_Paper_September_2022.pdf
IAEME_Publication_Call_for_Paper_September_2022.pdfIAEME_Publication_Call_for_Paper_September_2022.pdf
IAEME_Publication_Call_for_Paper_September_2022.pdf
 
MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...
MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...
MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...
 
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURSA STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
 
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURSBROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
 
DETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONS
DETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONSDETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONS
DETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONS
 
ANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONS
ANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONSANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONS
ANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONS
 
VOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINO
VOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINOVOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINO
VOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINO
 
IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...
IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...
IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...
 
VISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMY
VISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMYVISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMY
VISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMY
 
A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...
A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...
A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...
 
GANDHI ON NON-VIOLENT POLICE
GANDHI ON NON-VIOLENT POLICEGANDHI ON NON-VIOLENT POLICE
GANDHI ON NON-VIOLENT POLICE
 
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
 
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
 
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
 
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
 
EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...
EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...
EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...
 
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
 
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
 
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
 
A MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENT
A MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENTA MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENT
A MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENT
 

The machine learning method regarding efficient soft computing and ict using svm

  • 1. INTERNATIONALComputer EngineeringCOMPUTER ENGINEERING International Journal of JOURNAL OF and Technology (IJCET), ISSN 0976- 6367(Print), ISSN 0976 – 6375(Online) Volume 4, Issue 1, January- February (2013), © IAEME & TECHNOLOGY (IJCET) ISSN 0976 – 6367(Print) ISSN 0976 – 6375(Online) Volume 4, Issue 1, January- February (2013), pp. 124-130 IJCET © IAEME: www.iaeme.com/ijcet.asp Journal Impact Factor (2012): 3.9580 (Calculated by GISI) ©IAEME www.jifactor.com THE MACHINE LEARNING METHOD REGARDING EFFICIENT SOFT COMPUTING AND ICT USING SVM 1 TARUN DHAR DIWAN, 2UPASANA SINHA ASSISTANT PROFESSOR DEPT. OF ENGINEERING 1 Dr.C.V.RAMAN UNIVERSITY, BILASPUR (INDIA) 2 J K INSTITUTE OF ENGINEERING, BILASPUR (INDIA) 1 taruncsit@gmail.com, 2upasana.sihna@gmail.com ABSTRACT Soft Computing and Information Communication Technology (ICT) both are main part in information technology and computer science because today all documentations are computerized to store necessary information and database into English language, so natural language processing concept is come from artificial intelligence (A.I) branch of computer science and information technology as part of soft computing and ICT (information communication technology). Here English language may be used as a major part of machine learning method ,where sentence boundary detection is a great challenge in current time of NLP(natural language processing).So SVM(support vector machine) can be used to solve the problem of sentence boundary detection which indicates the machine learning method regarding efficient soft computing and ICT. Hence Computer system may be learned to avoid ambiguity of dot (.) periods in detecting sentence boundary. I. INTRODUCTION Basically sentence boundary detection is essential feature of recognizing the correct sentence meaning in text area and natural language processing, because without valid meaning based documentation has no any information extraction features and communication is generated in efficient successful trend. SVM (support vector machine) may be used to extract the feature of sentence boundaries where Support Vector Machine (SVM) is a machine learning model and which solved the problem of sentence boundary detection by calculation of precision, recall, f feature value [1]. Feature extraction f value is evaluated on 124
  • 2. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976- 6367(Print), ISSN 0976 – 6375(Online) Volume 4, Issue 1, January- February (2013), © IAEME the basis of precision, recall value calculation.Sentence boundary detection (SBD) means detecting end of sentence in given or used text. Various type of punctuation marks contain multiple feature at one time which gives sentence boundary ambiguity in text , This type of error of ambiguity can be find by calculating precision, recall values and feature values of support vector machine and in such a way sentence boundary is detected , where ambiguity is come due to multiple behavior of one punctuation mark or symbols in text at one time, then precision and recall values are calculated to provide better information on what kinds of errors were made by ambiguous punctuation mark. Basically SVM (support vector machine) firstly calculate ambiguity of each segment in input English text by calculating precision value, here segment is one part of input text but not lead to exact sentence boundary [2]. Now we calculate recall value to find exact sentence boundary and f is calculated at last to extract features of each segment of input text. II. RELATED WORK Basically here sentence boundary detection is done on the basis of dependency analysis of each segment of text. Segment of text is not necessary to occur as a boundary of segment so there dependency analysis of previous and next segment of text is done in three type of sentence which is given bellow: 1. Open dependency based sentence. 2. Closed dependency based sentence. 3. Without dependency based sentence i.e. Independent sentence. In above three type of sentence based ambiguous text has a lot of need of detecting sentence boundary detection and after analysis of this three type of sentence in this paper by using suitable example, it is search that more feature of text segment gives more ambiguity with minimum SBD (Sentence Boundary Detection) [1,3].. III. CHALLENGES/ PROBLEMS OF SENTENCE BOUNDARY DETECTION In sentence boundary detection following problems are come as challenges to detect end of sentence in creating recognize correct meaning of any text of English language during soft computing or information communication technology (ICT) [4]. 1. Recognize tokens by removing white space and special characters from English text. 2. Resolving ambiguous separators in numbers to text. 3. Resolving ambiguous abbreviation in text. 4. Then resolving end of sentences (EOS) (?). 5. Morphology analysis on a corpus database in a form of statistical analysis and word/letter analysis which are part of NLP. 6. Correct syntactic and semantic meaning of each word by using grammar i.e. morphology and lexical analysis. 7. Avoiding ambiguity in punctuation marks. 125
  • 3. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976- 6367(Print), ISSN 0976 – 6375(Online) Volume 4, Issue 1, January- February (2013), © IAEME IV. OBJECTIVE/GOAL OF PAPER Finding the solution of sentence boundary detection challenge as avoiding ambiguity of dot (.) punctuation mark. It is done by calculating the error rate where error rate is related to each segment of text which is evaluated to check dependency analysis on the basis of dot (.) Punctuation as either it is a abbreviation, termination or name into English language based text. V. SIGNIFICANT OF PAPER In this paper ambiguity is avoided of dot(.) periods into each segment of English language based text as to detect sentence boundary where f feature value is detected from SVM as when any word is presence it is counted as 1 otherwise as 0 value and this 0 or 1 value is assign finally assign into vector of SVMs. VI. PROPOSED METHODOLOGY SVM methodology is support the creation of document vector from the features extraction value of each segment or line of text. Each new word seen by the SVM module is internally assigned to a different coordinate in the vector. The value of each coordinate is zero when the word is absent from a document otherwise one, basically it is a binary arrangement of text segment vector in memory[5][6].So maximum feature f value is created with maximum presence of one word at one time into one segment of text, which represent high maximum ambiguity i.e. maximum precision value, and maximum ambiguity of any word or string into one segment or line of text ,decreases the sentence break or boundary detection behavior as a less(minimum)recall value. This is clear by using bellow example of input, output text along precision, recall, f feature extraction value. To apply methodology, calculation is done to calculate true positives, false positives, true negatives, and false negatives values whose description is bellow given. These values are used in recall, precision, f feature extraction value calculation formula[7][8].. 1. True Positives: Those who test positive for a condition and are positive (i.e., have the condition). 2. False Positives: Those who test positive, but are negative (i.e., do not have the condition). 3. True Negatives: Those who test negative and are negative, 4. False Negatives: Those who test negative, but are positive. Now Precision is the ratio between the number of candidate tokens that have been correctly assigned to a class and the number of all candidates that have been assigned to this class [9][10]. Precision value=true positives / (true positives + false positives) ------- (1) Recall is defined as the proportion of all candidates truly belonging to a certain class that have also been assigned to that class by the evaluated system. Recall value=true positives / (true positives + false negatives) -------- (2) 126
  • 4. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976- 6367(Print), ISSN 0976 – 6375(Online) Volume 4, Issue 1, January- February (2013), © IAEME Finally, the so-called F measure is the harmonic mean of precision and recall (van Rijsbergen 1979). f measure value =2* precision * recall / (precision + recall) ---------(3) VII. ANALYSIS Here if any sentence is closed with proper syntactic and semantic analysis where start and end both are related to only one sentence then this type of sentence is come into without dependency or independent type of sentence, but if end is correct as syntactic and semantic way but starting is depend on previous segment of text then this type of sentience is known as closed dependency based sentence and if both starting and ending of any segment of text are incomplete the this type of segment is known as open dependency based sentence. Therefore sentence boundary is introduced as three types on the basis of above description of analysis i.e. 1. Strong boundary(independent sentence i.e. without dependency based sentence) 2. Weak boundary(open dependency based sentence) 3. Absolute boundary(closed dependency based sentence) Finding Dependency Structure of Each Sentence to Used Input Text Example of Input English Text I am doing Ph.D. Ph.D. is easy. Mr. and Mrs. Sharma are well. They are Mr. and Mrs. Arun is singing a song. Our college name is S.E.C. Mr. V. Dixit is a best guide. Expected Output after Used Methodology to Input English Text I am doing Ph.D. Ph.D. is easy. Mr. and Mrs. Sharma are well. They are Mr. and Mrs. Arun is singing a song. Our college name is S.E.C. Mr. V. Dixit is a best guide. Finding Four Type of Sentence Dependency Structure 1) I am doing Ph.D. Ph.D. is easy. Mr. and Mrs. (Open dependency based sentence) 2) Sharma are well. They are Mr. and Mrs. Arun is (Open dependency based sentence) 3) playing a game. Our college name is S.E.C. (Closed dependency based sentence) 4) Mr. V. Dixit is a best guide. (Without dependency based sentence i.e. independent sentence) 127
  • 5. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976- 6367(Print), ISSN 0976 – 6375(Online) Volume 4, Issue 1, January- February (2013), © IAEME Here Precision to each statement. 10/10+7=1.4=10/17=0.588 10/10+3=3.3=10/13=0.769 8/8+6=1.3=8/14=4/7=0.571 7/7+3=2.3=7/10=0.7 Here Recall to each statement. 10/10+2=5=10/12=5/6=0.8333 10/10+2=5=10/12=5/6=0.8333 8/8+1=8=8/9=0.888 7/7+1=7/8=0.875 & So f measure values- (2*.588*.833)/ (.588+.833)=.979/1.421=0.688 (2*.769*.833)/ (.769+.833)=1.602 (2*(.571*.888)/ (.571+.888)= 1.014/1.459=0.694 (2*(.875*0.7)/ (.875+0.7)=1.225/1.575=0.777 Here input English text has a evaluation of four type of sentences i.e. first two type are open dependency based sentence, third closed dependency based sentence and finally last fourth type of sentence is without dependency based sentence and above calculation proof that max precision gives max ambiguity regarding to abbreviation, punctuation, separator so sentence boundary has a need of evaluating ambiguity and then eliminating it to gain exact sentence boundary to any input text. Sentence boundary maximally detected to open dependency structure based text because features are maximum with various individual sentence boundaries. VIII. RESULT Here f is feature measure i.e. lexical analysis, parsing or morphological analysis. Recall is exact sentence boundary detection value calculation. Precision is ambiguity calculation of each segment of used text. Hence feature extraction value f is max 1.6% in open dependency based sentence to maximum ambiguity of 0.8% precision value and corresponding exact sentence boundary recall value is minimum i.e. 0.83%. 128
  • 6. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976- 6367(Print), ISSN 0976 – 6375(Online) Volume 4, Issue 1, January- February (2013), © IAEME Table1 Sentence Boundary Detection Result Table Obtained by Using SVMs IX. CONCLUSION Maximum precision value with maximum f feature values generates the minimum recall value where precision is ambiguity during detecting sentence boundary, f feature values is made by ambiguous behavior of punctuation marks. This precision and f feature values are creates a minimum recall value as less sentence boundary of text, i.e. weak boundary extracts various features of text segment with less sentence boundary detection. X. FUTURE WORK Machine learning method of SVM must be implemented to compiler design as creating software of sentence boundary detection in all type of ambiguity of punctuation marks and symbols to English, Hindi Text, it will come into automatic sentence boundary detection sentence task and by there meaningful individual sentence are easy to recognize. REFERENCES [1] Tarun Dhar Diwan, "SVM (Support Vector Machine) Learning to Detect English Sentence Boundary Regarding Dot (.) Punctuation marks" International Conference on Emerging Trends in Soft Computing and ICT, pages78-82, ISBN: GURU GHASIDAS VISHWAVIDYALAYA, Bilaspur (CG),INDIA 129
  • 7. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976- 6367(Print), ISSN 0976 – 6375(Online) Volume 4, Issue 1, January- February (2013), © IAEME [2] Tarun Dhar Diwan,Shweta dubey "Supporting English-Hindi Parallel Corpus using word alignment " International Journal of Computer Application, Volume49, Number 6, pages 16- 19, july 2012, ISSN 0975-8887. [3] Tarun Dhar Diwan "Knowledge of Word Alignment Position in English-Hindi Sentences "CiiT - International Journal of Artificial Intelligent Systems and Machine Learning, 2012, ISSN: 0974–9543, DOI: AIML082012001 [4] Tarun Dhar Diwan " Word Alignment to Encourage Outsized English-Hindi Parallel Corpus "CiiT - International Journal of Fuzzy Logic, 2012, ISSN 0974 – 9608, DOI: AIML082012005 [5] Tarun Dhar Diwan " Alignment of English-Hindi Sentences "CiiT - International Journal of Fuzzy Logic, 2012, ISSN: 0974–9543, DOI: AIML082012006 [6] Tarun Dhar Diwan " Sentence Ending in English Language "CiiT - International Journal of Artificial Intelligent Systems and Machine Learning, September 2012, ISSN: 0974–9543, DOI: AIML092012006 [7] Tarun Dhar Diwan " Development of English-Hindi Parallel Corpus using Sentence Alignments " CiiT - International Journal of Artificial Intelligent Systems and Machine Learning, September 2012, ISSN: 0974–9543, DOI: AIML092012007 [8]Spector, A. Z. 1989 Achieving application requirements. In Distributed Systems, S.Mullender, Ed. Acm Press Frontier Series, ACM Press, New York, NY, 19-33. [9]Kiyotaka Uchimoto, Satoshi Sekine, and Hitoshi Isahara.1999. Japanese Dependency Structure Analysis Based on Maximum Entropy Models. In Proceedings of the EACL, pages 196–203. [10]Kiyotaka Uchimoto, Masaki Murata, Satoshi Sekine, and HitoshiIsahara. 2000. Dependency Model Using Posterior Context. In Proceedings of the IWPT, pages 321–322. 130