SlideShare a Scribd company logo
1 of 7
Download to read offline
International Journal of Engineering and Techniques - Volume 2 Issue 3, May – June 2016
ISSN: 2395-1303 http://www.ijetjournal.org Page 76
Implementation Paper on Document Recommendation in
Conversations
Anshika1
, Sujit Tak2
, Sandeep Ugale3
, Abhishek Pohekar4
(Dept. of Computer Engineering, D.Y.P.I.E.T., Savitribai Phule Pune University, Pune)
I. INDRODUCTION
Humans are surrounded by abundance of data,
accessible as records, data stores, or mixed media
sources of information. This data can be accessed
by using suitable web indexes, however when these
are accessible, clients frequently don't start a search
action, in light of the reality that their present work
does not allow them to do such tasks, or they are
not mindful that applicable data is accessible. In
this paper, a novel technique of suggesting archives
just-in-time that is identified with clients' present
work is suggested. At the time, when these tasks are
primarily conversational, for instance when clients
take part in a meeting, then their data needs can be
understood by the keywords present in speech,
acquired through continuous automatic speech
recognition (ASR) engine. These certain implicitly
generated questions are utilized to recover and
suggest reports or documents from the websites or a
local repository, which clients can decide to,
investigate in more detail if they discover them
intriguing. The focus of our paper is on showing
framework for utilization in meeting room or
conference where information needs to be fetched
in-time, to help involved people in better
understanding of the topic. This shown framework
will be including speech to text translation,
extraction of words from this text, clustering those
words for topic identification, generating implicit
queries from identified main words and retrieving
documents from the available storehouses related to
the words in the queries.
Various libraries (regional/language specific)
can be added to the ASR engine to improve speech
recognition efficiency.
RESEARCH ARTICLE OPEN ACCESS
Abstract:
This paper is focused towards extraction of important words from conversations, and then these
words are utilized to recover a small number of relevant documents, which can be given to users as per
their needs. In any case, even a short audio fragment contains of large number of words, which are
conceivably identified with a few topics and also, using automatic speech recognition (ASR) framework
reduces errors in the output. It is difficult to provide the data needs of the conversation members
appropriately. A calculation to remove decisive words from the yield of an ASR framework (or a manual
transcript for testing) to support the potentially differing qualities of subjects and decrease ASR
commotion is proposed. At that point, a technique is used to make many implicit queries from the selected
keywords. These queries will in return produce list of relevant documents. The scores demonstrate that our
proposition is modified over past systems that consider just word re-occurrence or topic closeness, and
speaks to an appropriate answer for a report recommender framework to be utilized as a part of
discussions.
Keywords — Document recommendation, information retrieval, keyword extraction.
International Journal of Engineering and Techniques - Volume 2 Issue 3, May – June 2016
ISSN: 2395-1303 http://www.ijetjournal.org Page 77
II. RELATED WORK
A. Remembrance Agent: A continuously
running automated information retrieval
system.
Authors: B. Rhodes and T. Starner
Remembrance Agent (RA) is a program
which can display list of documents relevant
to user needs. RA executes without
continuous user interaction. RA is basically
used to give suggestions to user which can be
used or ignored as desired. Computer used to
wait for user interaction, RA makes use of
such wait cycle to perform its search
operation which is relevant to user’s context.
RA reminds about relevant data of user task.
RA gives suggestions in one line description
at bottom of screen. For the front-end for RA,
EMACS-19 and lisp are used. One-line
suggestions and number to indicate relevancy
is displayed by front-end. Full text document
can be obtained as per the request. A program
which is used as back-end for RA produces
suggestions for given text query from pre-
indexed documents. SMART decides
suggestions for RA, depending upon common
word frequency.
This system consists of two sources of
suggestions. In first source, first three lines
print suggestions from last year’s personal
email (approximately up to 60MB). And in
case of second source, last line prints
suggestions from 7MB files which were
entered over past few years. RA should
suggest documents related to keywords in
query. This works on ‘scopes’, which are
centred on current location of cursor. There
are three types of scopes used in RA. First
scope is around the last thousands of words.
Second is around the last 50 words. And third
is around the last 10 words. User can
customize scope coverage, time updates.
B. Enforcing topic diversity in a document
recommender for conversations.
Authors: M. Habibi and A. Popescu-Belis
In our system, we also focus on
building of concise, diverse and relevant lists
of documents which provide information to
the users according to their need.
From this paper, we study the concept
of merging lists of documents which are
obtained through multiple implicit queries,
formed for small conversations fragments.
The aim of the method used in this
paper is to get a unique and precise list of
documents that can satisfy user’s need in real
time. The obtained list should cover the
maximum number of implicit queries and
topics.
The proposed method rewards:
a. Topic similarity – It is concerned with
selection of the most relevant documents to
the conversation.
b. Topic diversity – It is concerned with
coverage of the maximum implicit queries
and topics in a precise and relevant list of
recommendations when more than one topic
is discussed in the conversation.
ACLD system is used as framework for
this document recommender system. The
ACLD system monitors the conversation, and
generates queries based on the keywords
detected by a real-time ASR system. The
queries are then fired periodically for
retrieving documents. These documents are
then recommended to the users according to
their needs.
Here a diverse merging of lists is done.
It is the process of generating a short, diverse
and relevant list of recommended documents.
The generated list should cover the maximum
number of topics of each conversation
fragment.
The method proceeds in two steps:
International Journal of Engineering and Techniques - Volume 2 Issue 3, May – June 2016
ISSN: 2395-1303 http://www.ijetjournal.org Page 78
a. Document and Query Representation:
In this part, the representation of the
queries and the corresponding list of
documents from the Apache Lucene search
engine are done. This representation is done
using topic modelling techniques.
b. Ranking Documents:
In this step, ranking of documents is
done by using topical similarity and then,
rewarding the coverage of different lists is
taken care.
C. A Speech-based Just-in-Time Retrieval
System using Semantic Search
Authors: A. Popescu-Belis, M. Yazdani, A.
Nanchen, and P. N. Garner
A just-in time retrieval system for
extraction of useful information is required in
our project. We have used ACLD i.e.,
Automatic Content Linking Device (ACLD)
for our project. ACLD is a just-in-time
document recommendation system for
meetings.
From this paper, we study the concept
of content linking for our recommendation
system. This is done by storing documents
with tags.
a. What is ACLD?
ACLD is a recommendation system
which analyses spoken input from one or
more speakers and retrieves relevant
documents for user as per their need.
b. Content Linking: Scenarios of Use
One of the main usage scenarios for the
ACLD is meeting rooms. The ACLD searches
the relevant documents for members in the
meeting without interrupting their discussion
flow. In other usage scenarios, ACLD can be
used for live or recorded lectures.
The ACLD provides related and
relevant documents which are obtained from
various repositories, required for the lectures.
c. Description of the ACLD
ACLD system performs following
functions:
i.Document Preparation and Indexing:
This function is concerned with the
extraction of text, and then the indexing of the
documents. Apache Lucene software is used
for this purpose.
ii. Sensing the User’s Information Needs
The ACLD system makes use of the
AMI real-time ASR system. One of its main
features is a pre-compiled grammar. This
feature helps in maintaining accuracy even in
real time on a low resource machine.
iii. Querying the Document Database
The Query Aggregator retrieves the
most relevant documents from one or more
databases using ASR words. The current
version of the ACLD makes use of semantic
search, while word-based search is used for
previous version.
iv. Semantic Search over Wikipedia
The main focus of our method of
semantic search is to improve the relevancy
of the retrieved documents as well as the
robustness of system to remove noise from
the ASR. Here, for document retrieval, the
graph-based model of semantic relatedness is
used.
v. The User Interface (UI)
The UI mainly makes all information
produced by the system available to user in a
configurable way. The users are allowed to
use a larger or smaller amount of information
according to their needs in this.
International Journal of Engineering and Techniques - Volume 2 Issue 3, May – June 2016
ISSN: 2395-1303 http://www.ijetjournal.org Page 79
ACLD is the first just in time retrieval
system that uses involuntary speech. Access
to multimedia documents and web pages
using a robust semantic search method is also
supported.
D. User Interactions with Everyday
Applications as Context for Just-in-time
Information Access
Authors: J. Budzik and K. J. Hammond
Information retrieval is one of the
important functions of our project. The
retrieved information is then used for
obtaining relevant and related documents. As
per observations, rich contextual information
obtained from our interactions with everyday
applications, such as, word processors, web
browsers etc. can be used to support just in
time retrieval of relevant documents.
This paper provides us the concept of
just in time information retrieval by using
user interactions with everyday applications
as a context. For obtaining more clear view
about just in time information retrieval, we
are using Watson system as an example.
Previous efforts in building context for
information access can be classified into
four classes:
a. Relevance feedback in information
retrieval:
In relevance feedback supporting
system, we start with a standard query and
then judge the result. Usually a result is
judged as relevant or not relevant. Then, this
result of user’s judgment can used to
customize the original query by casting
positive or negative search terms.
b. Systems that use user profiles:
The aggregation of user profiles can be
done by gathering terms based on rating of
documents.
c. Word-sense disambiguation:
Explicit word-sense disambiguation on
the user’s part, or approved information
intrinsic in the hypertext documents structure
can be used for reducing ambiguity in some
systems.
d. Knowledge engineering approaches:
The behaviour of user in a particular
application is modelled and then explicit
queries are generated in a particular state of
tasks in such approaches.
III. PROPOSED SYSTEM
We propose a system where, output is
taken from Automatic Speech Recognition
(ASR) system. This output is processed using
topic modelling technique instead of directly
selecting words on re-occurrence basis.
Various functions are used in the system to
process. In first step, noisy words like articles
(a, an, the), unwanted words, common
unnecessary symbols, etc. are removed from
the conversation fragment. Human critics
ranks various conversational corpora like
Fisher, AMI and ELEA which provides audio
pieces to test our proposed methods on the
basis of relevance with respect to audio pieces
from the conversational corpora.
Now, a set (mathematics term) is
formed from words that are obtained after
noisy word removal process. Clustering is
carried on the words present in the set by
using clustering algorithm. In our system we
are using k-means as it is simpler to use and
implement and which in turn, reduces the
processing time of the system and do not put
much load on the system.
According topical similarity and co-
occurrence of words, the clusters are formed.
Then, from each cluster, one query will be
formulated implicitly. This will generate more
precise query. Documents will be retrieved
for each topically different query. One of the
primary tasks is to present the list of
documents in user-friendly way. Two column
International Journal of Engineering and Techniques - Volume 2 Issue 3, May – June 2016
ISSN: 2395-1303 http://www.ijetjournal.org Page 80
GUI is used for this for showing document to
user. They are:
1. Query relevant document.
2. Document’s similarity percentage to the
keyword (They are based on co-occurrence
of words).
At various stages of processing,
different technique are used to remove errors.
Ranking of document helps user in accessing
them before displaying.
IV. MAIN ALGORITHM
1) Latent Dirichlet Allocation
LDA, used for topic modelling, assumes the
following generative process for each
document w in a large and structured set of
texts D:
1. Choose N ∼ Poisson(ξ)
2. Choose θ ∼ Dir (α).
3. For each of the N words wn:
a. Choose a topic zn ∼ Multinomial (θ).
b. Choose a word wn from p (wn |zn, β), a
multinomial probability conditioned on the
topic zn.
A k-dimensional Dirichlet random variable θ
can take values in the (k −1)-simplex (a k-
vector θ lies in the (k−1)-simplex if θi ≥ 0, Σk
i=1 θi = 1), and has the following probability
density on this simplex:
2) Diverse Keyword Extraction
Algorithmic form is as follows:
Input: A given text t, a set of topics Z, the
number of keywords k;
Output: a set of keywords S
S ← ∅;
while |S| ≤ k do
S ← S ∪ {argmaxw∈t S (h (w))
where
h (w) = Pz∈Z p(z|t)[ r{w},z + rS,z] λ
};
end
return S;
Where, rS,z is the topical similarity with
respect to topic z of the keyword set S
selected from the fragment t.
V. SYSTEM ARCHITECTURE
Figure 4 shows ASR system records user’s
conversation. The output of ASR is send further to
noise removal function. Then diverse keywords are
extracted using keyword extraction method. After
that these diverse keywords are clustered. Further
these keywords are used for obtaining result of
implicit queries. Then relevant documents are
recommended to user.
Fig: System Architecture
VI. SYSTEM SCREENSHOTS
International Journal of Engineering and Techniques - Volume 2 Issue 3, May – June 2016
ISSN: 2395-1303 http://www.ijetjournal.org Page 81
Fig. 1: Administrator UI
Fig. 2: Uploading documents to local repository
Fig. 3: Setting filter words
Fig. 4: Android Application (ASR Engine)
Fig. 5.1: Relevant Document Retrieved in Client UI
International Journal of Engineering and Techniques - Volume 2 Issue 3, May – June 2016
ISSN: 2395-1303 http://www.ijetjournal.org Page 82
Fig. 5.2: Relevant Document Retrieved in Client UI
VII. FUTURE SCOPE
With more processing power, the
system could be made real-time recommender.
Also, the system can be linked with online
encyclopedias like Wikipedia, etc. for more
detailed recommendations. The common
APIs used in ASR system can be added with
region-specific APIs to improve its efficiency.
VIII. CONCLUSION
In our system architecture, there are
four steps of processing after obtaining the
ASR output. The speed and precision are the
main criterions to judge a just-in-time
recommender. Topic diversity is maintained
by extracting the keywords using a diverse
keyword search algorithm. There are many
practical applications like meeting rooms,
conferences, workshops, etc. because of
implicit query formation process.
IX. ACKNOWLEDGEMENT
We would like to thank our guide Prof.
B. S. Satpute for helping us in understanding
complex technical issues regarding project
and for helping in all the phases of system
development. We are also grateful to Dr. R. K.
Jain and Dr. Pramod Patil for their
wholehearted support, guidance and for
providing us with required amenities and
infrastructure. We would like to thank to all
the authors, whose paper we have referred,
for their research in this field. It helped us in
understanding the algorithms and various
techniques in detail.
REFERENCES
1. B. Rhodes and T. Starner, “Remembrance Agent: A
continuously running automated information retrieval
system,” in Proc. 1st Int. Conf. Pract. Applicat. Intell.
Agents Multi Agent Technol., London, U.K., 1996, pp.
487–495.
2. M. Habibi and A. Popescu-Belis, “Enforcing topic
diversity in a document recommender for
conversations,” in Proc. 25th Int. Conf. Comput.
Linguist. (Coling), 2014, pp. 588–599.
3. A. Popescsu-Belis, M. Yazdani, A. Nanchen, and P. N.
Garner, “A speech-based just-in-time retrieval system
using semantic search,” in Proc. Annu. Conf. North
Amer. Chap. ACL (HLT-NAACL), 2011, pp.80–85.
4. J Budzik and K. J. Hammond, “User interactions with
everyday applications as context for just-in-time
information access,” in Proc. 5th
Int. Conf. Intell. User
Interfaces (IUI’00), 2000, pp. 44–51.
5. Maryam Habibi and Andrei Popescu-Belis, “Keyword
Extraction and Clustering for Document
Recommendation in Conversations”, in IEEE/ACM
Transactions on Audio, Speech and Language
Processing, Vol. 23, NO. 4, April 2010

More Related Content

What's hot

Indexing Techniques: Their Usage in Search Engines for Information Retrieval
Indexing Techniques: Their Usage in Search Engines for Information RetrievalIndexing Techniques: Their Usage in Search Engines for Information Retrieval
Indexing Techniques: Their Usage in Search Engines for Information RetrievalVikas Bhushan
 
Lectures 1,2,3
Lectures 1,2,3Lectures 1,2,3
Lectures 1,2,3alaa223
 
Ontology Based Approach for Semantic Information Retrieval System
Ontology Based Approach for Semantic Information Retrieval SystemOntology Based Approach for Semantic Information Retrieval System
Ontology Based Approach for Semantic Information Retrieval SystemIJTET Journal
 
Finding Influential People from Historical News Repository (Master's thesis w...
Finding Influential People from Historical News Repository (Master's thesis w...Finding Influential People from Historical News Repository (Master's thesis w...
Finding Influential People from Historical News Repository (Master's thesis w...Aayushee Gupta
 
Conceptual foundations of text mining and preprocessing steps nfaoui el_habib
Conceptual foundations of text mining and preprocessing steps nfaoui el_habibConceptual foundations of text mining and preprocessing steps nfaoui el_habib
Conceptual foundations of text mining and preprocessing steps nfaoui el_habibEl Habib NFAOUI
 
Text databases and information retrieval
Text databases and information retrievalText databases and information retrieval
Text databases and information retrievalunyil96
 
Tovek Presentation by Livio Costantini
Tovek Presentation by Livio CostantiniTovek Presentation by Livio Costantini
Tovek Presentation by Livio Costantinimaxfalc
 
Information retrieval-systems notes
Information retrieval-systems notesInformation retrieval-systems notes
Information retrieval-systems notesBAIRAVI T
 
Indexing Automated Vs Automatic Galvan1
Indexing Automated Vs Automatic   Galvan1Indexing Automated Vs Automatic   Galvan1
Indexing Automated Vs Automatic Galvan1CorinaF
 
CS6007 information retrieval - 5 units notes
CS6007   information retrieval - 5 units notesCS6007   information retrieval - 5 units notes
CS6007 information retrieval - 5 units notesAnandh Arumugakan
 
SOFTWARE ENGINEERING AND SOFTWARE PROJECT MANAGEMENT
SOFTWARE ENGINEERING AND SOFTWARE PROJECT MANAGEMENTSOFTWARE ENGINEERING AND SOFTWARE PROJECT MANAGEMENT
SOFTWARE ENGINEERING AND SOFTWARE PROJECT MANAGEMENTARAVINDRM2
 
Context Based Web Indexing For Semantic Web
Context Based Web Indexing For Semantic WebContext Based Web Indexing For Semantic Web
Context Based Web Indexing For Semantic WebIOSR Journals
 
Research on ontology based information retrieval techniques
Research on ontology based information retrieval techniquesResearch on ontology based information retrieval techniques
Research on ontology based information retrieval techniquesKausar Mukadam
 

What's hot (18)

Indexing Techniques: Their Usage in Search Engines for Information Retrieval
Indexing Techniques: Their Usage in Search Engines for Information RetrievalIndexing Techniques: Their Usage in Search Engines for Information Retrieval
Indexing Techniques: Their Usage in Search Engines for Information Retrieval
 
Lectures 1,2,3
Lectures 1,2,3Lectures 1,2,3
Lectures 1,2,3
 
Ontology Based Approach for Semantic Information Retrieval System
Ontology Based Approach for Semantic Information Retrieval SystemOntology Based Approach for Semantic Information Retrieval System
Ontology Based Approach for Semantic Information Retrieval System
 
Finding Influential People from Historical News Repository (Master's thesis w...
Finding Influential People from Historical News Repository (Master's thesis w...Finding Influential People from Historical News Repository (Master's thesis w...
Finding Influential People from Historical News Repository (Master's thesis w...
 
Accessing database using nlp
Accessing database using nlpAccessing database using nlp
Accessing database using nlp
 
Conceptual foundations of text mining and preprocessing steps nfaoui el_habib
Conceptual foundations of text mining and preprocessing steps nfaoui el_habibConceptual foundations of text mining and preprocessing steps nfaoui el_habib
Conceptual foundations of text mining and preprocessing steps nfaoui el_habib
 
Convolutional Neural Networks
Convolutional Neural Networks Convolutional Neural Networks
Convolutional Neural Networks
 
Ir 01
Ir   01Ir   01
Ir 01
 
Text databases and information retrieval
Text databases and information retrievalText databases and information retrieval
Text databases and information retrieval
 
Tovek Presentation by Livio Costantini
Tovek Presentation by Livio CostantiniTovek Presentation by Livio Costantini
Tovek Presentation by Livio Costantini
 
Lec 2
Lec 2Lec 2
Lec 2
 
Information retrieval-systems notes
Information retrieval-systems notesInformation retrieval-systems notes
Information retrieval-systems notes
 
Indexing Automated Vs Automatic Galvan1
Indexing Automated Vs Automatic   Galvan1Indexing Automated Vs Automatic   Galvan1
Indexing Automated Vs Automatic Galvan1
 
CS6007 information retrieval - 5 units notes
CS6007   information retrieval - 5 units notesCS6007   information retrieval - 5 units notes
CS6007 information retrieval - 5 units notes
 
Lec1,2
Lec1,2Lec1,2
Lec1,2
 
SOFTWARE ENGINEERING AND SOFTWARE PROJECT MANAGEMENT
SOFTWARE ENGINEERING AND SOFTWARE PROJECT MANAGEMENTSOFTWARE ENGINEERING AND SOFTWARE PROJECT MANAGEMENT
SOFTWARE ENGINEERING AND SOFTWARE PROJECT MANAGEMENT
 
Context Based Web Indexing For Semantic Web
Context Based Web Indexing For Semantic WebContext Based Web Indexing For Semantic Web
Context Based Web Indexing For Semantic Web
 
Research on ontology based information retrieval techniques
Research on ontology based information retrieval techniquesResearch on ontology based information retrieval techniques
Research on ontology based information retrieval techniques
 

Viewers also liked

Evan Pace Finance Major RIT Resume
Evan Pace Finance Major RIT ResumeEvan Pace Finance Major RIT Resume
Evan Pace Finance Major RIT ResumeEvan Pace
 
Un cambio de ambiente esta por darse
Un cambio de ambiente esta por darseUn cambio de ambiente esta por darse
Un cambio de ambiente esta por darsescsonsonate
 
URVISH-RESUME-Intern
URVISH-RESUME-InternURVISH-RESUME-Intern
URVISH-RESUME-InternUrvish Shah
 
Introduction to statistics
Introduction to statisticsIntroduction to statistics
Introduction to statisticsDhwani Shah
 
Clasificación de las voces humanas
Clasificación de las voces humanasClasificación de las voces humanas
Clasificación de las voces humanasFrancisco Vazallo
 
гра підсумок тижня
гра підсумок тижнягра підсумок тижня
гра підсумок тижняCергей С
 
Maquiavelo y las formas de gobierno
Maquiavelo y las formas de gobiernoMaquiavelo y las formas de gobierno
Maquiavelo y las formas de gobiernoJavier Contreras
 
Traffic & Transportation surveys
Traffic & Transportation surveysTraffic & Transportation surveys
Traffic & Transportation surveysDhwani Shah
 
Flemming Ulvang CV 2016
Flemming Ulvang CV 2016 Flemming Ulvang CV 2016
Flemming Ulvang CV 2016 Flemming Ulvang
 

Viewers also liked (14)

Carta de Belgrado
Carta de BelgradoCarta de Belgrado
Carta de Belgrado
 
Evan Pace Finance Major RIT Resume
Evan Pace Finance Major RIT ResumeEvan Pace Finance Major RIT Resume
Evan Pace Finance Major RIT Resume
 
Un cambio de ambiente esta por darse
Un cambio de ambiente esta por darseUn cambio de ambiente esta por darse
Un cambio de ambiente esta por darse
 
EMP Pamphlet 2016
EMP Pamphlet 2016EMP Pamphlet 2016
EMP Pamphlet 2016
 
Resume-06-13-2016
Resume-06-13-2016Resume-06-13-2016
Resume-06-13-2016
 
Simple past
Simple pastSimple past
Simple past
 
URVISH-RESUME-Intern
URVISH-RESUME-InternURVISH-RESUME-Intern
URVISH-RESUME-Intern
 
Ch.01 inroduction
Ch.01 inroductionCh.01 inroduction
Ch.01 inroduction
 
Introduction to statistics
Introduction to statisticsIntroduction to statistics
Introduction to statistics
 
Clasificación de las voces humanas
Clasificación de las voces humanasClasificación de las voces humanas
Clasificación de las voces humanas
 
гра підсумок тижня
гра підсумок тижнягра підсумок тижня
гра підсумок тижня
 
Maquiavelo y las formas de gobierno
Maquiavelo y las formas de gobiernoMaquiavelo y las formas de gobierno
Maquiavelo y las formas de gobierno
 
Traffic & Transportation surveys
Traffic & Transportation surveysTraffic & Transportation surveys
Traffic & Transportation surveys
 
Flemming Ulvang CV 2016
Flemming Ulvang CV 2016 Flemming Ulvang CV 2016
Flemming Ulvang CV 2016
 

Similar to [IJET V2I3P13] Authors: Anshika, Sujit Tak, Sandeep Ugale, Abhishek Pohekar

Answer extraction and passage retrieval for
Answer extraction and passage retrieval forAnswer extraction and passage retrieval for
Answer extraction and passage retrieval forWaheeb Ahmed
 
IRJET - BOT Virtual Guide
IRJET -  	  BOT Virtual GuideIRJET -  	  BOT Virtual Guide
IRJET - BOT Virtual GuideIRJET Journal
 
Modern Association Rule Mining Methods
Modern Association Rule Mining MethodsModern Association Rule Mining Methods
Modern Association Rule Mining Methodsijcsity
 
Information retrieval introduction
Information retrieval introductionInformation retrieval introduction
Information retrieval introductionnimmyjans4
 
IRJET- Automated Document Summarization and Classification using Deep Lear...
IRJET- 	  Automated Document Summarization and Classification using Deep Lear...IRJET- 	  Automated Document Summarization and Classification using Deep Lear...
IRJET- Automated Document Summarization and Classification using Deep Lear...IRJET Journal
 
An Improved Mining Of Biomedical Data From Web Documents Using Clustering
An Improved Mining Of Biomedical Data From Web Documents Using ClusteringAn Improved Mining Of Biomedical Data From Web Documents Using Clustering
An Improved Mining Of Biomedical Data From Web Documents Using ClusteringKelly Lipiec
 
Article Summarizer
Article SummarizerArticle Summarizer
Article SummarizerJose Katab
 
INTELLIGENT QUERY PROCESSING IN MALAYALAM
INTELLIGENT QUERY PROCESSING IN MALAYALAMINTELLIGENT QUERY PROCESSING IN MALAYALAM
INTELLIGENT QUERY PROCESSING IN MALAYALAMijcsa
 
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEMCANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEMIRJET Journal
 
G04124041046
G04124041046G04124041046
G04124041046IOSR-JEN
 
Performance Evaluation of Query Processing Techniques in Information Retrieval
Performance Evaluation of Query Processing Techniques in Information RetrievalPerformance Evaluation of Query Processing Techniques in Information Retrieval
Performance Evaluation of Query Processing Techniques in Information Retrievalidescitation
 
A Newly Proposed Technique for Summarizing the Abstractive Newspapers’ Articl...
A Newly Proposed Technique for Summarizing the Abstractive Newspapers’ Articl...A Newly Proposed Technique for Summarizing the Abstractive Newspapers’ Articl...
A Newly Proposed Technique for Summarizing the Abstractive Newspapers’ Articl...mlaij
 
Algorithm for calculating relevance of documents in information retrieval sys...
Algorithm for calculating relevance of documents in information retrieval sys...Algorithm for calculating relevance of documents in information retrieval sys...
Algorithm for calculating relevance of documents in information retrieval sys...IRJET Journal
 
Design of optimal search engine using text summarization through artificial i...
Design of optimal search engine using text summarization through artificial i...Design of optimal search engine using text summarization through artificial i...
Design of optimal search engine using text summarization through artificial i...TELKOMNIKA JOURNAL
 
A novel method to search information through multi agent search and retrie
A novel method to search information through multi agent search and retrieA novel method to search information through multi agent search and retrie
A novel method to search information through multi agent search and retrieIAEME Publication
 
INTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGY
INTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGYINTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGY
INTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGYcscpconf
 
An Evaluation and Overview of Indices Based on Arabic Documents
An Evaluation and Overview of Indices Based on Arabic DocumentsAn Evaluation and Overview of Indices Based on Arabic Documents
An Evaluation and Overview of Indices Based on Arabic DocumentsIJCSEA Journal
 

Similar to [IJET V2I3P13] Authors: Anshika, Sujit Tak, Sandeep Ugale, Abhishek Pohekar (20)

Answer extraction and passage retrieval for
Answer extraction and passage retrieval forAnswer extraction and passage retrieval for
Answer extraction and passage retrieval for
 
IRJET - BOT Virtual Guide
IRJET -  	  BOT Virtual GuideIRJET -  	  BOT Virtual Guide
IRJET - BOT Virtual Guide
 
Modern Association Rule Mining Methods
Modern Association Rule Mining MethodsModern Association Rule Mining Methods
Modern Association Rule Mining Methods
 
[IJET-V1I6P17] Authors : Mrs.R.Kalpana, Mrs.P.Padmapriya
[IJET-V1I6P17] Authors : Mrs.R.Kalpana, Mrs.P.Padmapriya[IJET-V1I6P17] Authors : Mrs.R.Kalpana, Mrs.P.Padmapriya
[IJET-V1I6P17] Authors : Mrs.R.Kalpana, Mrs.P.Padmapriya
 
[IJET-V2I3P19] Authors: Priyanka Sharma
[IJET-V2I3P19] Authors: Priyanka Sharma[IJET-V2I3P19] Authors: Priyanka Sharma
[IJET-V2I3P19] Authors: Priyanka Sharma
 
Information retrieval introduction
Information retrieval introductionInformation retrieval introduction
Information retrieval introduction
 
IRJET- Automated Document Summarization and Classification using Deep Lear...
IRJET- 	  Automated Document Summarization and Classification using Deep Lear...IRJET- 	  Automated Document Summarization and Classification using Deep Lear...
IRJET- Automated Document Summarization and Classification using Deep Lear...
 
An Improved Mining Of Biomedical Data From Web Documents Using Clustering
An Improved Mining Of Biomedical Data From Web Documents Using ClusteringAn Improved Mining Of Biomedical Data From Web Documents Using Clustering
An Improved Mining Of Biomedical Data From Web Documents Using Clustering
 
Article Summarizer
Article SummarizerArticle Summarizer
Article Summarizer
 
INTELLIGENT QUERY PROCESSING IN MALAYALAM
INTELLIGENT QUERY PROCESSING IN MALAYALAMINTELLIGENT QUERY PROCESSING IN MALAYALAM
INTELLIGENT QUERY PROCESSING IN MALAYALAM
 
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEMCANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
 
N017249497
N017249497N017249497
N017249497
 
G04124041046
G04124041046G04124041046
G04124041046
 
Performance Evaluation of Query Processing Techniques in Information Retrieval
Performance Evaluation of Query Processing Techniques in Information RetrievalPerformance Evaluation of Query Processing Techniques in Information Retrieval
Performance Evaluation of Query Processing Techniques in Information Retrieval
 
A Newly Proposed Technique for Summarizing the Abstractive Newspapers’ Articl...
A Newly Proposed Technique for Summarizing the Abstractive Newspapers’ Articl...A Newly Proposed Technique for Summarizing the Abstractive Newspapers’ Articl...
A Newly Proposed Technique for Summarizing the Abstractive Newspapers’ Articl...
 
Algorithm for calculating relevance of documents in information retrieval sys...
Algorithm for calculating relevance of documents in information retrieval sys...Algorithm for calculating relevance of documents in information retrieval sys...
Algorithm for calculating relevance of documents in information retrieval sys...
 
Design of optimal search engine using text summarization through artificial i...
Design of optimal search engine using text summarization through artificial i...Design of optimal search engine using text summarization through artificial i...
Design of optimal search engine using text summarization through artificial i...
 
A novel method to search information through multi agent search and retrie
A novel method to search information through multi agent search and retrieA novel method to search information through multi agent search and retrie
A novel method to search information through multi agent search and retrie
 
INTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGY
INTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGYINTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGY
INTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGY
 
An Evaluation and Overview of Indices Based on Arabic Documents
An Evaluation and Overview of Indices Based on Arabic DocumentsAn Evaluation and Overview of Indices Based on Arabic Documents
An Evaluation and Overview of Indices Based on Arabic Documents
 

More from IJET - International Journal of Engineering and Techniques

More from IJET - International Journal of Engineering and Techniques (20)

healthcare supervising system to monitor heart rate to diagonize and alert he...
healthcare supervising system to monitor heart rate to diagonize and alert he...healthcare supervising system to monitor heart rate to diagonize and alert he...
healthcare supervising system to monitor heart rate to diagonize and alert he...
 
verifiable and multi-keyword searchable attribute-based encryption scheme for...
verifiable and multi-keyword searchable attribute-based encryption scheme for...verifiable and multi-keyword searchable attribute-based encryption scheme for...
verifiable and multi-keyword searchable attribute-based encryption scheme for...
 
Ijet v5 i6p18
Ijet v5 i6p18Ijet v5 i6p18
Ijet v5 i6p18
 
Ijet v5 i6p17
Ijet v5 i6p17Ijet v5 i6p17
Ijet v5 i6p17
 
Ijet v5 i6p16
Ijet v5 i6p16Ijet v5 i6p16
Ijet v5 i6p16
 
Ijet v5 i6p15
Ijet v5 i6p15Ijet v5 i6p15
Ijet v5 i6p15
 
Ijet v5 i6p14
Ijet v5 i6p14Ijet v5 i6p14
Ijet v5 i6p14
 
Ijet v5 i6p13
Ijet v5 i6p13Ijet v5 i6p13
Ijet v5 i6p13
 
Ijet v5 i6p12
Ijet v5 i6p12Ijet v5 i6p12
Ijet v5 i6p12
 
Ijet v5 i6p11
Ijet v5 i6p11Ijet v5 i6p11
Ijet v5 i6p11
 
Ijet v5 i6p10
Ijet v5 i6p10Ijet v5 i6p10
Ijet v5 i6p10
 
Ijet v5 i6p2
Ijet v5 i6p2Ijet v5 i6p2
Ijet v5 i6p2
 
IJET-V3I2P24
IJET-V3I2P24IJET-V3I2P24
IJET-V3I2P24
 
IJET-V3I2P23
IJET-V3I2P23IJET-V3I2P23
IJET-V3I2P23
 
IJET-V3I2P22
IJET-V3I2P22IJET-V3I2P22
IJET-V3I2P22
 
IJET-V3I2P21
IJET-V3I2P21IJET-V3I2P21
IJET-V3I2P21
 
IJET-V3I2P20
IJET-V3I2P20IJET-V3I2P20
IJET-V3I2P20
 
IJET-V3I2P19
IJET-V3I2P19IJET-V3I2P19
IJET-V3I2P19
 
IJET-V3I2P18
IJET-V3I2P18IJET-V3I2P18
IJET-V3I2P18
 
IJET-V3I2P17
IJET-V3I2P17IJET-V3I2P17
IJET-V3I2P17
 

Recently uploaded

MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Christo Ananth
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduitsrknatarajan
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 

Recently uploaded (20)

MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduits
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
 

[IJET V2I3P13] Authors: Anshika, Sujit Tak, Sandeep Ugale, Abhishek Pohekar

  • 1. International Journal of Engineering and Techniques - Volume 2 Issue 3, May – June 2016 ISSN: 2395-1303 http://www.ijetjournal.org Page 76 Implementation Paper on Document Recommendation in Conversations Anshika1 , Sujit Tak2 , Sandeep Ugale3 , Abhishek Pohekar4 (Dept. of Computer Engineering, D.Y.P.I.E.T., Savitribai Phule Pune University, Pune) I. INDRODUCTION Humans are surrounded by abundance of data, accessible as records, data stores, or mixed media sources of information. This data can be accessed by using suitable web indexes, however when these are accessible, clients frequently don't start a search action, in light of the reality that their present work does not allow them to do such tasks, or they are not mindful that applicable data is accessible. In this paper, a novel technique of suggesting archives just-in-time that is identified with clients' present work is suggested. At the time, when these tasks are primarily conversational, for instance when clients take part in a meeting, then their data needs can be understood by the keywords present in speech, acquired through continuous automatic speech recognition (ASR) engine. These certain implicitly generated questions are utilized to recover and suggest reports or documents from the websites or a local repository, which clients can decide to, investigate in more detail if they discover them intriguing. The focus of our paper is on showing framework for utilization in meeting room or conference where information needs to be fetched in-time, to help involved people in better understanding of the topic. This shown framework will be including speech to text translation, extraction of words from this text, clustering those words for topic identification, generating implicit queries from identified main words and retrieving documents from the available storehouses related to the words in the queries. Various libraries (regional/language specific) can be added to the ASR engine to improve speech recognition efficiency. RESEARCH ARTICLE OPEN ACCESS Abstract: This paper is focused towards extraction of important words from conversations, and then these words are utilized to recover a small number of relevant documents, which can be given to users as per their needs. In any case, even a short audio fragment contains of large number of words, which are conceivably identified with a few topics and also, using automatic speech recognition (ASR) framework reduces errors in the output. It is difficult to provide the data needs of the conversation members appropriately. A calculation to remove decisive words from the yield of an ASR framework (or a manual transcript for testing) to support the potentially differing qualities of subjects and decrease ASR commotion is proposed. At that point, a technique is used to make many implicit queries from the selected keywords. These queries will in return produce list of relevant documents. The scores demonstrate that our proposition is modified over past systems that consider just word re-occurrence or topic closeness, and speaks to an appropriate answer for a report recommender framework to be utilized as a part of discussions. Keywords — Document recommendation, information retrieval, keyword extraction.
  • 2. International Journal of Engineering and Techniques - Volume 2 Issue 3, May – June 2016 ISSN: 2395-1303 http://www.ijetjournal.org Page 77 II. RELATED WORK A. Remembrance Agent: A continuously running automated information retrieval system. Authors: B. Rhodes and T. Starner Remembrance Agent (RA) is a program which can display list of documents relevant to user needs. RA executes without continuous user interaction. RA is basically used to give suggestions to user which can be used or ignored as desired. Computer used to wait for user interaction, RA makes use of such wait cycle to perform its search operation which is relevant to user’s context. RA reminds about relevant data of user task. RA gives suggestions in one line description at bottom of screen. For the front-end for RA, EMACS-19 and lisp are used. One-line suggestions and number to indicate relevancy is displayed by front-end. Full text document can be obtained as per the request. A program which is used as back-end for RA produces suggestions for given text query from pre- indexed documents. SMART decides suggestions for RA, depending upon common word frequency. This system consists of two sources of suggestions. In first source, first three lines print suggestions from last year’s personal email (approximately up to 60MB). And in case of second source, last line prints suggestions from 7MB files which were entered over past few years. RA should suggest documents related to keywords in query. This works on ‘scopes’, which are centred on current location of cursor. There are three types of scopes used in RA. First scope is around the last thousands of words. Second is around the last 50 words. And third is around the last 10 words. User can customize scope coverage, time updates. B. Enforcing topic diversity in a document recommender for conversations. Authors: M. Habibi and A. Popescu-Belis In our system, we also focus on building of concise, diverse and relevant lists of documents which provide information to the users according to their need. From this paper, we study the concept of merging lists of documents which are obtained through multiple implicit queries, formed for small conversations fragments. The aim of the method used in this paper is to get a unique and precise list of documents that can satisfy user’s need in real time. The obtained list should cover the maximum number of implicit queries and topics. The proposed method rewards: a. Topic similarity – It is concerned with selection of the most relevant documents to the conversation. b. Topic diversity – It is concerned with coverage of the maximum implicit queries and topics in a precise and relevant list of recommendations when more than one topic is discussed in the conversation. ACLD system is used as framework for this document recommender system. The ACLD system monitors the conversation, and generates queries based on the keywords detected by a real-time ASR system. The queries are then fired periodically for retrieving documents. These documents are then recommended to the users according to their needs. Here a diverse merging of lists is done. It is the process of generating a short, diverse and relevant list of recommended documents. The generated list should cover the maximum number of topics of each conversation fragment. The method proceeds in two steps:
  • 3. International Journal of Engineering and Techniques - Volume 2 Issue 3, May – June 2016 ISSN: 2395-1303 http://www.ijetjournal.org Page 78 a. Document and Query Representation: In this part, the representation of the queries and the corresponding list of documents from the Apache Lucene search engine are done. This representation is done using topic modelling techniques. b. Ranking Documents: In this step, ranking of documents is done by using topical similarity and then, rewarding the coverage of different lists is taken care. C. A Speech-based Just-in-Time Retrieval System using Semantic Search Authors: A. Popescu-Belis, M. Yazdani, A. Nanchen, and P. N. Garner A just-in time retrieval system for extraction of useful information is required in our project. We have used ACLD i.e., Automatic Content Linking Device (ACLD) for our project. ACLD is a just-in-time document recommendation system for meetings. From this paper, we study the concept of content linking for our recommendation system. This is done by storing documents with tags. a. What is ACLD? ACLD is a recommendation system which analyses spoken input from one or more speakers and retrieves relevant documents for user as per their need. b. Content Linking: Scenarios of Use One of the main usage scenarios for the ACLD is meeting rooms. The ACLD searches the relevant documents for members in the meeting without interrupting their discussion flow. In other usage scenarios, ACLD can be used for live or recorded lectures. The ACLD provides related and relevant documents which are obtained from various repositories, required for the lectures. c. Description of the ACLD ACLD system performs following functions: i.Document Preparation and Indexing: This function is concerned with the extraction of text, and then the indexing of the documents. Apache Lucene software is used for this purpose. ii. Sensing the User’s Information Needs The ACLD system makes use of the AMI real-time ASR system. One of its main features is a pre-compiled grammar. This feature helps in maintaining accuracy even in real time on a low resource machine. iii. Querying the Document Database The Query Aggregator retrieves the most relevant documents from one or more databases using ASR words. The current version of the ACLD makes use of semantic search, while word-based search is used for previous version. iv. Semantic Search over Wikipedia The main focus of our method of semantic search is to improve the relevancy of the retrieved documents as well as the robustness of system to remove noise from the ASR. Here, for document retrieval, the graph-based model of semantic relatedness is used. v. The User Interface (UI) The UI mainly makes all information produced by the system available to user in a configurable way. The users are allowed to use a larger or smaller amount of information according to their needs in this.
  • 4. International Journal of Engineering and Techniques - Volume 2 Issue 3, May – June 2016 ISSN: 2395-1303 http://www.ijetjournal.org Page 79 ACLD is the first just in time retrieval system that uses involuntary speech. Access to multimedia documents and web pages using a robust semantic search method is also supported. D. User Interactions with Everyday Applications as Context for Just-in-time Information Access Authors: J. Budzik and K. J. Hammond Information retrieval is one of the important functions of our project. The retrieved information is then used for obtaining relevant and related documents. As per observations, rich contextual information obtained from our interactions with everyday applications, such as, word processors, web browsers etc. can be used to support just in time retrieval of relevant documents. This paper provides us the concept of just in time information retrieval by using user interactions with everyday applications as a context. For obtaining more clear view about just in time information retrieval, we are using Watson system as an example. Previous efforts in building context for information access can be classified into four classes: a. Relevance feedback in information retrieval: In relevance feedback supporting system, we start with a standard query and then judge the result. Usually a result is judged as relevant or not relevant. Then, this result of user’s judgment can used to customize the original query by casting positive or negative search terms. b. Systems that use user profiles: The aggregation of user profiles can be done by gathering terms based on rating of documents. c. Word-sense disambiguation: Explicit word-sense disambiguation on the user’s part, or approved information intrinsic in the hypertext documents structure can be used for reducing ambiguity in some systems. d. Knowledge engineering approaches: The behaviour of user in a particular application is modelled and then explicit queries are generated in a particular state of tasks in such approaches. III. PROPOSED SYSTEM We propose a system where, output is taken from Automatic Speech Recognition (ASR) system. This output is processed using topic modelling technique instead of directly selecting words on re-occurrence basis. Various functions are used in the system to process. In first step, noisy words like articles (a, an, the), unwanted words, common unnecessary symbols, etc. are removed from the conversation fragment. Human critics ranks various conversational corpora like Fisher, AMI and ELEA which provides audio pieces to test our proposed methods on the basis of relevance with respect to audio pieces from the conversational corpora. Now, a set (mathematics term) is formed from words that are obtained after noisy word removal process. Clustering is carried on the words present in the set by using clustering algorithm. In our system we are using k-means as it is simpler to use and implement and which in turn, reduces the processing time of the system and do not put much load on the system. According topical similarity and co- occurrence of words, the clusters are formed. Then, from each cluster, one query will be formulated implicitly. This will generate more precise query. Documents will be retrieved for each topically different query. One of the primary tasks is to present the list of documents in user-friendly way. Two column
  • 5. International Journal of Engineering and Techniques - Volume 2 Issue 3, May – June 2016 ISSN: 2395-1303 http://www.ijetjournal.org Page 80 GUI is used for this for showing document to user. They are: 1. Query relevant document. 2. Document’s similarity percentage to the keyword (They are based on co-occurrence of words). At various stages of processing, different technique are used to remove errors. Ranking of document helps user in accessing them before displaying. IV. MAIN ALGORITHM 1) Latent Dirichlet Allocation LDA, used for topic modelling, assumes the following generative process for each document w in a large and structured set of texts D: 1. Choose N ∼ Poisson(ξ) 2. Choose θ ∼ Dir (α). 3. For each of the N words wn: a. Choose a topic zn ∼ Multinomial (θ). b. Choose a word wn from p (wn |zn, β), a multinomial probability conditioned on the topic zn. A k-dimensional Dirichlet random variable θ can take values in the (k −1)-simplex (a k- vector θ lies in the (k−1)-simplex if θi ≥ 0, Σk i=1 θi = 1), and has the following probability density on this simplex: 2) Diverse Keyword Extraction Algorithmic form is as follows: Input: A given text t, a set of topics Z, the number of keywords k; Output: a set of keywords S S ← ∅; while |S| ≤ k do S ← S ∪ {argmaxw∈t S (h (w)) where h (w) = Pz∈Z p(z|t)[ r{w},z + rS,z] λ }; end return S; Where, rS,z is the topical similarity with respect to topic z of the keyword set S selected from the fragment t. V. SYSTEM ARCHITECTURE Figure 4 shows ASR system records user’s conversation. The output of ASR is send further to noise removal function. Then diverse keywords are extracted using keyword extraction method. After that these diverse keywords are clustered. Further these keywords are used for obtaining result of implicit queries. Then relevant documents are recommended to user. Fig: System Architecture VI. SYSTEM SCREENSHOTS
  • 6. International Journal of Engineering and Techniques - Volume 2 Issue 3, May – June 2016 ISSN: 2395-1303 http://www.ijetjournal.org Page 81 Fig. 1: Administrator UI Fig. 2: Uploading documents to local repository Fig. 3: Setting filter words Fig. 4: Android Application (ASR Engine) Fig. 5.1: Relevant Document Retrieved in Client UI
  • 7. International Journal of Engineering and Techniques - Volume 2 Issue 3, May – June 2016 ISSN: 2395-1303 http://www.ijetjournal.org Page 82 Fig. 5.2: Relevant Document Retrieved in Client UI VII. FUTURE SCOPE With more processing power, the system could be made real-time recommender. Also, the system can be linked with online encyclopedias like Wikipedia, etc. for more detailed recommendations. The common APIs used in ASR system can be added with region-specific APIs to improve its efficiency. VIII. CONCLUSION In our system architecture, there are four steps of processing after obtaining the ASR output. The speed and precision are the main criterions to judge a just-in-time recommender. Topic diversity is maintained by extracting the keywords using a diverse keyword search algorithm. There are many practical applications like meeting rooms, conferences, workshops, etc. because of implicit query formation process. IX. ACKNOWLEDGEMENT We would like to thank our guide Prof. B. S. Satpute for helping us in understanding complex technical issues regarding project and for helping in all the phases of system development. We are also grateful to Dr. R. K. Jain and Dr. Pramod Patil for their wholehearted support, guidance and for providing us with required amenities and infrastructure. We would like to thank to all the authors, whose paper we have referred, for their research in this field. It helped us in understanding the algorithms and various techniques in detail. REFERENCES 1. B. Rhodes and T. Starner, “Remembrance Agent: A continuously running automated information retrieval system,” in Proc. 1st Int. Conf. Pract. Applicat. Intell. Agents Multi Agent Technol., London, U.K., 1996, pp. 487–495. 2. M. Habibi and A. Popescu-Belis, “Enforcing topic diversity in a document recommender for conversations,” in Proc. 25th Int. Conf. Comput. Linguist. (Coling), 2014, pp. 588–599. 3. A. Popescsu-Belis, M. Yazdani, A. Nanchen, and P. N. Garner, “A speech-based just-in-time retrieval system using semantic search,” in Proc. Annu. Conf. North Amer. Chap. ACL (HLT-NAACL), 2011, pp.80–85. 4. J Budzik and K. J. Hammond, “User interactions with everyday applications as context for just-in-time information access,” in Proc. 5th Int. Conf. Intell. User Interfaces (IUI’00), 2000, pp. 44–51. 5. Maryam Habibi and Andrei Popescu-Belis, “Keyword Extraction and Clustering for Document Recommendation in Conversations”, in IEEE/ACM Transactions on Audio, Speech and Language Processing, Vol. 23, NO. 4, April 2010