ECWAY TECHNOLOGIES
IEEE PROJECTS & SOFTWARE DEVELOPMENTS
OUR OFFICES @ CHENNAI / TRICHY / KARUR / ERODE / MADURAI / SALEM / COIMBATORE
CELL: +91 98949 17187, +91 875487 2111 / 3111 / 4111 / 5111 / 6111
VISIT: www.ecwayprojects.com MAIL TO: ecwaytechnologies@gmail.com

AML: EFFICIENT APPROXIMATE MEMBERSHIP LOCALIZATION WITHIN
A WEB-BASED JOIN FRAMEWORK

ABSTRACT:

In this paper, we propose a new type of dictionary-based entity recognition problem, named
Approximate Membership Localization (AML). The popular Approximate Membership
Extraction (AME) provides full coverage of the true matched substrings in a given
document, but its many redundant matches make the AME process inefficient and degrade the
performance of real-world applications that use the extracted substrings.
The AML problem aims to locate non-overlapping substrings, which are a better approximation
of the true matched substrings and do not generate overlapping redundancies. To perform
AML efficiently, we propose an optimized algorithm, P-Prune, that prunes a large part of the
overlapping redundant matched substrings before generating them.
Our study on several real-world data sets demonstrates the efficiency of P-Prune over a
baseline method. We also study AML as applied to a proposed web-based join framework,
a search-based approach that joins two tables using dictionary-based entity
recognition from web documents. The results not only show the advantage of AML over AME
but also demonstrate the effectiveness of our search-based approach.
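
The difference between AME and AML described in the abstract can be illustrated with a minimal sketch. It is not P-Prune: it generates every candidate first and filters afterwards, which is the kind of redundancy P-Prune is said to avoid. The token-level Jaccard similarity, the threshold, the window length limit, the greedy selection rule, and all function names are assumptions made for illustration and are not taken from the paper.

```python
from typing import List, Tuple

# (start token, end token exclusive, dictionary entry, similarity)
Match = Tuple[int, int, str, float]


def jaccard(a: List[str], b: List[str]) -> float:
    """Jaccard similarity between two token lists, treated as sets."""
    sa, sb = set(a), set(b)
    if not sa and not sb:
        return 0.0
    return len(sa & sb) / len(sa | sb)


def ame(tokens: List[str], dictionary: List[str],
        threshold: float, max_len: int) -> List[Match]:
    """AME-style extraction (assumed form): return every token window whose
    similarity to some dictionary entry reaches the threshold."""
    matches: List[Match] = []
    for start in range(len(tokens)):
        last_end = min(start + max_len, len(tokens))
        for end in range(start + 1, last_end + 1):
            window = tokens[start:end]
            for entry in dictionary:
                sim = jaccard(window, entry.lower().split())
                if sim >= threshold:
                    matches.append((start, end, entry, sim))
    return matches


def aml_naive(matches: List[Match]) -> List[Match]:
    """Naive AML-style localization (assumed form): visit candidates from the
    highest similarity down and keep one only if it overlaps no kept match."""
    kept: List[Match] = []
    for m in sorted(matches, key=lambda m: (-m[3], m[0], m[1])):
        if all(m[1] <= k[0] or m[0] >= k[1] for k in kept):
            kept.append(m)
    return sorted(kept)


if __name__ == "__main__":
    doc = "The Canon EOS 5D digital camera and Nikon D90 body"
    dictionary = ["canon eos 5d digital camera", "nikon d90"]
    tokens = doc.lower().split()
    candidates = ame(tokens, dictionary, threshold=0.6, max_len=5)
    print(len(candidates), "AME candidates")
    for match in aml_naive(candidates):
        print(match)
```

In this toy document, the AME step returns many overlapping windows around each product mention, while the localization step keeps only one non-overlapping span per mention.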
