INTERNAL WEB-CRAWLER
Case study of Spider project
OVERVIEW
• An internal web crawler for the acquisition team that spiders the web to collect
specific information based on keyword and URL seeds
• The collected information is stored in a database as tickets, with
screenshots of the identified target pages, which the acquisition team later
reviews and closes
• It includes admin tool interfaces for managing the seeds and target data, along
with ticket management
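
The seed and ticket records described above could be modelled roughly as follows. This is a minimal sketch only: the class names (Seed, Ticket, TicketStatus) and every field are hypothetical and are not taken from the project's actual schema.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from enum import Enum
from typing import Optional


class TicketStatus(Enum):
    OPEN = "open"
    CLOSED = "closed"


@dataclass(frozen=True)
class Seed:
    """A keyword or URL the crawler starts from (hypothetical shape)."""
    value: str
    kind: str  # "keyword" or "url"


@dataclass
class Ticket:
    """One finding: the seed that led to it, the target page, and its screenshot."""
    seed: Seed
    target_url: str
    screenshot_path: Optional[str] = None
    status: TicketStatus = TicketStatus.OPEN
    created_at: datetime = field(default_factory=lambda: datetime.now(timezone.utc))

    def close(self) -> None:
        """Mark the ticket as reviewed and closed."""
        self.status = TicketStatus.CLOSED
```

A ticket starts in the OPEN state and moves to CLOSED once the team has reviewed it, for example `Ticket(Seed("expired offer", "keyword"), "https://example.com/offer").close()`.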
CHALLENGES
• Controlling outdated and closed ads, offers, notices, invitations, referrals and
similar information across the web had become an imperative and enormous task
• Legal issues and notices were numerous because covering the entire web to
ensure compliance with agreements and terms with affiliates, 3rd-party
service providers, carriers, integrators etc. was an unmanageable task
• Slippage and delays in the manual activity were resulting in hefty fines and
penalties, sometimes even lawsuits
• Even after issues were manually identified and resolved, suspended or dead
websites sometimes came back online, which made the manual task inconsistent
and error-prone and created repetitive / redundant work for dedicated teams
while adding cost overhead
• For a long time, issues could not be identified and resolved before complaints
were raised
SOLUTION
• Implemented a web crawler that uses well-known search engines to
automatically search for keywords, campaigns, affiliate ads and outdated
information, as configured through a purpose-built admin / management tool
• Search result parsing and depth traversal
• Multi-threaded engine running parallel jobs with high accuracy (avoiding
redundant results and tickets)
• Proofing, analysis and validation of site / page content
• Filtered search results stored with the stitched hierarchy of resulting pages (depth)
• Capture of each identified problematic web page as a full-length image / screenshot
(including scrollable / hidden content)
• Ticket creation from each result found, with email alerts / notifications
• After ticket closure, rechecks for reappearance over a specified duration, based
on the set status and configuration
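
The depth traversal, parallel jobs and duplicate avoidance listed above can be sketched as below. This is an illustration under stated assumptions, not the project's implementation: the function names (normalize, crawl, path_to) are hypothetical, breadth-first order is an assumed strategy, and page fetching is passed in as a `fetch_links` callable so the sketch runs without network access.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, Iterable, List, Optional
from urllib.parse import urldefrag


def normalize(url: str) -> str:
    """Drop the #fragment and trailing slash so one page is not visited twice."""
    return urldefrag(url)[0].rstrip("/")


def crawl(
    start_urls: Iterable[str],
    fetch_links: Callable[[str], List[str]],
    max_depth: int = 2,
    workers: int = 4,
) -> Dict[str, Optional[str]]:
    """Breadth-first crawl; returns {page: parent page} for every page reached."""
    parents: Dict[str, Optional[str]] = {}
    frontier: List[str] = []
    for url in start_urls:
        key = normalize(url)
        if key not in parents:
            parents[key] = None  # start pages have no parent
            frontier.append(key)

    with ThreadPoolExecutor(max_workers=workers) as pool:
        for _ in range(max_depth):
            if not frontier:
                break
            # Fetch one whole level in parallel; map() preserves input order.
            results = list(pool.map(fetch_links, frontier))
            next_frontier: List[str] = []
            # De-duplication happens in this single thread, so no lock is needed.
            for page, links in zip(frontier, results):
                for link in links:
                    key = normalize(link)
                    if key not in parents:  # skip pages already seen
                        parents[key] = page
                        next_frontier.append(key)
            frontier = next_frontier
    return parents


def path_to(page: str, parents: Dict[str, Optional[str]]) -> List[str]:
    """Rebuild the chain of pages from a start page down to `page`."""
    chain: List[str] = []
    current: Optional[str] = normalize(page)
    while current is not None:
        chain.append(current)
        current = parents[current]
    return chain[::-1]
```

Recording each page's parent is one simple way to keep the "stitched hierarchy" of result pages: `path_to` walks the parent links back to the start page, so a ticket could show how the crawler reached the page it flagged.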
RESULTS
• After the implementation and adoption of the tool, complaints from the
respective parties dropped drastically (by about 90%)
• Any violations were reported to business teams immediately for further
action (even before anyone could raise objections)
• The business had valid and authentic campaign, ad, referral, offer etc.
information on the web; this avoided end-user confusion and helped uphold
all 3rd parties' interests by staying within agreed terms and compliance
• Helped the customer stay on good terms with all associated external parties
• Reduced the cost overhead of maintaining large teams to perform these
activities manually
• Increased the efficiency of the internal arbitration teams with valid records
and proof
TECHNOLOGY
• Python Qt4 (headless browser tools), Java, PHP, MySQL, jQuery, plugins,
JSON etc.
