This presentation introduces text summarization, approaches to summarizing a text, and how to improve summaries with a fuzzy classifier and a deep learning model (RBM), followed by some results.
Association of deep learning algorithm with fuzzy logic for multi-document text summarization
1. ASSOCIATION OF DEEP
LEARNING ALGORITHM WITH
FUZZY LOGIC FOR
MULTI-DOCUMENT TEXT
SUMMARIZATION
Abd Almughith Alzabibi
Ahmad Ataya
Baraa Salhany
Mohammad Salem Kabbani
2. INTRODUCTION
With the rapid growth in the quantity and
complexity of document sources on the
internet, it has become increasingly
important to provide improved mechanisms
that help users find the exact information
they need in the available documents.
3. AUTOMATIC TEXT
SUMMARIZATION DEFINITION
Automatic text summarization produces a
condensed version of the original text while
keeping its main content, helping the user
quickly understand large volumes of information.
4. TEXT SUMMARIZATION
CAN BE CLASSIFIED IN
TWO WAYS:
• abstractive summarization
• extractive summarization
5. MAIN OBJECTIVE OF
EXTRACTION APPROACH
The main objective of extraction-based text
summarization is to select the most
appropriate sentences according to the
user's requirements.
9. DEFINE SET OF FIVE
FEATURES FOR EACH
SENTENCE
• Title Similarity Feature:
the ratio of the number of words in the
sentence that occur in the title to the total
number of words in the title.
• Positional Feature
• Term Weight Feature
• Concept Feature
• POS Tagger Feature
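The Title Similarity feature above can be sketched in a few lines. This is an illustrative interpretation only: the tokenization (lowercased whitespace split) and the use of unique shared words are assumptions, not details from the original work.

```python
# Illustrative sketch of the Title Similarity feature; tokenization
# (lowercased whitespace split) is an assumption.
def title_similarity(sentence: str, title: str) -> float:
    """Words shared between sentence and title, divided by title length."""
    title_words = title.lower().split()
    if not title_words:
        return 0.0
    shared = set(sentence.lower().split()) & set(title_words)
    return len(shared) / len(title_words)

print(title_similarity(
    "Deep learning improves text summarization quality",
    "Deep learning for text summarization"))  # 4 of 5 title words -> 0.8
```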
17. FUZZY LOGIC SYSTEM
A set of rules is constructed by comparing the
sentences from the set of documents with the
sentences from the text summary.
18. FUZZY LOGIC SYSTEM
The defuzzifier finally modifies the feature
matrix based on the feature values assigned
to a particular rule and derives the fuzzy
score by evaluating the feature values.
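The defuzzification idea can be sketched with a toy example. This is not the rule base used in the work itself: the triangular membership functions, the LOW/MEDIUM/HIGH sets, and the weighted-average defuzzifier are all illustrative assumptions.

```python
# Toy sketch only: triangular memberships plus weighted-average
# defuzzification; the actual rule base is more elaborate.
def tri(x, a, b, c):
    """Triangular membership function rising on [a, b], falling on [b, c]."""
    if x <= a or x >= c:
        return 0.0
    return (x - a) / (b - a) if x < b else (c - x) / (c - b)

def fuzzy_score(feature_value):
    # Degree of membership in LOW / MEDIUM / HIGH fuzzy sets
    low = tri(feature_value, -0.5, 0.0, 0.5)
    med = tri(feature_value, 0.0, 0.5, 1.0)
    high = tri(feature_value, 0.5, 1.0, 1.5)
    total = low + med + high
    # Weighted average of the rule outputs (0.0 for LOW, 0.5, 1.0)
    return (med * 0.5 + high * 1.0) / total if total else 0.0
```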
20. RESTRICTED
BOLTZMANN MACHINE
• RBM is a stochastic neural
network (a network of neurons
where each neuron has some
random behavior when activated)
• Consists of one layer of visible
units (neurons) and one layer of
hidden units
• Units in each layer have no
connections between them and
are connected to all units in the
other layer
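The bipartite structure can be sketched as follows. The layer sizes, random initialization, and single Gibbs step are assumptions for illustration, not the configuration used in the work; the key point is that weights exist only between the two layers, never within a layer.

```python
import numpy as np

# Minimal RBM sketch (dimensions and initialization are assumptions).
rng = np.random.default_rng(0)
n_visible, n_hidden = 5, 3   # e.g. five sentence features as visible units

W = rng.normal(0.0, 0.1, (n_visible, n_hidden))  # links only BETWEEN layers
b_v = np.zeros(n_visible)                        # visible-layer biases
b_h = np.zeros(n_hidden)                         # hidden-layer biases

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def sample_hidden(v):
    """Stochastic units: each hidden neuron fires with sigmoid probability."""
    p = sigmoid(v @ W + b_h)
    return (rng.random(p.shape) < p).astype(float)

def sample_visible(h):
    p = sigmoid(h @ W.T + b_v)
    return (rng.random(p.shape) < p).astype(float)

v0 = rng.random(n_visible)   # one sentence's feature vector
h0 = sample_hidden(v0)       # no hidden-hidden connections are involved
v1 = sample_visible(h0)      # one Gibbs step back to the visible layer
```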
22. OPTIMAL FEATURE
MATRIX
After obtaining the refined sentence matrix from the
RBM, it is further tested against a threshold
value calculated for each feature.
Ex: If for any sentence
f4 < th4,
then that sentence is filtered out.
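The thresholding step above can be sketched as follows; the function name, matrix layout, and threshold values are illustrative assumptions. A sentence survives only if every feature meets its per-feature threshold.

```python
# Hedged sketch of the thresholding step: drop a sentence if any
# feature f_i falls below its threshold th_i (names are illustrative).
def filter_sentences(feature_matrix, thresholds):
    kept = []
    for idx, features in enumerate(feature_matrix):
        if all(f >= th for f, th in zip(features, thresholds)):
            kept.append(idx)
    return kept

matrix = [[0.6, 0.4, 0.7],   # passes all three thresholds
          [0.2, 0.9, 0.8]]   # fails the first threshold, so it is filtered
print(filter_sentences(matrix, [0.3, 0.3, 0.5]))  # -> [0]
```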
23. OPTIMAL FEATURE
MATRIX
To fine-tune the feature vector set optimally, we
use the back-propagation algorithm.
The deep learning algorithm in this phase uses
cross-entropy error to fine-tune the obtained
feature vector set. The cross-entropy error for
adjustment is calculated for every feature of the
sentence.
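The per-feature cross-entropy error can be sketched as below. The formula shown is the standard binary cross-entropy; treating the original feature value as the target and the RBM-refined value as the output is an assumption for illustration, and the feature values are placeholders.

```python
import math

# Illustrative sketch: per-feature cross-entropy error between a sentence's
# original feature value and its RBM-refined value (notation assumed).
def cross_entropy(target, output, eps=1e-12):
    output = min(max(output, eps), 1.0 - eps)  # clamp to avoid log(0)
    return -(target * math.log(output)
             + (1.0 - target) * math.log(1.0 - output))

original = [0.8, 0.3, 0.6]    # placeholder feature values
refined = [0.7, 0.35, 0.55]   # placeholder RBM outputs
errors = [cross_entropy(t, o) for t, o in zip(original, refined)]
```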
Natural Language Processing (NLP) techniques are used for parsing, word reduction, and generating the text summary in abstractive summarization.
Extractive summarization is flexible and consumes less time compared to abstractive summarization.
Stop words are removed mainly to eliminate insignificant and noisy words.
The weight of a sentence can be calculated by adding the weights of all the terms in the sentence and dividing the sum by the total number of terms in that sentence.
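The sentence-weight calculation just described can be sketched directly; the term weights below are placeholder values (e.g. TF-based scores), not computed from a real corpus.

```python
# Sketch of the sentence-weight calculation: sum of term weights
# divided by the number of terms (term weights are placeholders).
def sentence_weight(term_weights):
    if not term_weights:
        return 0.0
    return sum(term_weights) / len(term_weights)

print(sentence_weight([0.5, 0.25, 0.75]))  # (0.5 + 0.25 + 0.75) / 3 -> 0.5
```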
In addition to the five features, an additional attribute is associated with the feature matrix: the class label for each sentence.
The fuzzy classifier assigns the class labels to the sentences by processing them according to the fuzzy rules.