SlideShare a Scribd company logo
1 of 13
Why and What is Graph Mining?
• Graphs allows us to model complicated structures
• Chemical compounds ( Cheminformatics )
• Protein structures, biological pathways/networks ( Bioinformatics )
• Program control flow, traffic flow and social network analysis
• XML documents, the Web and social network analysis
• Graph search algorithms have been developed in chemical
informatics, computer vision, video indexing and text retrieval.
• Graph Mining has become an active and important theme in data
mining.
Graph Patterns
• Frequent subgraphs
• A subgraph is frequent if its support ( occurrence frequency ) in
a given dataset is no less than a minimum support threshold
• What is support?
• Intuitively the number of transactions containing a single
occurrence
• Useful for
• Characterizing graph sets
• Discriminating different groups of graphs
• Similarity search in graph databases.
Methods for Mining Frequent Subgraphs
• An Apriori based approach and a pattern-growth approach
• Apriori-based algorithms for frequent substructure mining include AGM,
FSG, and a path-join method
• AGM shares similar characteristics with Apriori-based itemset mining.
• FSG and the path-join method explore edges and connections
• Pattern Growth Graph Approach is simplistic pattern growth-based
frequent substructure mining.
Social Network
• A social network is a heterogeneous and multirelational data set
represented by a graph.
• Nodes corresponding to objects and edges corresponding to links
representing relationships or interactions between objects.
• Both nodes and links have attributes.
• Social networks need not be social in context.
• There are many real-world instances of technological, business, economic,
and biologic social networks.
• Examples:
• Electrical power grids
• Telephone call graphs
• the spread of computer viruses
• the World Wide Web
Set Valued Attribute
• A set-valued attribute may be of homogeneous or heterogeneous type.
• A set-valued data can be generalized by:
1. Generalization of each value in the set to its corresponding higher-
level concept.
2. Derivation of the general behavior of the set.
• Generalization can be performed by applying different generalization
operators to explore alternative generalization paths.
• The result of generalization is a heterogeneous set.
Set and Listed Valued Attribute Example
• that the hobby of a person is a set-valued attribute containing the set of
values Tennis, Hockey, Soccer, Violin.
• This set can be generalized to a set of high-level concepts, such as {sports,
music, computer games} or into the number 5.
• a count can be associated with a generalized value to indicate how many
elements are generalized to that value, as in {sports(3),music(1), computer
games(1)}, where sports(3) indicates three kinds of sports. and so on.
Listed Valued Attribute
• List-valued attributes can be generalized in a manner similar to that for
set-valued attributes except that the order of the elements in the list
should be preserved in the generalization.
• A list can be generalized according to its general behavior, such as the
length of the list, the type of list elements, the value range, the weighted
average value for numerical data, or by dropping unimportant elements in
the list.
Listed Valued Attribute Example
• List of data for a person’s education record:
• “((B.Sc. in Electrical Engineering, U.B.C., Dec., 1998),
• (M.Sc. in Computer Engineering, U. Maryland, May, 2001),
• (Ph.D. in Computer Science, UCLA, Aug., 2005))”.
• We can generalize by dropping less important descriptions (attributes) of
each tuple in the list, such as by dropping the month attribute to obtain
“((B.Sc., U.B.C., 1998), : : :)”, and/or by retaining only the most important
tuple(s) in the list,
• Example: “(Ph.D. in Computer Science, UCLA, 2005)”.
Dimensions in Spatial Data Cube
• Three types of dimensions in a spatial data cube:
• A nonspatial dimension
• A spatial-to-nonspatial dimension
• A spatial-to-spatial dimension
• A nonspatial dimension contains only nonspatial data. Nonspatial
dimensions temperature and precipitation can be constructed for the
warehouse.
• Example: Contains nonspatial data whose generalizations are nonspatial
(such as “hot” for temperature and “wet” for precipitation).
• A spatial-to-nonspatial dimension is a dimension whose primitive-level
data are spatial but whose generalization, starting at a certain high level,
becomes nonspatial.
Dimensions in Spatial Data Cube
• Example: Suppose that the dimension’s spatial representation of Mumbai is
generalized to the string “south mumbai.” Although “south mumbai” is a
spatial concept, its representation is not spatial.
• A spatial-to-spatial dimension is a dimension whose primitive level and all
of its high level generalized data are spatial.
• Example: the dimension equi temperature region contains spatial data, as
do all of its generalizations, such as with regions covering 0-5 degrees
(Celsius), 5-10 degree.
Plan, Plan Database and Plan Mining
• A plan consists of a variable sequence of actions.
• A plan database or planbase, is a large collection of plans.
• Plan mining is the task of mining significant patterns or knowledge from a
planbase.
• Plan mining can be used to discover travel patterns of business passengers
in an air flight database or to find significant patterns from the sequences
of actions in the repair of automobiles.
• Plan mining is the extraction of important or significant generalized
(sequential) patterns from a planbase.
Spatial Aggregation and Approximation
• Aggregation and approximation are especially useful for generalizing attributes
with large sets of values, complex structures, and spatial or multimedia data.
• We would like to generalize detailed geographic points into clustered regions,
such as business, residential, industrial, or agricultural areas, according to land
usage.
• A multimedia database may contain complex texts, graphics, images, video
fragments, maps, voice, music, and other forms of audio/video information.
• Generalization on multimedia data can be performed by recognition and
extraction of the essential features and/or general patterns of such data.
• For an image, the size, color, shape, texture, orientation, and relative positions and
structures of the contained objects or regions in the image can be extracted by
aggregation and/or approximation.
• For a segment of music, its melody can be summarized based on the approximate
patterns that repeatedly occur in the segment, while its style can be summarized
based on its tone, tempo, or the major musical instruments played.
• Technologies developed in spatial databases and multimedia databases such as
• spatial data accessing and analysis techniques
• pattern recognition
• image analysis
• text analysis
• content-based image/text retrieval
• multidimensional indexing methods
• Example:
• Suppose that we have different pieces of land for various purposes of
agricultural usage, such as the planting of vegetables, grains, and fruits. These
pieces can be merged or aggregated into one large piece of agricultural land
by a spatial merge.
• However, such a piece of agricultural land may contain highways, houses, and
small stores.
• If the majority of the land is used for agriculture, the scattered regions for
other purposes can be ignored, and the whole region can be claimed as an
agricultural area by approximation.

More Related Content

Viewers also liked

Google Presentation
Google PresentationGoogle Presentation
Google Presentation
guesta599e2
 

Viewers also liked (17)

Darryl Schultz_Resume
Darryl Schultz_ResumeDarryl Schultz_Resume
Darryl Schultz_Resume
 
Soal latihan tik materi excel
Soal latihan tik materi excelSoal latihan tik materi excel
Soal latihan tik materi excel
 
Sni 2836-2008-tata cara perhitungan harga satuan pekerjaan pondasi untuk kons...
Sni 2836-2008-tata cara perhitungan harga satuan pekerjaan pondasi untuk kons...Sni 2836-2008-tata cara perhitungan harga satuan pekerjaan pondasi untuk kons...
Sni 2836-2008-tata cara perhitungan harga satuan pekerjaan pondasi untuk kons...
 
Object Relational Database Management System
Object Relational Database Management SystemObject Relational Database Management System
Object Relational Database Management System
 
Sni 7395-2008-tata cara perhitungan harga satuan pekerjaan penutup lantai dan...
Sni 7395-2008-tata cara perhitungan harga satuan pekerjaan penutup lantai dan...Sni 7395-2008-tata cara perhitungan harga satuan pekerjaan penutup lantai dan...
Sni 7395-2008-tata cara perhitungan harga satuan pekerjaan penutup lantai dan...
 
Octave
OctaveOctave
Octave
 
MINING HEALTH EXAMINATION RECORDS A GRAPH-BASED APPROACH
MINING HEALTH EXAMINATION RECORDS  A GRAPH-BASED APPROACHMINING HEALTH EXAMINATION RECORDS  A GRAPH-BASED APPROACH
MINING HEALTH EXAMINATION RECORDS A GRAPH-BASED APPROACH
 
Mining Electronic Health Records for Insights
Mining Electronic Health Records for InsightsMining Electronic Health Records for Insights
Mining Electronic Health Records for Insights
 
Data Mining Seminar - Graph Mining and Social Network Analysis
Data Mining Seminar - Graph Mining and Social Network AnalysisData Mining Seminar - Graph Mining and Social Network Analysis
Data Mining Seminar - Graph Mining and Social Network Analysis
 
Vat+dyes
Vat+dyesVat+dyes
Vat+dyes
 
Social Data Mining
Social Data MiningSocial Data Mining
Social Data Mining
 
Data Mining: Graph mining and social network analysis
Data Mining: Graph mining and social network analysisData Mining: Graph mining and social network analysis
Data Mining: Graph mining and social network analysis
 
Social media mining PPT
Social media mining PPTSocial media mining PPT
Social media mining PPT
 
Google Presentation
Google PresentationGoogle Presentation
Google Presentation
 
Computer science seminar topics
Computer science seminar topicsComputer science seminar topics
Computer science seminar topics
 
Buffer Overflow Countermeasures, DEP, Security Assessment
Buffer Overflow Countermeasures, DEP, Security AssessmentBuffer Overflow Countermeasures, DEP, Security Assessment
Buffer Overflow Countermeasures, DEP, Security Assessment
 
Google Ppt
Google PptGoogle Ppt
Google Ppt
 

Similar to Graph Mining, Graph Patterns, Social Network, Set & List Valued Attribute, Spatial Data

Data structure chapter 1.pptx
Data structure chapter 1.pptxData structure chapter 1.pptx
Data structure chapter 1.pptx
Kami503928
 
Information storage and retrieval
Information storage and retrievalInformation storage and retrieval
Information storage and retrieval
Sadaf Rafiq
 

Similar to Graph Mining, Graph Patterns, Social Network, Set & List Valued Attribute, Spatial Data (20)

Exploring Data (1).pptx
Exploring Data (1).pptxExploring Data (1).pptx
Exploring Data (1).pptx
 
Review presentation for Orientation 2014
Review presentation for Orientation 2014Review presentation for Orientation 2014
Review presentation for Orientation 2014
 
Building a Spatial Database in PostgreSQL
Building a Spatial Database in PostgreSQLBuilding a Spatial Database in PostgreSQL
Building a Spatial Database in PostgreSQL
 
UNIT - 5: Data Warehousing and Data Mining
UNIT - 5: Data Warehousing and Data MiningUNIT - 5: Data Warehousing and Data Mining
UNIT - 5: Data Warehousing and Data Mining
 
STATISTICAL METHODS IN GEOGRAPHY
STATISTICAL METHODS IN GEOGRAPHYSTATISTICAL METHODS IN GEOGRAPHY
STATISTICAL METHODS IN GEOGRAPHY
 
Graph Models for Deep Learning
Graph Models for Deep LearningGraph Models for Deep Learning
Graph Models for Deep Learning
 
Lecture 01 Intro to DSA
Lecture 01 Intro to DSALecture 01 Intro to DSA
Lecture 01 Intro to DSA
 
Unit 3 part i Data mining
Unit 3 part i Data miningUnit 3 part i Data mining
Unit 3 part i Data mining
 
1. Introduction to Data Structure.pptx
1. Introduction to Data Structure.pptx1. Introduction to Data Structure.pptx
1. Introduction to Data Structure.pptx
 
Chapter 2 - Introduction to Data Science.pptx
Chapter 2 - Introduction to Data Science.pptxChapter 2 - Introduction to Data Science.pptx
Chapter 2 - Introduction to Data Science.pptx
 
Data Mining-2023 (2).ppt
Data Mining-2023 (2).pptData Mining-2023 (2).ppt
Data Mining-2023 (2).ppt
 
ch2 DS.pptx
ch2 DS.pptxch2 DS.pptx
ch2 DS.pptx
 
Data structure chapter 1.pptx
Data structure chapter 1.pptxData structure chapter 1.pptx
Data structure chapter 1.pptx
 
Data analytics for engineers- introduction
Data analytics for engineers-  introductionData analytics for engineers-  introduction
Data analytics for engineers- introduction
 
Lec 3.pptx
Lec 3.pptxLec 3.pptx
Lec 3.pptx
 
Ordbms
OrdbmsOrdbms
Ordbms
 
1.1 introduction to Data Structures.ppt
1.1 introduction to Data Structures.ppt1.1 introduction to Data Structures.ppt
1.1 introduction to Data Structures.ppt
 
Information storage and retrieval
Information storage and retrievalInformation storage and retrieval
Information storage and retrieval
 
GIS ANALYTICS-2011
GIS ANALYTICS-2011GIS ANALYTICS-2011
GIS ANALYTICS-2011
 
Lec01-Algorithems - Introduction and Overview.pdf
Lec01-Algorithems - Introduction and Overview.pdfLec01-Algorithems - Introduction and Overview.pdf
Lec01-Algorithems - Introduction and Overview.pdf
 

Recently uploaded

The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
shinachiaurasa2
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
Health
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
VictorSzoltysek
 

Recently uploaded (20)

Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 
LEVEL 5 - SESSION 1 2023 (1).pptx - PDF 123456
LEVEL 5   - SESSION 1 2023 (1).pptx - PDF 123456LEVEL 5   - SESSION 1 2023 (1).pptx - PDF 123456
LEVEL 5 - SESSION 1 2023 (1).pptx - PDF 123456
 
Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...
Chinsurah Escorts ☎️8617697112  Starting From 5K to 15K High Profile Escorts ...Chinsurah Escorts ☎️8617697112  Starting From 5K to 15K High Profile Escorts ...
Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
10 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 202410 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park %in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learn
 
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
 
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
 

Graph Mining, Graph Patterns, Social Network, Set & List Valued Attribute, Spatial Data

  • 1. Why and What is Graph Mining? • Graphs allows us to model complicated structures • Chemical compounds ( Cheminformatics ) • Protein structures, biological pathways/networks ( Bioinformatics ) • Program control flow, traffic flow and social network analysis • XML documents, the Web and social network analysis • Graph search algorithms have been developed in chemical informatics, computer vision, video indexing and text retrieval. • Graph Mining has become an active and important theme in data mining.
  • 2. Graph Patterns • Frequent subgraphs • A subgraph is frequent if its support ( occurrence frequency ) in a given dataset is no less than a minimum support threshold • What is support? • Intuitively the number of transactions containing a single occurrence • Useful for • Characterizing graph sets • Discriminating different groups of graphs • Similarity search in graph databases.
  • 3. Methods for Mining Frequent Subgraphs • An Apriori based approach and a pattern-growth approach • Apriori-based algorithms for frequent substructure mining include AGM, FSG, and a path-join method • AGM shares similar characteristics with Apriori-based itemset mining. • FSG and the path-join method explore edges and connections • Pattern Growth Graph Approach is simplistic pattern growth-based frequent substructure mining.
  • 4. Social Network • A social network is a heterogeneous and multirelational data set represented by a graph. • Nodes corresponding to objects and edges corresponding to links representing relationships or interactions between objects. • Both nodes and links have attributes. • Social networks need not be social in context. • There are many real-world instances of technological, business, economic, and biologic social networks. • Examples: • Electrical power grids • Telephone call graphs • the spread of computer viruses • the World Wide Web
  • 5. Set Valued Attribute • A set-valued attribute may be of homogeneous or heterogeneous type. • A set-valued data can be generalized by: 1. Generalization of each value in the set to its corresponding higher- level concept. 2. Derivation of the general behavior of the set. • Generalization can be performed by applying different generalization operators to explore alternative generalization paths. • The result of generalization is a heterogeneous set.
  • 6. Set and Listed Valued Attribute Example • that the hobby of a person is a set-valued attribute containing the set of values Tennis, Hockey, Soccer, Violin. • This set can be generalized to a set of high-level concepts, such as {sports, music, computer games} or into the number 5. • a count can be associated with a generalized value to indicate how many elements are generalized to that value, as in {sports(3),music(1), computer games(1)}, where sports(3) indicates three kinds of sports. and so on.
  • 7. Listed Valued Attribute • List-valued attributes can be generalized in a manner similar to that for set-valued attributes except that the order of the elements in the list should be preserved in the generalization. • A list can be generalized according to its general behavior, such as the length of the list, the type of list elements, the value range, the weighted average value for numerical data, or by dropping unimportant elements in the list.
  • 8. Listed Valued Attribute Example • List of data for a person’s education record: • “((B.Sc. in Electrical Engineering, U.B.C., Dec., 1998), • (M.Sc. in Computer Engineering, U. Maryland, May, 2001), • (Ph.D. in Computer Science, UCLA, Aug., 2005))”. • We can generalize by dropping less important descriptions (attributes) of each tuple in the list, such as by dropping the month attribute to obtain “((B.Sc., U.B.C., 1998), : : :)”, and/or by retaining only the most important tuple(s) in the list, • Example: “(Ph.D. in Computer Science, UCLA, 2005)”.
  • 9. Dimensions in Spatial Data Cube • Three types of dimensions in a spatial data cube: • A nonspatial dimension • A spatial-to-nonspatial dimension • A spatial-to-spatial dimension • A nonspatial dimension contains only nonspatial data. Nonspatial dimensions temperature and precipitation can be constructed for the warehouse. • Example: Contains nonspatial data whose generalizations are nonspatial (such as “hot” for temperature and “wet” for precipitation). • A spatial-to-nonspatial dimension is a dimension whose primitive-level data are spatial but whose generalization, starting at a certain high level, becomes nonspatial.
  • 10. Dimensions in Spatial Data Cube • Example: Suppose that the dimension’s spatial representation of Mumbai is generalized to the string “south mumbai.” Although “south mumbai” is a spatial concept, its representation is not spatial. • A spatial-to-spatial dimension is a dimension whose primitive level and all of its high level generalized data are spatial. • Example: the dimension equi temperature region contains spatial data, as do all of its generalizations, such as with regions covering 0-5 degrees (Celsius), 5-10 degree.
  • 11. Plan, Plan Database and Plan Mining • A plan consists of a variable sequence of actions. • A plan database or planbase, is a large collection of plans. • Plan mining is the task of mining significant patterns or knowledge from a planbase. • Plan mining can be used to discover travel patterns of business passengers in an air flight database or to find significant patterns from the sequences of actions in the repair of automobiles. • Plan mining is the extraction of important or significant generalized (sequential) patterns from a planbase.
  • 12. Spatial Aggregation and Approximation • Aggregation and approximation are especially useful for generalizing attributes with large sets of values, complex structures, and spatial or multimedia data. • We would like to generalize detailed geographic points into clustered regions, such as business, residential, industrial, or agricultural areas, according to land usage. • A multimedia database may contain complex texts, graphics, images, video fragments, maps, voice, music, and other forms of audio/video information. • Generalization on multimedia data can be performed by recognition and extraction of the essential features and/or general patterns of such data. • For an image, the size, color, shape, texture, orientation, and relative positions and structures of the contained objects or regions in the image can be extracted by aggregation and/or approximation. • For a segment of music, its melody can be summarized based on the approximate patterns that repeatedly occur in the segment, while its style can be summarized based on its tone, tempo, or the major musical instruments played.
  • 13. • Technologies developed in spatial databases and multimedia databases such as • spatial data accessing and analysis techniques • pattern recognition • image analysis • text analysis • content-based image/text retrieval • multidimensional indexing methods • Example: • Suppose that we have different pieces of land for various purposes of agricultural usage, such as the planting of vegetables, grains, and fruits. These pieces can be merged or aggregated into one large piece of agricultural land by a spatial merge. • However, such a piece of agricultural land may contain highways, houses, and small stores. • If the majority of the land is used for agriculture, the scattered regions for other purposes can be ignored, and the whole region can be claimed as an agricultural area by approximation.