Weave-D - 2nd Progress Evaluation Presentation

Weave-D
A cognitive approach towards data
accumulation and fusion
Thushan Ganegedara
Ruwan Gunarathne
Lasindu Vidana Pathiranage
Buddhima Wijeweera

Why Weave-D?
Growth of amount of information
Handle
data
Temporal
Multi-
modal
Prevent
catastrophic
interference
Incremental
learning
algorithms
Visualizing
information
Intuitive
Simple
Apply
previous
knowledge
to acquire
new
knowledge
Conceptualization
Generalizati
on of
acquired
knowledge
??
??

What is Weave-D?
Accumulate
data (i.e.
Images, Text)
Feature
Extraction
Incremental
learning
Link
generation
Query & Visualize UI
Accumulates temporal, multi-
modal or multi source data in an
organized manner
Extract information from data (ex.
Color, Edge, Shape information of
images)
Incrementally learn using IKASL
algorithm
Links represent relationships
between multi-modal data

Major Research Problems
• Integrating incremental learning algorithm to
the selected artificial perception model
• What are the potential performance
improvements for selected unsupervised
learning algorithm?
• What are the suitable feature extraction
techniques for images and text?
• How to visualize complex learning outcomes to
user?

Major Challenges
• General
▫ Limited resources and novelty of the algorithms
▫ Finding suitable datasets
• Image Feature Extraction
▫ Deciding the best colors space to represent images
▫ Researching shape descriptor implementations
• Text Feature Extraction Techniques
▫ Researching suitable text feature extraction
techniques

Major Challenges (cont.)
• Unsupervised Learning Algorithms
▫ Implementing IKASL
▫ Testing and verifying correctness of IKASL
• Researching information visualization tools to fit
our requirements

Project Scope
• The proposed system will be implemented for
handling only Image and Text inputs
• System will be designed to be used by data
analysts
• System will,
▫ Extract feature vectors of images and texts
▫ Acquire knowledge using data input to Weave-D
▫ Generate links between data

Project Scope
• Persistence technique (ex. SQL DB,XML, etc.)
will be used to store acquired knowledge and
generated links
• Provide an interface for users to query/visualize
information

Assumptions & Limitations
• Selected features(e.g. color, shape,…) provide a
good representation of the data (e.g. images,…)
• In artificial perception model, perception at a
certain layer, can be represented by one most
significant feature from the layer below
• Input data should be compatible with feature
extractors (i.e. Type, Format, …)
• Tools required (e.g. Feature extraction,
Information Visualization) can be utilized in the
project with no/slight modifications

Deliverables
• JAVA implementation of the proposed system
including several sub-components
• Documentation
Incremental
learning
Information
persistence
Information
linking
Information
visualization
• Research Proposal • Literature Review
• Project Scope Document • Architectural Document
• Project Report • User Manual

Deliverables
• Image - Color feature extraction tools (C#)
• Image – Shape feature extraction tools (Matlab)
• Image – Edge feature extraction tools (Java)
• Text feature extraction tools (Java)
• Unsupervised learning algorithm testing and
visualization tools (GSOM, IKASL algorithms)
(Java)
• Modified information visualization tool (Java)

Feasibility of Deliverables
• Feature extraction tools
▫ Images (e.g. img (Rummager))
▫ Text (e.g. Wordnet, uClassify)
• Unsupervised learning algorithm tools
▫ GSOM
▫ IKASL
• Visualization
▫ Arena 3D (with modifications)

IKASL
GSOMSOM
Literature Review
Artificial Perception
Model
Unsupervised Learning
Algorithms
Information
Visualization
Feature Extraction
Techniques
Other Similar Systems
3D
2D
Text
Images

Artificial Perception Model [1]
• Inspired by human perceptive and cognitive
system
• Close resemblance to human brain
• Key features
▫ Supports multiple modalities
▫ Ability to generate high-level perceptions by
aggregating input stimuli belonging to multiple
modalities
▫ Conceptualization of information
[1] Bamunusinghe, Jeewanee, and Damminda Alahakoon. "Artificial Visual Percepts for Image Understanding." In
Proceedings of the International Conference on Intelligent Systems. 2010.

Artificial Perception Model
Specialization of vision
Specialization of
color

Unsupervised Learning
Algorithms

Self-Organizing Maps (SOM) [2]
• Visualization technique which reduces the
dimensions of data to help humans understand
high dimensional data.
• Self-Organizing Map (SOM) is a type of
unsupervised artificial neural network.
• Topology preserving map
[2] Kohonen, Teuvo. "The self-organizing map." Proceedings of the IEEE 78, no. 9 (1990):1464-1480

SOM
2-Dimensional output space High dimensional input space

Growing Self-Organizing Maps
(GSOMs) [3]
• GSOM is an extension of Self-Organizing maps (SOM),
which is very popular in knowledge discovery applications.
• GSOM algorithm overcomes several limitations of SOM.
• The main advantage of GSOM over SOM is that, GSOM has
the ability to grow and modify the shape to represent the
data space better.
• Other similar work are,
▫ Growing Cell Structures (GCS’s)
▫ Neural Gas Algorithm (NGA)
▫ Incremental Grid Growing (IGG)
[3] Alahakoon, Damminda, Saman K. Halgamuge, and Bala Srinivasan. "Dynamic self-organizing maps with controlled
growth for knowledge discovery." Neural Networks, IEEE Transactions on 11, no. 3 (2000): 601-614

GSOM Algorithm
• GSOM is an unsupervised neural network, which
is initialized with four nodes and develops to
represent the input data space.
• There are three main phases which can be
distinguished in GSOM algorithm
▫ Initialization phase
▫ Growing phase
▫ Smoothing phase

Initialization Phase
• Starting four nodes will be initialized with
random values from the input vector space.
(0,1) (1,1)
(0,0) (1,0)

Growing phase
(0,1) (1,1)
(0,0) (1,0)
Input
Euclidian
Distance
Winner
Neighborhood

Smoothing phase
• Growing phase stops when new node growth
saturates
• Reduce learning rate and fix a small starting
neighborhood.
• Find winner and adapt the weights of winner
and neighbors in the same way as in growing
phase.

SOM GSOM
Fixed number of nodes & Grid size Ability to grow and change the shape

IKASL Algorithm [4]
• Most current Hebbian rule based algorithms do not
encompass incremental learning and life-long
learning
• Hebbian rule based unsupervised incremental
learning algorithm
• Is both stable and plastic
• Can be understood as an n-layer structure
• A single layer comprises 2 sub-layers
▫ Learning Layer
▫ Generalized Layer
[4] De Silva, Daswin, and Damminda Alahakoon. "Incremental knowledge acquisition and self learning from text." In
Neural Networks (IJCNN), The 2010 International Joint Conference on, pp. 1-8. IEEE, 2010.

IKASL Algorithm
Learn layer (L1)
Generalized layer (G1)
Learn layer (L2)
Input 1
Input 2

IKASL Algorithm (ctd)
Learn layer (L1)
General layer (G1)
Learn layer (L2)
General layer (G2)
Learn layer (L3)
General layer (G3)

Image Feature Extraction
• In project we do not directly interact with raw
images
• There are lots of redundant data in images
• The solution is feature extraction techniques
• This transformation process of input data to a set of
feature vectors is known as feature extraction
• The Moving Picture Expert Group (MPEG) was
established and it has developed several
implementations
• In MPEG-7: Multimedia content description
interface was created

MPEG-7 Descriptors [5-7]
• Descriptors: a core set of quantitative measures
of audio-visual features
• Some of MPEG-7 Descriptors are,
▫ Dominant Colour Descriptor
▫ Colour Layout Descriptor
▫ Edge Histogram Descriptors
[5] Ortiz, Edward, Cesar Pantoja, and María Trujillo. "An MPEG-7 Browser." InLatin American Conference on Networked
Electronic Media. 2009.
[6] Wu, Peng, Yong Man Ro, Chee Sun Won, and Yanglim Choi. "Texture descriptors in MPEG-7." In Computer Analysis of
Images and Patterns, pp. 21-28. Springer Berlin Heidelberg, 2001
[7] Chatzichristofis, Savvas A., Yiannis S. Boutalis, and Mathias Lux. "Img (rummager): An interactive content based image
retrieval system." In Similarity Search and Applications, 2009. SISAP'09. Second International Workshop on, pp. 151-153.
IEEE, 2009.

Edge Histogram Descriptor[17]
[17] Eitz, Mathias, Kristian Hildebrand, Tamy Boubekeur, and Marc Alexa. "An evaluation of descriptors for large-scale image retrieval
from sketched feature lines." Computers & Graphics 34, no. 5 (2010): 482-498.

Proportion
20018.Jpg, 0.197, 0.162, 0.323, 0.319, 0.437
20019.jpg, 0.340, 0.076, 0.282, 0.303, 0.374
20020.jpg, 0.180, 0.212, 0.333, 0.275, 0.333
20021.jpg, 0.165, 0.222, 0.278, 0.335, 0.409
20024.jpg, 0.324, 0.100, 0.295, 0.281, 0.243
……….
20066.jpg, 0.069, 0.317, 0.257, 0.358, 0.362
Position
20018.jpg, 4,4,4,4,4,4,4,3,4,4,4,4,4,4,4,4
20019.jpg, 4,0,4,4,0,4,4,4,4,4,0,4,3,3,4,4
20020.jpg, 4,4,4,4,2,2,4,2,4,4,1,1,4,4,2,4
20021.jpg, 4,3,3,4,4,4,4,4,4,4,4,4,1,1,1,2
20024.jpg, 3,0,2,0,4,4,4,4,4,3,4,1,3,3,0,0
……….
Feature Vectors generated with EHG
20018.Jpg , 0, 0, 1, 1, 1
20019.Jpg , 1, 0, 0, 1, 1
20020.Jpg , 0, 0, 1, 1, 1
20021.Jpg , 0, 0, 1, 1, 1
20024.Jpg , 1, 0, 1, 1, 0
………
………
Existence

Results (Texture)
Existence Proportion Position

Shape Descriptors
[8] Bosch, Anna, Andrew Zisserman, and Xavier Munoz. "Representing shape with a spatial pyramid kernel." In
Proceedings of the 6th ACM international conference on Image and video retrieval, pp. 401-408. ACM, 2007.
• PHOG Descriptor [8]
▫ Outcomes
 Local shape (Given by each divided region)
 Spatial layout (Given by HOGs of regions of finer spatial grids)

Shape Descriptors
• GIST Descriptor [9]
▫ A holistic representation of an image
▫ Spatial Envelope
 Described by boundary of surface of image and inner
textures
 Properties
 Naturalness, Openness, Roughness, Ruggedness,
Expansion
▫ Estimating spatial envelope properties
 By calculating the energy spectrum of the image
(DFT)
[9] Oliva, Aude, and Antonio Torralba. "Modeling the shape of the scene: A holistic representation of the spatial envelope."
International journal of computer vision 42, no. 3 (2001): 145-175.

Text Feature Extraction
• Suitable text feature extraction techniques are
limited, why?
• Technique
• document is encoded as a histogram of words [10]
• select the set of keywords which are usually regarded
as an important keys, to create a feature vector [11]
• using WordNet lexical to create the feature vector [12]
• using uClassify web-service to create the feature
vector
[10] Kaski, Samuel, Timo Honkela, Krista Lagus, and Teuvo Kohonen. "WEBSOM–self-organizing maps of document
collections." Neurocomputing 21, no. 1 (1998): 101-117.
[11] Chumwatana, Todsanai, K. Wong, and Hong Xie. "A SOM-Based Document Clustering Using Frequent Max Substring
for Non-Segmented Texts." Journal of Intelligent Learning Systems & Applications 2 (2010): 117-125.
[12] Gharib, Tarek F., Mohammed M. Fouad, Abdulfattah Mashat, and Ibrahim Bidawi. "Self Organizing Map-based
Document Clustering Using WordNet Ontologies." International Journal of Computer Science 9 (2012).

Wordnet Lexical Categories
Act
Animal
Artifact
Food
.
.
Communication
Body
Creation
Emotion
Motion
.
.
Weather

uClassify output
docs sport games society Recreation Arts Science Business Computers Health Home
doc1 95.5 4.3 0.1 0 0 0 0 0 0 0
doc2 0 0 0 0 0 84 16 0 0 0
“Football refers to a number of sports that involve, to varying
degrees, kicking a ball with the foot to score a goal. The most popular
of these sports worldwide is association football, more commonly
known as just "football" or "soccer". Unqualified, the word football
applies to whichever form of football is the most popular in the
regional context in which the word appears, including association
football, as well as American football, Australian rules football,
Canadian football, Gaelic football, rugby league, rugby union and
other related games. These variations of football are known as
football codes.”
http://en.wikipedia.org/wiki/Football

Information Visualization
• The process of showing information in more intuitive
manner
• Today data analysts preferred to use computer generated
models
• Information Visualization can be represented by
following taxonomy
Visualizing
Tools
2D
2D
perspective
3D
2D perspective 3D
perspective

Existing Similar Systems
• Watson is an artificial intelligence computer
system capable of answering questions in
natural language.
IBM Watson [17]
[17] IBM Watson. n.d. http://www-03.ibm.com/innovation/us/watson/index.shtml (accessed April 28, 2013).

Significance of Watson
• The ability to discern double meanings of words,
puns, rhymes, and inferred hints.
• Extremely rapid responses
• The ability to process vast amounts of information to
make complex and subtle logical connections
Limitations of Watson
• Cannot process multi-modal data
• Cannot build a higher level perception of its data
• Watson does not learn incrementally
• Requires complex infrastructure

Contributions of Project Members
Major Task(s) Contributor
Implement SOM Ruwan
Research and Implement GSOM Thushan
Testing GSOM Lasindu
Research IKASL Thushan
Research Fuzzy integral Lasindu
Research Image Feature Extraction
Color Buddhima
Edge Ruwan
Shape Thushan
Research Text Feature Extraction Ruwan
Research WSD Lasindu
Research Information Visualization Buddhima

References
[13] Gephi, an open source graph visualization and manupulation software. n.d.
http://gephi.org/ (accessed April 28, 2013).
[14] BioLayout Express 3D. n.d. http://www.biolayout.org/ (accessed April 28, 2013).
[15] Ubigraph: Free dynamic graph visulization software. n.d.
http://ubietylab.net/ubigraph/ (accessed April 28, 2013).
[16] Secrier, Maria. Arena3D. n.d. http://arena3d.org/ (accessed April 28, 2013).

Weave-D - 2nd Progress Evaluation Presentation

Recommended

Recommended

More Related Content

What's hot

What's hot (18)

Viewers also liked

Viewers also liked (20)

Similar to Weave-D - 2nd Progress Evaluation Presentation

Similar to Weave-D - 2nd Progress Evaluation Presentation (20)

Recently uploaded

Recently uploaded (20)

Weave-D - 2nd Progress Evaluation Presentation

Editor's Notes