Bioinformatics Gaussian by ChARM’s

SUBMITTED TO:
Prof. Neeraj Bhargava
Professor and Head
Department of Computer Science
1
PRESENTED BY:
Arpit Kumar Sharma
M.Tech IV Sem
Department of Computer Science

CONTENT
• MATLAB SOFTWARE
• INTRODUCTION
• BIOINFORMATIC
• CHARM’S
• IMPLEMENTATION
• IMPLEMENTATION IMAGE
• METHODOLOGY
• RESULT & ANALYSIS
• RESULT OF DATASET
2

MATLAB SOFTWARE
• MATLAB is a high-performance language for
technical computing. It integrates computation,
visualization, and programming in an easy-to-use
environment-where problems and solutions are
expressed in familiar mathematical notation.
FEATURES :-
• Math and computation
• Algorithm development
3

Cont.. 4
• Data acquisition Modeling, simulation,
and prototyping Data analysis,
exploration, and visualization
• Ability to Scale
• Scientific and engineering graphic
• Application development, including
graphical user interface building.

INTRODUCTION
• Bioinformatics is an interdisciplinary field that
develops methods and software tools for
understanding biological data.
• As an interdisciplinary field of science,
bioinformatics combines biology, computer
science, mathematics and statistics to analyze and
interpret biological data.
5

Cont.. 6
•Bioinformatics has been used for in
silico analyses of biological queries using
mathematical and statistical techniques.

Cont.. 7
•Bioinformatics is both an umbrella term for
the body of biological studies that
use computer programming as part of their
methodology, as well as a reference to
specific analysis "pipelines" that are
repeatedly used, particularly in the field
of genomics.

8
• ChARM , an unsupervised method for discovering
combinatorial chromatin modification patterns, can
identify histone modifications that occur globally
• ChARM provides a scalable framework
•CHARM: An Efficient Algorithm for Closed
Association Rule Mining

9
•Feature extraction: A total of 953 features are
extracted on a whole-image basis using Cell Profiler.
•Dimension reduction: Features are projected in
principal components space, and a subset of
principal components analysis (PCA) vectors is
retained such that 98 % of the variance present in the
original data distribution is conserved.

Cont. 10
•Classification: Linear Discriminate Analysis
(LDA) is used to classify the selected PCA-
transformed feature vectors.
•Validation: The classifier’s performance is
assessed with 10-fold cross-validation.

METHODOLOGY 12
The National Center for Biotechnology
Information advances science and health by
providing access to biomedical and genomic
information.

Cont.. 13
After the Login NCBI provides the access of features
•Submit
•Download
•Learn
•Develop
• Analyze
•Research

Cont.. 14
•Submit
NCBI collects submissions of data for the world's
largest public repository of biological and scientific
information. Submit the data and track the status of
submission of Data .
.
•Download
The majority of NCBI data are available for
downloading, either directly from the NCBI FTP site
or by using software tools to download custom
datasets. The basic need of download feature provides
three types of scenario.

Cont.. 15
•Learn
NCBI creates a variety of educational products
including courses, workshops, webinars, training
materials and documentation. NCBI educational events
are free and open to everyone. All NCBI educational
materials are available for anyone to re-use and
distribute.
•Develop
NCBI provides a variety of resources that allow
developers to access and manipulate NCBI data in
their applications

Cont.. 16
•Research
Research in the NCBI Computational Biology Branch
(CBB) focuses on theoretical, analytical, and applied
computational approaches to a broad range of
fundamental problems in molecular biology and
medicine.
•Analyze
NCBI provides a wide variety of data analysis tools that
allow users to manipulate, align, visualize and evaluate
biological data.

ANALYSIS 17
•Use GEO2R(Web-Tool) to compare two or more
groups of Samples in order to identify genes that are
differentially expressed across experimental
conditions. Results are presented as a table of genes
ordered by significance. (My Database GEO accession
Name is GSE72586 )
•We Also derives the Value Distribution, Options,
Profile Graph, R-Script .

Bioinformatics Gaussian by ChARM’s

Recommended

Recommended

More Related Content

Similar to Bioinformatics Gaussian by ChARM’s

Similar to Bioinformatics Gaussian by ChARM’s (20)

More from Er. Arpit Sharma

More from Er. Arpit Sharma (9)

Recently uploaded

Recently uploaded (20)

Bioinformatics Gaussian by ChARM’s