Unit 2: Data
Processing Concepts
Outlines
•Introduction
•Data Processing Concepts
•Data Processing Activities
•Data Processing Cycle
•Data Hierarchy
•Data File structures
•Application Portfolio Management
•Introduction to Micro Database
Manager
3.1 Data Processing Concepts
Data
The word "data" is the plural of "datum", which
means a fact, observation or occurrence.
Data are representations of facts
pertaining to people, things, ideas and events.
Data are represented by symbols such as
letters, numerals or special symbols.
Data Processing
Data processing is the act of manipulating or
handling data in some manner. The idea of
processing is to transform data into
information. Thus, data processing can be defined
as a series of actions or steps that converts
data into useful information.
Information
Information can be defined as 'data transformed
into a useful and meaningful form for a specific
purpose'. Data is not useful until it is
organised and manipulated; only then
does data become information.
3.2 Data Processing Activities
Data processing consists of all the activities
required to convert data into
information. The tools that help
in processing data can be
manual, mechanical, electromechanical or
electronic: manual tools such as paper and
pencil, mechanical tools such as filing cabinets,
electromechanical tools such as typewriters and
adding machines, and electronic tools such as
calculators and computers.
3.3 Data Processing Cycle
1. Input: The term input refers to the activities required
to record data and to make it available for processing.
Input can also include the steps necessary to
check, verify and validate the data contents.
2. Processing: The term processing denotes the
actual data manipulation techniques, such as
classifying, sorting, calculating, summarising and
comparing, that convert data into information.
3. Output: This is a communication function, which
transmits the information generated by processing
the data to the persons who need it.
•Sometimes output also includes a decoding activity,
which converts the electronically generated
information into human-readable form.
4. Storage: This involves the filing of data and information
for future use.
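The four steps of the cycle can be sketched in a few lines of Python. This is only an illustration: the function names, the sample rows and the validation rule (a non-empty name and a non-negative amount) are assumptions made for the example, not part of the unit.

```python
import json

def validate(raw_rows):
    """Input step: record the data and keep only rows that pass the checks."""
    valid = []
    for name, amount in raw_rows:
        # Assumed rule: a name must be present and the amount non-negative.
        if name and isinstance(amount, (int, float)) and amount >= 0:
            valid.append((name.strip(), float(amount)))
    return valid

def process(rows):
    """Processing step: classify by name, calculate totals, then sort."""
    totals = {}
    for name, amount in rows:
        totals[name] = totals.get(name, 0.0) + amount
    return dict(sorted(totals.items()))

def output(summary):
    """Output step: turn the result into human-readable lines."""
    return [f"{name}: {total:.2f}" for name, total in summary.items()]

def store(summary):
    """Storage step: serialise the result so it can be filed for future use."""
    return json.dumps(summary)

# Hypothetical sample data: two rows are rejected by validation.
raw = [("Asha", 120), ("Ben", 80.5), ("", 10), ("Asha", 30), ("Ben", -5)]
summary = process(validate(raw))
report = output(summary)
archived = store(summary)
```

Keeping each step in its own function mirrors the cycle: the invalid rows never reach processing, and the stored text can be read back later without repeating the earlier steps.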
2.7 Data Hierarchy
The data hierarchy shows the arrangement of data in hierarchical
form: fields, records, files and so on.
A data field holds a single item of data, such as the date September 8, 1971.
A record holds the details of one entity, such as an employee, with
name fields, address fields, a date of birth field, etc.
A file holds related records. It helps in keeping track of
employee details.
Files are organised into a database using a DBMS.
In terms of data storage, data fields are made up of bytes, which
consist of bits.
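The levels of the hierarchy can be made concrete with a small Python sketch. The class names, the sample employee and the plain dictionary standing in for a database are all hypothetical; a real DBMS would manage the files.

```python
from dataclasses import dataclass, field
from datetime import date

@dataclass
class EmployeeRecord:
    """A record: a group of related fields describing one employee."""
    name: str              # field
    address: str           # field
    date_of_birth: date    # field

@dataclass
class EmployeeFile:
    """A file: a collection of related records."""
    records: list = field(default_factory=list)

# A dictionary of named files stands in for a database here.
database = {"employees": EmployeeFile()}
database["employees"].records.append(
    EmployeeRecord("A. Kumar", "12 Park Road", date(1971, 9, 8))
)

# Below the field level: a field is stored as bytes, and each byte is 8 bits.
record = database["employees"].records[0]
name_bytes = record.name.encode("utf-8")
first_byte_bits = format(name_bytes[0], "08b")
```

Here the name field occupies 8 bytes, and its first byte (the letter "A") is the bit pattern 01000001.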
3.8 Data File Structures
A file format is a particular way that information is encoded
for storage in a computer file. Since a disk drive, or indeed
any computer storage, can store only bits, the computer
must have some way of converting information to 0s and
1s and vice versa. There are different kinds of formats
for different kinds of information.
Within any format type, e.g. word processor
documents, there will typically be several different
formats. File formats are divided into proprietary and
open formats.
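The conversion between information and bits can be illustrated with a short Python sketch. It assumes UTF-8 as the text encoding, which is one common choice among many; the function names are made up for the example.

```python
def to_bits(text):
    """Encode text as UTF-8 bytes and show each byte as 8 bits."""
    data = text.encode("utf-8")
    return " ".join(format(byte, "08b") for byte in data)

def from_bits(bits):
    """Reverse the conversion: bit groups back to bytes, bytes back to text."""
    data = bytes(int(chunk, 2) for chunk in bits.split())
    return data.decode("utf-8")

encoded = to_bits("Hi")       # "01001000 01101001"
decoded = from_bits(encoded)  # "Hi"
```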
1. Generality
Some file formats are designed to store only a
particular type of data. JPEG, for example, is
used to store static photographic images, while
the GIF format is used to store both still images
and animations. The QuickTime format, by contrast,
can store several types of multimedia.
2. Specifications
Many file formats, including some of the most well-known,
have a published specification document (often with a
reference implementation) that describes exactly how the
data is to be encoded. The specification can be used to
determine whether or not a particular program treats a
particular file format correctly.
3. Identifying the Type of a File
A method is required to determine the format of a
particular file within the file system. The format of a
file is an example of metadata.
4. Filename extension
A common method used by operating systems is to
determine the format of a file from the section of its name
following the final period, which is known as the filename
extension.
For example, HTML documents have the extension
.html or .htm, while image files have extensions
such as .gif or .png.
A company logo may be needed in
both .tif format (for publishing) and .gif format (for web
sites). With the extensions visible, these would appear
as the unique filenames "CompanyLogo.tif" and
"CompanyLogo.gif". With the extensions hidden, these
would both appear to have the identical filename
"CompanyLogo", making it more difficult to determine
which one to select for a particular application.
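Reading the extension, and seeing what is lost when it is hidden, can be sketched with Python's standard pathlib module. The helper name extension_of and the list of filenames are chosen for this example only.

```python
from pathlib import PurePath

def extension_of(filename):
    """Return the part of the name after the final period, in lower case."""
    return PurePath(filename).suffix.lower()

names = ["CompanyLogo.tif", "CompanyLogo.gif", "index.HTML", "README"]
extensions = {name: extension_of(name) for name in names}

# With extensions hidden, the two logo files collapse to one visible name.
stems = {PurePath(name).stem for name in names[:2]}
```

A name without a period, such as "README", yields an empty extension, which is one reason the extension alone cannot identify every file.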
5. Internal metadata
A second way to identify a file format is to store information
regarding the format inside the file itself. Usually, such
information is written as one or more binary strings, or as
tagged or raw text, placed in fixed, specific locations
within the file. Since the easiest place to locate them is at
the beginning of the file, this area is usually called a file header
when it is greater than a few bytes, or a magic number if it
is just a few bytes long.
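A minimal sketch of magic-number detection in Python follows. The signatures listed are the well-known leading bytes of PNG, GIF, JPEG and PDF files; the table is far from complete, and the function names are assumptions made for the example.

```python
# Leading byte signatures of a few common formats (a deliberately short list).
MAGIC_NUMBERS = [
    (b"\x89PNG\r\n\x1a\n", "PNG image"),
    (b"GIF87a", "GIF image"),
    (b"GIF89a", "GIF image"),
    (b"\xff\xd8\xff", "JPEG image"),
    (b"%PDF-", "PDF document"),
]

def identify(data):
    """Return a format label based on the first bytes, or 'unknown'."""
    for signature, label in MAGIC_NUMBERS:
        if data.startswith(signature):
            return label
    return "unknown"

def identify_file(path):
    """Read only the start of the file; the signatures above fit in 16 bytes."""
    with open(path, "rb") as handle:
        return identify(handle.read(16))
```

Because only the first few bytes are read, the check works even when the filename extension is missing or wrong.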
6. File header
The metadata contained in a file header are not
necessarily stored only at the beginning of the file but might be
present in other areas too, often including the end of the
file, depending on the file format or the type of data it
contains. Character-based (text) files have character-based,
human-readable headers, whereas binary
formats usually feature binary headers. Metadata stored away
from the beginning may require reading
the bytes or records before it.
7. External metadata
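A final way to record a file format is to store the information
about the format in the file system instead of keeping it within the file
itself. This approach keeps the metadata separate from both the
main data and the name, but it is less robust than
either filename extensions or internal metadata, because the
information can be lost when the file is copied to a file
system that does not store it.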
8. Mac OS type-codes
The Mac OS file system is able to store creator
and type codes as part of the directory entry for each
file. These codes are called OSTypes; a HyperCard stack
file is one example of a file identified this way. The type code
specifies the format of the file, while the
creator code specifies the default program with which to open it.
9. OS/2 Extended Attributes
The HPFS, FAT12 and FAT16 file systems
allow the storage of extended attributes with files. These
consist of an arbitrary set of triplets, each with a name, a coded type
for the value and a value, where the names are unique.
10. POSIX extended attributes
On UNIX and Unix-like systems, ext2, ext3, ReiserFS
version 3, XFS, JFS, FFS and HFS+ file systems allow
storage of extended attributes with files.
11. PRONOM Unique Identifiers (PUIDs)
The PRONOM Persistent Unique Identifier (PUID) is an extensible
scheme of persistent, unique and unambiguous identifiers for
file formats. It was created by the National Archives of the
UK as part of its PRONOM technical registry
service.
12. MIME types
MIME types are widely used in Internet-related applications.
They form a standardised system of identifiers, each consisting of a type
and a sub-type separated by a slash.
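The type/sub-type structure can be shown with a short Python sketch. The helper split_mime is hypothetical; mimetypes.guess_type is part of the Python standard library and maps a filename to a MIME type using the filename extension.

```python
import mimetypes

def split_mime(mime_type):
    """Split an identifier such as 'image/gif' into (type, sub-type)."""
    main_type, _, sub_type = mime_type.partition("/")
    return main_type, sub_type

# guess_type returns a (MIME type, encoding) pair based on the extension.
guessed, _encoding = mimetypes.guess_type("page.html")
parts = split_mime("image/gif")
```

Note that guess_type relies on the filename extension, so it inherits the limits of that method; it does not inspect the file contents.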
13. File content based format identification
Another way of identifying a file format is to examine the file
contents for distinguishable patterns.
14. File format identifiers (FFIDs)
File format identifiers are another way to identify file formats
according to their origin and file category. An identifier consists of several digits
of the form NNNNNNNNN-XX-YYYYYYY, where
NNNNNNNNN is a value in a company/standards organisation
database, and XX and YYYYYYY indicate the file type in
hexadecimal.
3.9 Application Portfolio Management
Application Portfolio Management is a system
which is applied in medium to large size
Information Technology organisations. It
applies the lessons of financial portfolio
management in order to compare the financial
benefits of each application with the costs of its
maintenance and operations.
Portfolio
Definition of an application
In application portfolio management, the definition of an
application is an important aspect.
Application software: A set of
executable software components used to
create, update, manage, calculate or display
information for a particular business purpose.
Software component: A set of executable
computer instructions contained in a single deployment
container, which cannot be broken down further.
Inclusions
By this definition, the following are applications:
A web service endpoint that provides three web services: Invoice Create,
Invoice Search and Invoice Detail Get.
A service-oriented business application
with a user interface for invoices, which calls the
Invoice Create service.
A legacy system with a rich client, a
server-based middle tier and a database.
A website publishing system that pulls data from a
database and publishes it in HTML format as a sub-site
on a public URL.
Exclusions
The following are not applications:
An HTML website.
A database that is not part of a series of steps
that delivers business value.
A web service that is incapable of being part of a set of
steps.
A stand-alone batch script that compares the
contents of databases by
making calls to them.
3.10 Introduction to Micro Database Manager
Micro DB Manager is a database abstraction
class written in PHP using object-oriented
techniques.
The class provides the following functions:
Connecting to the database
Executing queries
Converting results to associative array
Getting selected rows
Getting affected rows
Getting last insert id
Getting the number of executed queries
Getting the execution time of the queries
Getting error messages and codes.
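The list above can be illustrated with a rough Python analogue built on the standard sqlite3 module. Micro DB Manager itself is a PHP class and its actual method names are not given here, so every name below (MiniDbManager, execute, the attribute names, the employee table) is hypothetical and this sketch is not its API.

```python
import sqlite3
import time

class MiniDbManager:
    """Hypothetical Python analogue; not the Micro DB Manager API."""

    def __init__(self, path=":memory:"):
        self.conn = sqlite3.connect(path)     # connecting to the database
        self.conn.row_factory = sqlite3.Row   # rows addressable by column name
        self.query_count = 0                  # number of executed queries
        self.total_time = 0.0                 # execution time of the queries
        self.affected_rows = 0
        self.last_insert_id = None
        self.last_error = None

    def execute(self, sql, params=()):
        """Run one query; return rows as dictionaries, or None on error."""
        start = time.perf_counter()
        try:
            cursor = self.conn.execute(sql, params)
            rows = [dict(row) for row in cursor.fetchall()]
            self.conn.commit()
            self.affected_rows = cursor.rowcount
            self.last_insert_id = cursor.lastrowid
            self.last_error = None
            return rows
        except sqlite3.Error as exc:
            self.last_error = str(exc)        # error message for the caller
            return None
        finally:
            # Failed queries are counted and timed as well.
            self.query_count += 1
            self.total_time += time.perf_counter() - start

db = MiniDbManager()
db.execute("CREATE TABLE employee (id INTEGER PRIMARY KEY, name TEXT)")
db.execute("INSERT INTO employee (name) VALUES (?)", ("A. Kumar",))
new_id = db.last_insert_id    # id of the row just inserted
changed = db.affected_rows    # rows changed by the INSERT
rows = db.execute("SELECT id, name FROM employee")
failed = db.execute("SELECT * FROM missing_table")
```

Wrapping the connection in one class keeps the bookkeeping (query count, timing, last error) in a single place, which is the main point of a database abstraction class.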
3.11 Glossary
1. Data - Facts, observations or
occurrences.
2. Data processing - The act of manipulating or
handling data in a particular
manner.
3. Processing - The procedure of giving
meaning to data.
4. Application Portfolio Management - A system
applied in medium to large size Information
Technology organisations.

"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
 

BBA203 Unit 2: Data Processing Concepts

  • 2. Outline • Introduction • Data Processing Concepts • Data Processing Activities • Data Processing Cycle • Data Hierarchy • Data File Structures • Application Portfolio Management • Introduction to Micro Database Manager
  • 3. 3.1 Data Processing Concepts. Data: The word "data" is the plural of "datum", which means a fact, observation or occurrence. Data are representations of facts pertaining to people, things, ideas and events, and are represented by symbols such as letters, numerals or special characters. Data Processing: Data processing is the act of manipulating or handling data in some manner in order to transform it into information. It can therefore be defined as a series of actions or steps that converts data into useful information.
  • 4. Information: Information can be defined as 'data transformed into a useful and meaningful form for a specific purpose'. Data is therefore not useful until it has been organised and manipulated; only then does it become information.
  • 5. 3.2 Data Processing Activities. Data processing consists of all the activities required to convert data into information. The tools that help in processing data can be manual (paper and pencil), mechanical (filing cabinets), electromechanical (typewriters and adding machines) or electronic (calculators and computers).
  • 6. 3.3 Data Processing Cycle. 1. Input: The activities required to record data and make it available for processing. Input can also include the steps needed to check, verify and validate the data. 2. Processing: The actual data manipulation techniques, such as classifying, sorting, calculating, summarising and comparing, that convert data into information. 3. Output: A communication function that transmits the information generated by processing to the people who need it. Output sometimes also includes a decoding activity, which converts electronically generated information into human-readable form.
  • 7. 4. Storage: The filing of data and information for future use.
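The four stages of the cycle can be sketched as four small functions. This is only an illustration under stated assumptions: the employee rows, the validation rule and all function names (`input_stage`, `processing_stage`, `output_stage`, `storage_stage`) are hypothetical and do not come from the slides.

```python
import json

def input_stage(raw_rows):
    """Input: record the data and validate it before processing."""
    valid = []
    for name, dept, salary in raw_rows:
        # Assumed validation rule: non-empty text fields, non-negative salary.
        if name and dept and isinstance(salary, (int, float)) and salary >= 0:
            valid.append({"name": name, "dept": dept, "salary": salary})
    return valid

def processing_stage(records):
    """Processing: sort, classify by department and summarise (average)."""
    by_dept = {}
    for rec in sorted(records, key=lambda r: r["name"]):
        by_dept.setdefault(rec["dept"], []).append(rec["salary"])
    return {dept: sum(vals) / len(vals) for dept, vals in by_dept.items()}

def output_stage(summary):
    """Output: communicate the information in human-readable form."""
    return [f"{dept}: average salary {avg:.2f}" for dept, avg in sorted(summary.items())]

def storage_stage(summary):
    """Storage: serialise the information so it can be filed for future use."""
    return json.dumps(summary, sort_keys=True)

# Hypothetical raw data; the third row fails validation (empty name).
raw = [("Asha", "Sales", 3000), ("Ben", "Sales", 5000), ("", "HR", 4000), ("Chen", "HR", 4200)]
records = input_stage(raw)
summary = processing_stage(records)
report = output_stage(summary)
stored = storage_stage(summary)
```

Each stage takes the result of the previous one, which mirrors how the cycle turns raw data into stored information.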
  • 8. 2.7 Data Hierarchy. The data hierarchy arranges data in levels: bits, bytes, fields, records, files and databases. A data field holds a single item, for example a date such as September 8, 1971. A record groups the related fields that describe one entity, for example an employee's name, address and date of birth fields. A file holds related records and so helps to keep track of, say, employee details. Files are organised into a database using a DBMS. At the storage level, a data field is made up of bytes, which in turn consist of bits.
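The levels of the hierarchy can be illustrated with a short sketch. The class name `EmployeeRecord`, the sample names and the addresses are hypothetical; only the date reuses the example from the slide.

```python
from dataclasses import dataclass
from datetime import date

@dataclass
class EmployeeRecord:
    """A record: a group of related fields describing one employee."""
    name: str             # field
    address: str          # field
    date_of_birth: date   # field

# A file: a collection of related records.
employee_file = [
    EmployeeRecord("A. Example", "1 Sample Street", date(1971, 9, 8)),
    EmployeeRecord("B. Example", "2 Sample Street", date(1980, 1, 15)),
]

# At the storage level a field is a sequence of bytes, and each byte is 8 bits.
field_bytes = employee_file[0].name.encode("ascii")
field_bits = [format(b, "08b") for b in field_bytes]
```

A DBMS would sit one level above this, managing many such files as a database.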
  • 9. 3.8 Data File Structures. A file format is a particular way of encoding information for storage in a computer file. Since a disk drive, or indeed any computer storage, can store only bits, the computer must have some way of converting information to 0s and 1s and vice versa. There are different kinds of formats for different kinds of information, and within any one kind (for example, word processor documents) there are typically several different formats. File formats are divided into proprietary and open formats.
  • 10. 1. Generality: Some file formats are designed to store only a particular type of data. JPEG, for example, stores static photographic images, GIF stores images and animations, and QuickTime stores certain types of multimedia. 2. Specifications: Many file formats, including some of the best-known ones, have a published specification document (often with a reference implementation) that describes exactly how the data is to be encoded. The specification can be used to determine whether a particular program handles the format correctly.
  • 11. 3. Identifying the Type of a File: A method is required to determine the format of a particular file within the file system. The format is an example of metadata. 4. Filename extension: Many operating systems determine the format of a file from the part of its name that follows the final period, known as the filename extension. For example, HTML documents have the extension .html or .htm, while image files have extensions such as .gif or .png.
  • 12. For example, a company logo may be needed both in .tif format (for publishing) and in .gif format (for websites). With the extensions visible, the two files appear under the unique filenames "CompanyLogo.tif" and "CompanyLogo.gif". With the extensions hidden, both appear under the identical name "CompanyLogo", which makes it harder to tell which one to select for a particular application. 5. Internal metadata: A second way to identify a file format is to store information about the format inside the file itself. Such information is usually written as one or more binary strings, or as tagged or raw text, placed at fixed, specific locations within the file. Since the easiest place to locate it is the beginning of the file, this area is usually called a file header when it is longer than a few bytes, or a magic number when it is only a few bytes long.
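The two identification methods described so far, filename extension and magic number, can be compared in a minimal sketch. The signature table lists a few well-known magic numbers (PNG, GIF, JPEG, PDF) and is far from complete; the function names and the sample file name `logo.gif` are hypothetical.

```python
import os

# A few well-known magic numbers found at the very start of a file.
SIGNATURES = [
    (b"\x89PNG\r\n\x1a\n", "png"),
    (b"GIF87a", "gif"),
    (b"GIF89a", "gif"),
    (b"\xff\xd8\xff", "jpeg"),
    (b"%PDF-", "pdf"),
]

def format_from_extension(filename):
    """Return the lower-cased text after the final period, or None."""
    ext = os.path.splitext(filename)[1]
    return ext[1:].lower() or None

def format_from_header(data):
    """Return the format whose magic number matches the first bytes, or None."""
    for magic, fmt in SIGNATURES:
        if data.startswith(magic):
            return fmt
    return None

# A mislabelled file: the name claims GIF, but the content starts like a PNG.
name = "logo.gif"
content = b"\x89PNG\r\n\x1a\n" + b"\x00" * 8
by_name = format_from_extension(name)
by_content = format_from_header(content)
```

When the two answers disagree, as here, the header is usually the more trustworthy of the two, because renaming a file changes only its extension.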
  • 13. 6. File header: The metadata contained in a file header is not necessarily stored only at the beginning of the file. Depending on the file format or the type of data it contains, metadata may be present in other areas too, often including the end of the file, and reaching it may require reading the bytes or records before it. Character-based (text) files have character-based, human-readable headers, whereas binary formats usually have binary headers. 7. External metadata: A final way to record the format of a file is to store the format information in the file system instead of within the file itself. This approach keeps the metadata separate from both the main data and the file name, but it is less portable than either filename extensions or magic numbers, because the information can be lost when the file moves to a file system that does not support it.
  • 14. 8. Mac OS type-codes: The Mac OS Hierarchical File System stores a creator code and a type code as part of the directory entry for each file. These codes are called OSTypes. The type code specifies the format of the file, while the creator code specifies the default program with which to open it; a HyperCard stack file, for example, carries its own creator and type codes. 9. OS/2 Extended Attributes: The HPFS, FAT12 and FAT16 file systems allow extended attributes to be stored with files. These comprise an arbitrary set of triplets, each with a name, a coded type for the value, and a value, where the names are unique. 10. POSIX extended attributes: On UNIX and Unix-like systems, the ext2, ext3, ReiserFS version 3, XFS, JFS, FFS and HFS+ file systems allow extended attributes to be stored with files.
  • 15. 11. PRONOM Unique Identifiers (PUIDs): The PRONOM Persistent Unique Identifier (PUID) is an extensible scheme of persistent, unique and unambiguous identifiers for file formats. It was developed by The National Archives of the UK as part of its PRONOM technical registry service. 12. MIME types: MIME types are widely used in Internet-related applications. They form a standardised system of identifiers, each consisting of a type and a sub-type separated by a slash, for example text/html or image/gif. 13. File content based format identification: Another way to identify a file format is to examine the file contents for distinguishable patterns.
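The type/sub-type structure of a MIME type can be shown with Python's standard `mimetypes` module. Note that `mimetypes.guess_type` looks only at the filename extension, not at the file contents; the helper `split_mime` and the sample file names are hypothetical.

```python
import mimetypes

def split_mime(mime):
    """Split a 'type/subtype' identifier into its two parts."""
    major, _, minor = mime.partition("/")
    return major, minor

# guess_type returns a (mime_type, encoding) pair based on the extension alone.
html_mime, _ = mimetypes.guess_type("index.html")
gif_mime, _ = mimetypes.guess_type("CompanyLogo.gif")

html_parts = split_mime(html_mime)
gif_parts = split_mime(gif_mime)
```

Content-based identification would replace the extension lookup with an inspection of the file's bytes, but the resulting identifier has the same two-part shape.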
  • 16. 14. File format identifiers (FFIDs): File format identifiers are another way to identify file formats according to their origin and file category. An FFID consists of several digits of the form NNNNNNNNN-XX-YYYYYYY, where NNNNNNNNN identifies the originating organisation (a value in a company/standards organisation database), XX categorises the type of file in hexadecimal, and YYYYYYY further identifies the file type.
  • 17. 3.9 Application Portfolio Management. Application Portfolio Management (APM) is a practice applied in medium to large Information Technology organisations. It uses the lessons of financial portfolio management to justify and measure the financial benefits of each application in comparison with the costs of the application's maintenance and operations.
  • 18. Definition of an application: A clear definition of what counts as an application is an important part of application portfolio management. Application software: An executable software component, or a tightly coupled set of such components, that delivers some or all of the steps needed to create, update, manage, calculate or display information for a particular business purpose. Software component: An executable set of computer instructions contained within a single deployment container in such a way that it cannot be broken down further.
  • 19. Inclusions: The following are examples of applications: a web service endpoint that presents three web services (Invoice Create, Invoice Search and Invoice Detail Get); a service-oriented business application that presents a user interface for creating invoices and calls the Invoice Create service; a legacy system composed of a rich client, a server-based middle tier and a database; a website publishing system that pulls data from a database and publishes it in HTML format as a sub-site on a public URL.
  • 20. Exclusions: The following are not applications: an HTML website; a database that is not part of a series of steps that delivers business value; a web service that is incapable of being part of such a set of steps; a stand-alone batch script that compares the contents of databases by making calls to them.
  • 21. 3.10 Introduction to Micro Database Manager. Micro DB Manager is a database abstraction class written in PHP using object-oriented technologies. The class provides the following functions: connecting to the database; executing queries; converting results to an associative array; getting the number of selected rows; getting the number of affected rows; getting the last insert id; getting the number of executed queries; getting the execution time of the queries; getting error messages and codes.
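The kind of functionality listed above can be sketched in Python on top of the standard `sqlite3` module. This is a hypothetical analogue written for illustration only: the class name `MiniDBManager`, its attributes and the `employee` table are assumptions and do not reproduce the API of the PHP Micro DB Manager class.

```python
import sqlite3
import time

class MiniDBManager:
    """Hypothetical Python analogue; NOT the PHP Micro DB Manager API."""

    def __init__(self, path=":memory:"):
        self.conn = sqlite3.connect(path)      # connecting to the database
        self.conn.row_factory = sqlite3.Row    # rows addressable by column name
        self.query_count = 0                   # number of executed queries
        self.total_time = 0.0                  # cumulative execution time (seconds)
        self.last_error = None                 # last error message, if any
        self.affected_rows = 0
        self.last_insert_id = None

    def execute(self, sql, params=()):
        """Run one statement; return rows as a list of dicts, or None on error."""
        start = time.perf_counter()
        try:
            cur = self.conn.execute(sql, params)
            rows = [dict(r) for r in cur.fetchall()]   # "associative array" results
            self.conn.commit()
            self.affected_rows = cur.rowcount
            self.last_insert_id = cur.lastrowid
            self.last_error = None
            return rows
        except sqlite3.Error as exc:
            self.last_error = str(exc)
            return None
        finally:
            self.query_count += 1
            self.total_time += time.perf_counter() - start

db = MiniDBManager()
db.execute("CREATE TABLE employee (id INTEGER PRIMARY KEY, name TEXT)")
db.execute("INSERT INTO employee (name) VALUES (?)", ("Asha",))
inserted_id = db.last_insert_id
inserted_rows = db.affected_rows
rows = db.execute("SELECT id, name FROM employee")
selected_rows = len(rows)
bad = db.execute("SELECT * FROM missing_table")   # fails: table does not exist
```

Keeping the counters and the last error on the object means the caller can inspect them after any query without changing the return value of `execute`.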
  • 22. 3.11 Glossary. 1. Data: Representations of facts, observations and occurrences. 2. Data processing: A series of actions or steps that converts data into useful information. 3. Processing: The manipulation of data, such as classifying, sorting, calculating, summarising and comparing, that gives the data meaning. 4. Application Portfolio Management: A practice applied in medium to large Information Technology organisations to weigh the financial benefits of each application against its maintenance and operating costs.