PDF vs. TIFF, An Evaluation of Document Scanning File Formats

DocuFi, offering HAI and Infection Prevention Analytics
DocuFi, offering HAI and Infection Prevention AnalyticsPresident and CEO at DocuFi, offering HAI and Infection Prevention Analytics
k at file formats for document sca
PDF v. TIFF
Copyright ©2014
So you’ve
decided to
implement a
document
management or
search and
retrieval
system for
all your
paper
documents.
You
have a
lot of
decisio
ns to
make.
And one of them is, “What file format
should I use?”
PDF
JPEG
Before you can
decide on file
format, you have
some homework to
do.
Answer the
following:
Are the
documents…
• Office Text
Documents
• Magazines/Journals
• Books
• Drawings
• Maps
• Newspapers
• Photographs
Graphic-BasedText-Based
Are they…
Black and
White,
Bitonal,
Grayscale,
Color?
Stained, torn,
aged?
Contain
Handwritten
Notes or
Mixed
Components?
How will I use
them…
Web: Search, View or
Print?
Network Search and
Retrieve (everyday
business use)?
Archival (search and
retrieval or
preservation)?
How will my
users search for
documents?
How will my
users search for
documents?
Designated fields
such as Invoice
No., Customer
Name, Date,
Patient ID…?
or
will they need
free-form
searching on all
text?
Do I have other
considerations?
Legal:
Admissibility and retention
requirements?
Retention:
How long do to keep the file for
the users, legal?
Security:
Do documents need passwords,
restricted usage, changes
tracked?
Retrieval
Limitations:
Can my users wait milliseconds,
seconds, or minutes?
Storage
Limitations:
How many documents do I have? Is
my storage budget limited ?
Conversion:
Will I need to convert or
present the files in another, or
multiple formats later.
Let’s take a look at PDF v.
TIFF, the dominant formats for
scanned documents.
What is
?
(Tagged Image File Format)
TIFF
• Created by Aldus and Microsoft in 1980’s.
Now owned by Adobe.
• Developed as a format for scanned images
• Most recent version, 6.0 published in
1992
• Universal: Broadly adopted, widely
supported by many applications and free
viewers, platform independent
• Many subtypes representing different
compression and color representation
schemes
Source: National Digital Information Infrastructure
and Preservation Program.
What is
?
TIFF
For document scanning purposes, the most notable
versions are:
• Uncompressed,
lossless
TIFF-UNC
• Compressed,
lossless
• Often deployed
for bitonal or
color.
• Most effective
for solid
colors
(graphics),
and less
effective for
24-bit photo
TIFF-LZW
• Compressed,
lossless
• Widely
deployed in
digital
libraries and
businesses as
a master
format for
bitonal
images.
TIFF-G4
*Lossless compression discards no information whereas lossy compression allows some
degradation in order to achieve smaller file size.
What is ?
(Portable Document Format)
PDF
• Created by Adobe over 20 years ago,
portions now maintained by ISO
• Page-oriented and may contain text,
images, graphics, and other multimedia
content, such as video and audio
• Universal: Broadly adopted, widely
supported by many applications and free
viewers, platform independent
• Many subtypes representing different
features
• Optionally: hyperlinks, searchable,
assistive technology, security features,
Source: National Digital Information Infrastructure
and Preservation Program.
For document scanning purposes, the most notable
issues:
What is ?
Searchable
Selecting “make
searchable”, “apply
OCR”, “text-under-
image” or “searchable
PDF” from your scanning
device options creates
a “full-text”
searchable file by
creating a PDF file
with two layers, an
image layer and a text
layer for full-text
searching.
PDF
For document scanning purposes, the most notable
issues:
What is ?
Archive
It differs by omitting features not necessary
for long-term archiving, such as font linking.
Growing in international government and
industry segments, including legal systems,
libraries, newspapers, and regulated
industries.
PDF/A , ISO-standard for digital preservation
or archiving of electronic documents.
PDF
Just a quick
note on
• Used primarily
for
photographs
• Single page
• “Lossy”
compression
• NOT a
“document”
scanning
JPEG
Now
let’s
take a
look
at
decisi
on
points
.
Indexing and
Searchability?
TIFF
TIFF was designed as a
“wrapper for images. Can use
simple tags only. To be fully
searchable, it needs an OCR
process to create a separate
text file that can then be
searched and indexed.
Some document indexing
software packages include this
as an option.
Accommodates basic tags and can
support more sophisticated XML-
based metadata with Adobe's
Extensible Metadata Platform
(XMP). XMP allows you to embed
metadata about a file, into the
file itself.
Full-text searching option is easily
supported and native to the file
format so unless it is saved as an
“image-only” format, it is fully
searchable.
PDF
TIFF
Both TIFF and PDF are universal in that they
are common output formats of many
applications. They also can be accessed and
viewed using many different applications.
TIFF files are easily integrated into other
applications such as Word and PowerPoint as
they are “image” based. Both formats are
viewable across most if not all operating
systems.
Adoption/Portability?
PDF
Longevity/Archiving?
TIFF
Because of the widespread
adoption and plethora of
viewers, TIFF is expected to be
a viable file format for some
time.
Because PDF/A format was
designed for long term use
and has been adopted by
many libraries and
government groups, PDF/A is
the clear winner for archiving
situations.
PDF
Security?
TIFF
There are no built-in security
features. Users can only be
allowed or disallowed access
to TIFF files.
Sophisticated security options.
Includes password protection,
permissions and restricted use
(view, search, print,
cut/copy/paste restrictions),
watermarking, and
encryption.
PDF
Before we take a look at file size
which impacts storage requirements and
upload/download speeds, let’s examine
the four things that effect file size.
Before we take a look at file size
which impacts storage requirements and
upload/download speeds, let’s examine
the four things that effect file size.
1. Scanning Resolution
A 300 dpi scan is much smaller than a 600 dpi scan.
2.Color Space
Color and grayscale scans are much larger than
black and white scans.
3.Physical Dimensions
An 8 ½ by 11 page is much smaller than an 11 x 14,
all other things being equal.
4.Compression
Raw scans can be compressed for a much smaller size
and compression technologies compress different
types scanned of documents differently. Reference: Adobe: Acrolaw Blog
File Size/Upload and
Download Speed?
TIFF PDF
Both TIFF and PDF offer compression
technology. Scan your typical documents with
a variety of file compression formats to
determine the acceptable file size and
upload/download speed for your environment.
Color, Grayscale, or Black
and White?
TIFF PDF
As mentioned previously, G4
compression files are often
used for black and white or
bitonal scans.
TIFF-LZW is often used for
bitonal or color images and is
most effective for solid color
graphics and less effective for
24-bit photos.
PDF files also offer different
compression technologies
which present options for
color space.
Color, Grayscale, or Black
and White?
TIFF PDF
As mentioned previously, G4
compression files are often
used for black and white or
bitonal scans.
TIFF-LZW is often used for
bitonal or color images and is
most effective for solid color
graphics and less effective for
24-bit photos.
PDF files also offer different
compression technologies
which present options for
color space.Both TIFF and PDF support color, grayscale,
and black and white. Here again, scan your
typical documents with a variety of formats
to determine the acceptable output. Caution,
scanning a black and white text document with
a color setting, needlessly creates a large
file.
TIFF PDF
Miscellaneous?
Legal Admissibility: Varies by country. Generally
both file types can be admissible as long as
the appropriate processes are followed for
the rules of evidence for the specific
jurisdiction.
TIFF PDF
Miscellaneous?
Legal Admissibility: Varies by country. Generally
both file types can be admissible as long as
the appropriate processes are followed for
the rules of evidence for the specific
jurisdiction.
Conversion: Both TIFF and PDF files can be
converted with readily available tools. This
may be important if your scanned files are to
be used as “master files”. For example, you
may need to scan for both archival and web
viewing. Because of file size, you may need
to copy and convert a large archival file for
easy web viewing. Hence the “master file”
l
And the
decision
goes to…
…maybe both PDF and TIFF as users often
have a variety of document types with
different requirements.
you decide
Learn More about Document Imaging and
Capture
Contact us for more information on:
• Intelligent data capture
• PDF to TIFF Conversion
• How to convert PDF and TIFF Files
• More tutorial information on document management
• Scanning documents for document management,
• How to intelligently capture index data from your scans
• Requirements for document management scanning
• How to select a document capture or document scanning
solution
• Using touchscreen scanners such as the Fujitsu ScanSnap as an
intelligent capture solution
• Batch document scanning solutions
• Document Management cost savings
• EMR data capture
• Batch Indexing solutions
• Batch document indexing
• Index documents
• Create a document index
• Document management index
• Index from print stream
• ECM index
• Index ECM
By DocuFi
30 years’ experience in the Document Imaging
market.
Find out more at ImageRamp and
www.docufi.com Copyright ©2014
makers of ImageRamp,
Document Management
Capture Solution
Image Credits and
References
• Todd Anderson neurmadic aesthetic, ”Ding” , http://bit.ly/1egCSkU
• Doug Waldron, “Files (85)”, http://bit.ly/1bfciII
• Knile Lucy, you have some sorting to do! http://bit.ly/19bSgjFDave Gray
• Butterbean man, “Decisions”, http://bit.ly/1iqCVSc
• Ben Schumin, SchuminWeb, “Shelves at Archives II”, http://bit.ly/1iqDD1K
• Angel Arcones, Freddy The Boy, “Dia 91: Decisiones”, http://bit.ly/1egCSkU
• MicroAssist “Apples and Oranges”, http://bit.ly/17KPimb
• AJC1, “Checklists”, http://bit.ly/KDCsgO
• Russ, russteaches, “2 Big 2 Small”, http://bit.ly/1hODsdL
• The U.S. Army,” West Point wins collegiate boxing championship”,
http://bit.ly/1g4BAA6
• Aberdeen Proving Ground, “16th pounds 143rd to win Amateur Boxing Tournament”,
http://bit.ly/KLxkH4
All images are owned or licensed by DocuFi with acknowledgement given to:
Reference /Source Material:
• Alternative File Formats for Storing Master Images of Digitisation Projects,
National Library of the Netherlands Research & Development Department
• Department of Physics, Wake Forest University,
• “Sustainability of Digital Formats. Planning for Library of Congress
Collectiion” Library of Congress
1 of 37

Recommended

Automatic file naming and routing for scanned documents and existing files. by
Automatic file naming and routing for scanned documents and existing files.  Automatic file naming and routing for scanned documents and existing files.
Automatic file naming and routing for scanned documents and existing files. DocuFi, offering HAI and Infection Prevention Analytics
6.9K views14 slides
DocuWare Overview by
DocuWare OverviewDocuWare Overview
DocuWare OverviewAtif Sheikh
1.1K views14 slides
Proposal DMS by
Proposal   DMS Proposal   DMS
Proposal DMS Media-Mosaic
10.8K views49 slides
Digital Records Management & Preservation by
Digital Records Management & PreservationDigital Records Management & Preservation
Digital Records Management & Preservationvictor Nduna
2.1K views57 slides
Document management system by
Document management systemDocument management system
Document management systemAbhishek Agrawal
505 views20 slides
Records by
RecordsRecords
RecordsDr. Hina Kaynat
3.1K views14 slides

More Related Content

What's hot

Records Inventory And Appraisal by
Records Inventory And AppraisalRecords Inventory And Appraisal
Records Inventory And AppraisalFe Angela Verzosa
3.6K views17 slides
University electronic management system by
University electronic management systemUniversity electronic management system
University electronic management systemAleksey Lashin
1.7K views25 slides
Basics of records management by
Basics of records managementBasics of records management
Basics of records managementRussell James
19.3K views76 slides
Electronic recordkeeping by
Electronic recordkeepingElectronic recordkeeping
Electronic recordkeepingExpoco
2.8K views23 slides
Ch01 records management by
Ch01 records managementCh01 records management
Ch01 records managementxtin101
980 views15 slides
Introduction to Records Management @ UNC-Chapel Hill by
Introduction to Records Management @ UNC-Chapel HillIntroduction to Records Management @ UNC-Chapel Hill
Introduction to Records Management @ UNC-Chapel HillUNCrecman
2.2K views17 slides

What's hot(20)

University electronic management system by Aleksey Lashin
University electronic management systemUniversity electronic management system
University electronic management system
Aleksey Lashin1.7K views
Basics of records management by Russell James
Basics of records managementBasics of records management
Basics of records management
Russell James19.3K views
Electronic recordkeeping by Expoco
Electronic recordkeepingElectronic recordkeeping
Electronic recordkeeping
Expoco2.8K views
Ch01 records management by xtin101
Ch01 records managementCh01 records management
Ch01 records management
xtin101980 views
Introduction to Records Management @ UNC-Chapel Hill by UNCrecman
Introduction to Records Management @ UNC-Chapel HillIntroduction to Records Management @ UNC-Chapel Hill
Introduction to Records Management @ UNC-Chapel Hill
UNCrecman2.2K views
Why records management is important by OMWOMA JACKSON
Why records management is importantWhy records management is important
Why records management is important
OMWOMA JACKSON12.7K views
Document Management With Workflow Presentation by John Street
Document Management With Workflow PresentationDocument Management With Workflow Presentation
Document Management With Workflow Presentation
John Street12.6K views
Electronic Records Management by Brad Houston
Electronic Records ManagementElectronic Records Management
Electronic Records Management
Brad Houston13.2K views
Digital Archiving Solutions Presentation English by amangu
Digital Archiving Solutions Presentation EnglishDigital Archiving Solutions Presentation English
Digital Archiving Solutions Presentation English
amangu5K views
Archival Management: Principles and Techniques by Fe Angela Verzosa
Archival Management: Principles and TechniquesArchival Management: Principles and Techniques
Archival Management: Principles and Techniques
Fe Angela Verzosa45.8K views
Seminar ppt hazards to library by Nagendra N
Seminar ppt hazards to librarySeminar ppt hazards to library
Seminar ppt hazards to library
Nagendra N2.7K views
Negotiation Power Skills Applied in Library Services Management by Shirley Ingles-Cruz
Negotiation Power Skills Applied in Library Services ManagementNegotiation Power Skills Applied in Library Services Management
Negotiation Power Skills Applied in Library Services Management
Organization of Archival Materials by Fe Angela Verzosa
Organization of Archival MaterialsOrganization of Archival Materials
Organization of Archival Materials
Fe Angela Verzosa11.5K views

Viewers also liked

ChronoScan Document Scanning and Capture for Unparralleled Data Extraction an... by
ChronoScan Document Scanning and Capture for Unparralleled Data Extraction an...ChronoScan Document Scanning and Capture for Unparralleled Data Extraction an...
ChronoScan Document Scanning and Capture for Unparralleled Data Extraction an...DocuFi, offering HAI and Infection Prevention Analytics
3.4K views33 slides
Document scanning and capture (local, central, outsource) what's working best by
Document scanning and capture (local, central, outsource) what's working bestDocument scanning and capture (local, central, outsource) what's working best
Document scanning and capture (local, central, outsource) what's working bestVander Loto
2.3K views26 slides
Image Scanning Services by
Image Scanning ServicesImage Scanning Services
Image Scanning ServicesGlobal Associates
1.1K views6 slides
Scanning Document Types | Record Nations by
Scanning Document Types | Record NationsScanning Document Types | Record Nations
Scanning Document Types | Record NationsRecord Nations
298 views1 slide
Apa itu soft copy by
Apa itu soft copyApa itu soft copy
Apa itu soft copyjohnthj
65.8K views7 slides
What can barcodes do for me? A look at barcodes in Document Management/EMR da... by
What can barcodes do for me? A look at barcodes in Document Management/EMR da...What can barcodes do for me? A look at barcodes in Document Management/EMR da...
What can barcodes do for me? A look at barcodes in Document Management/EMR da...DocuFi, offering HAI and Infection Prevention Analytics
4.9K views25 slides

Viewers also liked(15)

Document scanning and capture (local, central, outsource) what's working best by Vander Loto
Document scanning and capture (local, central, outsource) what's working bestDocument scanning and capture (local, central, outsource) what's working best
Document scanning and capture (local, central, outsource) what's working best
Vander Loto2.3K views
Scanning Document Types | Record Nations by Record Nations
Scanning Document Types | Record NationsScanning Document Types | Record Nations
Scanning Document Types | Record Nations
Record Nations298 views
Apa itu soft copy by johnthj
Apa itu soft copyApa itu soft copy
Apa itu soft copy
johnthj65.8K views
Why you need to use document scanning management system for business? by Digismartek
Why you need to use document scanning management system for business?Why you need to use document scanning management system for business?
Why you need to use document scanning management system for business?
Digismartek707 views
Scanning & document management by Gautam Ganguly
Scanning & document managementScanning & document management
Scanning & document management
Gautam Ganguly772 views

Similar to PDF vs. TIFF, An Evaluation of Document Scanning File Formats

PDF/a for Dutch Law firms by
PDF/a for Dutch Law firmsPDF/a for Dutch Law firms
PDF/a for Dutch Law firmsDean Sappey
382 views22 slides
Right Solution for PDF by
Right Solution for PDFRight Solution for PDF
Right Solution for PDFEnvision Technology Advisors
689 views18 slides
Different file types by
Different file typesDifferent file types
Different file typesDeftPDF
78 views27 slides
Graphic files by
Graphic filesGraphic files
Graphic filesselinasetzer
372 views6 slides
Graphic files by
Graphic filesGraphic files
Graphic filesselinasetzer
116 views6 slides
WEBINAR PRESENTATION: PDFA - its more than you think by
WEBINAR PRESENTATION: PDFA - its more than you thinkWEBINAR PRESENTATION: PDFA - its more than you think
WEBINAR PRESENTATION: PDFA - its more than you thinkAdlib - The PDF Experts
402 views21 slides

Similar to PDF vs. TIFF, An Evaluation of Document Scanning File Formats(20)

PDF/a for Dutch Law firms by Dean Sappey
PDF/a for Dutch Law firmsPDF/a for Dutch Law firms
PDF/a for Dutch Law firms
Dean Sappey382 views
Different file types by DeftPDF
Different file typesDifferent file types
Different file types
DeftPDF78 views
Presentation1 by f6aim
Presentation1Presentation1
Presentation1
f6aim364 views
An introduction to Portable Document Format by Fiter Kill
An introduction to Portable Document FormatAn introduction to Portable Document Format
An introduction to Portable Document Format
Fiter Kill906 views
What is PDF/A? by DeftPDF
What is PDF/A?What is PDF/A?
What is PDF/A?
DeftPDF135 views
e-Services to Keep Your Digital Files Current by pbajcsy
e-Services to Keep Your Digital Files Currente-Services to Keep Your Digital Files Current
e-Services to Keep Your Digital Files Current
pbajcsy273 views
Document Automation and Integration Webinar For CVision by Chris Riley ☁
Document Automation and Integration Webinar For CVisionDocument Automation and Integration Webinar For CVision
Document Automation and Integration Webinar For CVision
Chris Riley ☁674 views
Beginning an Imaging Program: Achieving Success and Avoiding the Pitfalls – A... by Raymond Cunningham
Beginning an Imaging Program: Achieving Success and Avoiding the Pitfalls – A...Beginning an Imaging Program: Achieving Success and Avoiding the Pitfalls – A...
Beginning an Imaging Program: Achieving Success and Avoiding the Pitfalls – A...
Raymond Cunningham562 views
Digitization of Physical Assets by Daniel Novak
Digitization of Physical AssetsDigitization of Physical Assets
Digitization of Physical Assets
Daniel Novak496 views

More from DocuFi, offering HAI and Infection Prevention Analytics

HAIvia Mobile for Infection Prevention Data Capture and Forms Management (for... by
HAIvia Mobile for Infection Prevention Data Capture and Forms Management (for...HAIvia Mobile for Infection Prevention Data Capture and Forms Management (for...
HAIvia Mobile for Infection Prevention Data Capture and Forms Management (for...DocuFi, offering HAI and Infection Prevention Analytics
394 views27 slides
Automated Data Capture and Extraction with ChronoScan for Automated Metadata ... by
Automated Data Capture and Extraction with ChronoScan for Automated Metadata ...Automated Data Capture and Extraction with ChronoScan for Automated Metadata ...
Automated Data Capture and Extraction with ChronoScan for Automated Metadata ...DocuFi, offering HAI and Infection Prevention Analytics
1.6K views33 slides
Intelligent Data Capture Just Got Better, What's New in ImageRamp 6 by
Intelligent Data Capture Just Got Better, What's New in ImageRamp 6Intelligent Data Capture Just Got Better, What's New in ImageRamp 6
Intelligent Data Capture Just Got Better, What's New in ImageRamp 6DocuFi, offering HAI and Infection Prevention Analytics
483 views26 slides

More from DocuFi, offering HAI and Infection Prevention Analytics(13)

Recently uploaded

Fleet Management Software in India by
Fleet Management Software in India Fleet Management Software in India
Fleet Management Software in India Fleetable
11 views1 slide
360 graden fabriek by
360 graden fabriek360 graden fabriek
360 graden fabriekinfo33492
37 views25 slides
SAP FOR CONTRACT MANUFACTURING.pdf by
SAP FOR CONTRACT MANUFACTURING.pdfSAP FOR CONTRACT MANUFACTURING.pdf
SAP FOR CONTRACT MANUFACTURING.pdfVirendra Rai, PMP
11 views2 slides
Advanced API Mocking Techniques by
Advanced API Mocking TechniquesAdvanced API Mocking Techniques
Advanced API Mocking TechniquesDimpy Adhikary
19 views11 slides
FIMA 2023 Neo4j & FS - Entity Resolution.pptx by
FIMA 2023 Neo4j & FS - Entity Resolution.pptxFIMA 2023 Neo4j & FS - Entity Resolution.pptx
FIMA 2023 Neo4j & FS - Entity Resolution.pptxNeo4j
6 views26 slides
DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -... by
DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -...DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -...
DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -...Deltares
6 views15 slides

Recently uploaded(20)

Fleet Management Software in India by Fleetable
Fleet Management Software in India Fleet Management Software in India
Fleet Management Software in India
Fleetable11 views
360 graden fabriek by info33492
360 graden fabriek360 graden fabriek
360 graden fabriek
info3349237 views
Advanced API Mocking Techniques by Dimpy Adhikary
Advanced API Mocking TechniquesAdvanced API Mocking Techniques
Advanced API Mocking Techniques
Dimpy Adhikary19 views
FIMA 2023 Neo4j & FS - Entity Resolution.pptx by Neo4j
FIMA 2023 Neo4j & FS - Entity Resolution.pptxFIMA 2023 Neo4j & FS - Entity Resolution.pptx
FIMA 2023 Neo4j & FS - Entity Resolution.pptx
Neo4j6 views
DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -... by Deltares
DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -...DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -...
DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -...
Deltares6 views
Software evolution understanding: Automatic extraction of software identifier... by Ra'Fat Al-Msie'deen
Software evolution understanding: Automatic extraction of software identifier...Software evolution understanding: Automatic extraction of software identifier...
Software evolution understanding: Automatic extraction of software identifier...
BushraDBR: An Automatic Approach to Retrieving Duplicate Bug Reports by Ra'Fat Al-Msie'deen
BushraDBR: An Automatic Approach to Retrieving Duplicate Bug ReportsBushraDBR: An Automatic Approach to Retrieving Duplicate Bug Reports
BushraDBR: An Automatic Approach to Retrieving Duplicate Bug Reports
DSD-INT 2023 Thermobaricity in 3D DCSM-FM - taking pressure into account in t... by Deltares
DSD-INT 2023 Thermobaricity in 3D DCSM-FM - taking pressure into account in t...DSD-INT 2023 Thermobaricity in 3D DCSM-FM - taking pressure into account in t...
DSD-INT 2023 Thermobaricity in 3D DCSM-FM - taking pressure into account in t...
Deltares9 views
Dev-HRE-Ops - Addressing the _Last Mile DevOps Challenge_ in Highly Regulated... by TomHalpin9
Dev-HRE-Ops - Addressing the _Last Mile DevOps Challenge_ in Highly Regulated...Dev-HRE-Ops - Addressing the _Last Mile DevOps Challenge_ in Highly Regulated...
Dev-HRE-Ops - Addressing the _Last Mile DevOps Challenge_ in Highly Regulated...
TomHalpin95 views
AI and Ml presentation .pptx by FayazAli87
AI and Ml presentation .pptxAI and Ml presentation .pptx
AI and Ml presentation .pptx
FayazAli8711 views
2023-November-Schneider Electric-Meetup-BCN Admin Group.pptx by animuscrm
2023-November-Schneider Electric-Meetup-BCN Admin Group.pptx2023-November-Schneider Electric-Meetup-BCN Admin Group.pptx
2023-November-Schneider Electric-Meetup-BCN Admin Group.pptx
animuscrm14 views
DSD-INT 2023 3D hydrodynamic modelling of microplastic transport in lakes - J... by Deltares
DSD-INT 2023 3D hydrodynamic modelling of microplastic transport in lakes - J...DSD-INT 2023 3D hydrodynamic modelling of microplastic transport in lakes - J...
DSD-INT 2023 3D hydrodynamic modelling of microplastic transport in lakes - J...
Deltares9 views
Navigating container technology for enhanced security by Niklas Saari by Metosin Oy
Navigating container technology for enhanced security by Niklas SaariNavigating container technology for enhanced security by Niklas Saari
Navigating container technology for enhanced security by Niklas Saari
Metosin Oy13 views
Quality Engineer: A Day in the Life by John Valentino
Quality Engineer: A Day in the LifeQuality Engineer: A Day in the Life
Quality Engineer: A Day in the Life
John Valentino5 views
Copilot Prompting Toolkit_All Resources.pdf by Riccardo Zamana
Copilot Prompting Toolkit_All Resources.pdfCopilot Prompting Toolkit_All Resources.pdf
Copilot Prompting Toolkit_All Resources.pdf
Riccardo Zamana8 views
DSD-INT 2023 The Danube Hazardous Substances Model - Kovacs by Deltares
DSD-INT 2023 The Danube Hazardous Substances Model - KovacsDSD-INT 2023 The Danube Hazardous Substances Model - Kovacs
DSD-INT 2023 The Danube Hazardous Substances Model - Kovacs
Deltares8 views
Myths and Facts About Hospice Care: Busting Common Misconceptions by Care Coordinations
Myths and Facts About Hospice Care: Busting Common MisconceptionsMyths and Facts About Hospice Care: Busting Common Misconceptions
Myths and Facts About Hospice Care: Busting Common Misconceptions

PDF vs. TIFF, An Evaluation of Document Scanning File Formats

  • 1. k at file formats for document sca PDF v. TIFF Copyright ©2014
  • 2. So you’ve decided to implement a document management or search and retrieval system for all your paper documents.
  • 4. And one of them is, “What file format should I use?” PDF JPEG
  • 5. Before you can decide on file format, you have some homework to do.
  • 7. Are the documents… • Office Text Documents • Magazines/Journals • Books • Drawings • Maps • Newspapers • Photographs Graphic-BasedText-Based
  • 8. Are they… Black and White, Bitonal, Grayscale, Color? Stained, torn, aged? Contain Handwritten Notes or Mixed Components?
  • 9. How will I use them… Web: Search, View or Print? Network Search and Retrieve (everyday business use)? Archival (search and retrieval or preservation)?
  • 10. How will my users search for documents?
  • 11. How will my users search for documents? Designated fields such as Invoice No., Customer Name, Date, Patient ID…? or will they need free-form searching on all text?
  • 12. Do I have other considerations? Legal: Admissibility and retention requirements? Retention: How long do to keep the file for the users, legal? Security: Do documents need passwords, restricted usage, changes tracked? Retrieval Limitations: Can my users wait milliseconds, seconds, or minutes? Storage Limitations: How many documents do I have? Is my storage budget limited ? Conversion: Will I need to convert or present the files in another, or multiple formats later.
  • 13. Let’s take a look at PDF v. TIFF, the dominant formats for scanned documents.
  • 14. What is ? (Tagged Image File Format) TIFF • Created by Aldus and Microsoft in 1980’s. Now owned by Adobe. • Developed as a format for scanned images • Most recent version, 6.0 published in 1992 • Universal: Broadly adopted, widely supported by many applications and free viewers, platform independent • Many subtypes representing different compression and color representation schemes Source: National Digital Information Infrastructure and Preservation Program.
  • 15. What is ? TIFF For document scanning purposes, the most notable versions are: • Uncompressed, lossless TIFF-UNC • Compressed, lossless • Often deployed for bitonal or color. • Most effective for solid colors (graphics), and less effective for 24-bit photo TIFF-LZW • Compressed, lossless • Widely deployed in digital libraries and businesses as a master format for bitonal images. TIFF-G4 *Lossless compression discards no information whereas lossy compression allows some degradation in order to achieve smaller file size.
  • 16. What is ? (Portable Document Format) PDF • Created by Adobe over 20 years ago, portions now maintained by ISO • Page-oriented and may contain text, images, graphics, and other multimedia content, such as video and audio • Universal: Broadly adopted, widely supported by many applications and free viewers, platform independent • Many subtypes representing different features • Optionally: hyperlinks, searchable, assistive technology, security features, Source: National Digital Information Infrastructure and Preservation Program.
  • 17. For document scanning purposes, the most notable issues: What is ? Searchable Selecting “make searchable”, “apply OCR”, “text-under- image” or “searchable PDF” from your scanning device options creates a “full-text” searchable file by creating a PDF file with two layers, an image layer and a text layer for full-text searching. PDF
  • 18. For document scanning purposes, the most notable issues: What is ? Archive It differs by omitting features not necessary for long-term archiving, such as font linking. Growing in international government and industry segments, including legal systems, libraries, newspapers, and regulated industries. PDF/A , ISO-standard for digital preservation or archiving of electronic documents. PDF
  • 19. Just a quick note on • Used primarily for photographs • Single page • “Lossy” compression • NOT a “document” scanning JPEG
  • 21. Indexing and Searchability? TIFF TIFF was designed as a “wrapper for images. Can use simple tags only. To be fully searchable, it needs an OCR process to create a separate text file that can then be searched and indexed. Some document indexing software packages include this as an option. Accommodates basic tags and can support more sophisticated XML- based metadata with Adobe's Extensible Metadata Platform (XMP). XMP allows you to embed metadata about a file, into the file itself. Full-text searching option is easily supported and native to the file format so unless it is saved as an “image-only” format, it is fully searchable. PDF
  • 22. TIFF Both TIFF and PDF are universal in that they are common output formats of many applications. They also can be accessed and viewed using many different applications. TIFF files are easily integrated into other applications such as Word and PowerPoint as they are “image” based. Both formats are viewable across most if not all operating systems. Adoption/Portability? PDF
  • 23. Longevity/Archiving? TIFF Because of the widespread adoption and plethora of viewers, TIFF is expected to be a viable file format for some time. Because PDF/A format was designed for long term use and has been adopted by many libraries and government groups, PDF/A is the clear winner for archiving situations. PDF
  • 24. Security? TIFF There are no built-in security features. Users can only be allowed or disallowed access to TIFF files. Sophisticated security options. Includes password protection, permissions and restricted use (view, search, print, cut/copy/paste restrictions), watermarking, and encryption. PDF
  • 25. Before we take a look at file size which impacts storage requirements and upload/download speeds, let’s examine the four things that effect file size.
  • 26. Before we take a look at file size which impacts storage requirements and upload/download speeds, let’s examine the four things that effect file size. 1. Scanning Resolution A 300 dpi scan is much smaller than a 600 dpi scan. 2.Color Space Color and grayscale scans are much larger than black and white scans. 3.Physical Dimensions An 8 ½ by 11 page is much smaller than an 11 x 14, all other things being equal. 4.Compression Raw scans can be compressed for a much smaller size and compression technologies compress different types scanned of documents differently. Reference: Adobe: Acrolaw Blog
  • 27. File Size/Upload and Download Speed? TIFF PDF Both TIFF and PDF offer compression technology. Scan your typical documents with a variety of file compression formats to determine the acceptable file size and upload/download speed for your environment.
  • 28. Color, Grayscale, or Black and White? TIFF PDF As mentioned previously, G4 compression files are often used for black and white or bitonal scans. TIFF-LZW is often used for bitonal or color images and is most effective for solid color graphics and less effective for 24-bit photos. PDF files also offer different compression technologies which present options for color space.
  • 29. Color, Grayscale, or Black and White? TIFF PDF As mentioned previously, G4 compression files are often used for black and white or bitonal scans. TIFF-LZW is often used for bitonal or color images and is most effective for solid color graphics and less effective for 24-bit photos. PDF files also offer different compression technologies which present options for color space.Both TIFF and PDF support color, grayscale, and black and white. Here again, scan your typical documents with a variety of formats to determine the acceptable output. Caution, scanning a black and white text document with a color setting, needlessly creates a large file.
  • 30. TIFF PDF Miscellaneous? Legal Admissibility: Varies by country. Generally both file types can be admissible as long as the appropriate processes are followed for the rules of evidence for the specific jurisdiction.
  • 31. TIFF PDF Miscellaneous? Legal Admissibility: Varies by country. Generally both file types can be admissible as long as the appropriate processes are followed for the rules of evidence for the specific jurisdiction. Conversion: Both TIFF and PDF files can be converted with readily available tools. This may be important if your scanned files are to be used as “master files”. For example, you may need to scan for both archival and web viewing. Because of file size, you may need to copy and convert a large archival file for easy web viewing. Hence the “master file”
  • 33. …maybe both PDF and TIFF as users often have a variety of document types with different requirements.
  • 35. Learn More about Document Imaging and Capture
  • 36. Contact us for more information on: • Intelligent data capture • PDF to TIFF Conversion • How to convert PDF and TIFF Files • More tutorial information on document management • Scanning documents for document management, • How to intelligently capture index data from your scans • Requirements for document management scanning • How to select a document capture or document scanning solution • Using touchscreen scanners such as the Fujitsu ScanSnap as an intelligent capture solution • Batch document scanning solutions • Document Management cost savings • EMR data capture • Batch Indexing solutions • Batch document indexing • Index documents • Create a document index • Document management index • Index from print stream • ECM index • Index ECM By DocuFi 30 years’ experience in the Document Imaging market. Find out more at ImageRamp and www.docufi.com Copyright ©2014 makers of ImageRamp, Document Management Capture Solution
  • 37. Image Credits and References • Todd Anderson neurmadic aesthetic, ”Ding” , http://bit.ly/1egCSkU • Doug Waldron, “Files (85)”, http://bit.ly/1bfciII • Knile Lucy, you have some sorting to do! http://bit.ly/19bSgjFDave Gray • Butterbean man, “Decisions”, http://bit.ly/1iqCVSc • Ben Schumin, SchuminWeb, “Shelves at Archives II”, http://bit.ly/1iqDD1K • Angel Arcones, Freddy The Boy, “Dia 91: Decisiones”, http://bit.ly/1egCSkU • MicroAssist “Apples and Oranges”, http://bit.ly/17KPimb • AJC1, “Checklists”, http://bit.ly/KDCsgO • Russ, russteaches, “2 Big 2 Small”, http://bit.ly/1hODsdL • The U.S. Army,” West Point wins collegiate boxing championship”, http://bit.ly/1g4BAA6 • Aberdeen Proving Ground, “16th pounds 143rd to win Amateur Boxing Tournament”, http://bit.ly/KLxkH4 All images are owned or licensed by DocuFi with acknowledgement given to: Reference /Source Material: • Alternative File Formats for Storing Master Images of Digitisation Projects, National Library of the Netherlands Research & Development Department • Department of Physics, Wake Forest University, • “Sustainability of Digital Formats. Planning for Library of Congress Collectiion” Library of Congress