2. www.dclab.com 2
Valuable Content Transformed
• Document Digitization
• XML and HTML Conversion
• eBook Production
• Hosted Solutions
• Big Data Automation
• Conversion Management
• Editorial Services
• Harmonizer
3. www.dclab.com 3
Experience the DCL Difference
DCL blends years of conversion experience with cutting-edge technology and the
infrastructure to make the process easy and efficient.
• World-Class Services
• Leading-Edge Technology
• Unparalleled Infrastructure
• US-Based Management
• Complex-Content Expertise
• 24/7 Online Project Tracking
• Automated Quality Control
• Global Capabilities
5. www.dclab.com 5
. . . Spanning All Industries
• Aerospace
• Associations
• Defense
• Distribution
• Education
• Financial
• Government
• Libraries
• Life Sciences
• Manufacturing
• Medical
• Museums
• Periodicals
• Professional
• Publishing
• Reference
• Research
• Societies
• Software
• STM
• Technology
• Telecommunications
• Universities
• Utilities
6. www.dclab.com
• Based in Houston, Texas
• Founded in 1984, CGM Experts for 30 years
• Develops & markets graphics software tools , based on
open standards
• Promoting CGM Usage, VizEx Reader a free CGM and TIFF
viewer
• Member of the CGM Open Foundation
• Larson products used in Technical Publishing by
Aerospace, Defense and Heavy Manufacturing.
• Used to create, maintain and deploy technical graphics
• More information available at www.cgmlarson.com
6
7. www.dclab.com
Webinar Objectives
• Provide lessons learned when transforming technical
graphics
• Focus on common graphic source formats
– Raster and Vector data
• Describe the data preparation considerations and
techniques to achieve the best results
• Highlight the pitfalls of graphic conversion
• Provide Tips, Benefits, Advice and Knowledge during the
webinar
• Summarize the graphic conversion and transformation
process
7
8. www.dclab.com
Introduction
• The reuse of technical graphics produced in
varying formats can be a significant challenge
during the publications production process
• Different file formats can present different issues
and resolutions
• Quality is a major consideration for the source
data and output requirement for publishing
• Using ‘Best Practice’ will reduce production time
and costs
8
9. www.dclab.com
Today’s Technical Graphics Process
• Technical graphics are usually originated, generated or
revised in a electronic format
– Can also be legacy raster, a scan of a hard copy illustrations
• The most likely starting point for a new
technical graphic today is from
a CAD system
– This could be a 2D or 3D CAD source data
• Spare part illustrations
• Schematics
• Wiring Diagrams
• Drawing from scratch – could be quicker
– Dependent of availability and suitability of CAD data
– Create illustration using a photograph as a template
9
KNOWLEDGE:
Computer Aided Design is
mainly used for detailed
engineering of 3D models and/or
2D drawings of physical
components, but it is also used
throughout the engineering
process from conceptual design
and layout of products.
10. www.dclab.com
Possible Data Sources
• Raster file
– Legacy format
• Scanned line art illustrations
– Intermediate format
• Exported from 3rd party software, CAD System?
• Photographs a good example
• Vector graphics
– 2D or 3D CAD system
• Many formats are possible and used
– Adobe Illustrator, CorelDRAW
– PDF
• Embedded vector and raster data
10
KNOWLEDGE:
The word Raster originates from the
term 'raster scan,' which was how
old CRT monitors displayed images,
by magnetically steering a
concentrated electron beam line by
line to form an image.
KNOWLEDGE:
Vector graphics is the use
of geometrical primitives
such as points, lines,
curves, and shapes or
polygons—all of which are
based on mathematical
expressions—to
represent images in
computer graphics.
11. www.dclab.com
Poll #1
1. What source file formats do you use to generate
your technical graphics?
Raster – Scans of original illustrations
Raster – Intermediate format, raster
from CAD, photographic
Vector – Adobe Illustrator,
PDF or other graphics format
Vector – CAD 2D
Vector – CAD 3D
11
13. www.dclab.com
Introduction – Raster Data
• Raster Data still plays a significant role in the technical
publications process
• Source – e.g. photographs
• Original – e.g. scanned line art
• Delivery – e.g. TIFF files
• Analysis of raster data can prove to be beneficial
• Allowing the optimization of the production process
• Raster Data may require enhancements with the addition
of hotspots
• We will investigate the possible pitfalls of this process (OCR)
• Raster Data as an intermediate format
• Correct decision at the time but now alternatives could exist?
• Photographic files, intermediate and delivery
13
14. www.dclab.com
Hard Copy Scan – Raster Data
• Legacy Raster Data is usually the result of a hard
copy scan
• If the original no longer exists little
can be done to improve quality
• Quality
• Dependent on scan resolution
• What generation?
14
15. www.dclab.com
Clean Up – Raster Data
• Quality of the legacy scan - Considerations?
– It is unlikely you will be able to improve the quality of a
scan apart from a general clean up
• TIP: clean up scans using the well known raster editors,
for example Adobe Photoshop & Corel Photo-Paint
– The scans sometimes contain sporadic pixels (Noise) not
part of the main image
• The pixels can be removed using software functions, for example
– Corel Photo Paint
» Remove Noise
– Adobe Photoshop
» Despeckle
» Dust & Scratches
15
16. www.dclab.com
File Formats – Raster Data
• Raster File Formats
• TIFF is the most common raster format for line art scans, usually
using a compression algorithm to reduce the file size
• Other raster formats include:
– JPEG (Joint Photographic Experts Group)
– BMP (Bitmap Image File)
– PNG (Portable Network Graphic)
– PCX (PiCture eXchange)
– GIF (Graphics Interchange Format)
• TIP: Specifications compliance to
S1000D etc.
• Save the TIFF as part of a CGM file
• It will retain its quality and properties
• It can also be combined with Vector data
16
KNOWLEDGE - TIFF
(Tagged Image File
Format)
Industry-standard method for
storing scanned and
other digital black-and-white,
grayscale, and full-
color images. Developed mainl
y for desktop publishing, TIFF
has found applications in
several other areas such
as computer aided design
(CAD), fax transmission. TIFF
saves detailed image informati
on with each pixel, resulting in
a relatively large data file.
17. www.dclab.com
Properties - Raster Legacy
• DPI (Dots Per Inch) better known as image
resolution
• The DPI is a major factor when determining
the scan quality (200-300 DPI a good baseline)
• TIP: Right click the file and select Properties
– The smaller the number the lower the quality
– The publishing of the illustration is a major
factor when considering if the DPI is
acceptable
• Higher resolution to maintain quality for print
• The display on a laptop screen for example the
DPI can be lower but if the viewing tool can
zoom the user will see deterioration as they
zoom-in
17
19. www.dclab.com
Analysis – Raster Legacy
• TIP: Evaluate manually re-drawing as a vector
illustration
– Assess the longevity of the graphic and its future use
• The cost versus use
– Does the legacy illustration now exist in a CAD
format
• If yes, could be repurposed
19
20. www.dclab.com
Auto-trace – Raster Legacy
• An alternative to manual re-draw is
Auto-Trace
• The two major mass market graphics
programs both have the facility
• CorelDRAW - Centerline Trace
• Adobe Illustrator – Image Trace
• TIP: Perform tests on typical files
before committing to this strategy
• It could be quicker to re-draw
than clean-up the trace
• ADVICE: consider the
longevity of the file how
many more revisions could
take place
20
21. www.dclab.com
Transforming – Raster Data
• Resizing, rotating etc. impacts the quality of the image
• The technical term is Interpolation
• In basic terms the pixels have to be remapped when
transformed
• ADVICE: this can effect the quality of your image
• TIP: Artifact’s of interpolation can be used to improve
the appearance
21
HaloBlurringAnti-aliasing
22. www.dclab.com
Intermediate Raster Data
• Raster data can also originate from 3rd party software
• Example's, CAD, Graphics software etc.
• The possible reasons for using raster export:
• The limitation's of the export options
• The limitation's of the import options of target publishing
software
• Historical – we’ve always done it that way?
• ADVICE:
• Validate the export settings being used
– Is it possible to increase the DPI of the export?
• Is a vector option available?
22
23. www.dclab.com
Intermediate Raster Data - 2
• Photographic data has always played an important
role in publications
• Predominantly used as a maintenance type illustration
• It can be used directly and annotated appropriately
• Alternative, the photograph can be used as a template
• Traced to create line art vector version (Automotive)
• BENEFIT: Relevant information can be highlighted and be
distinct
23
25. www.dclab.com
OCR (Optical Character Recognition)
• The BIG question why OCR a raster technical illustration?
• Primary reason, illustration requires hotspot regions
• OCR is a very good way of achieving a defined region
• Dependency on quality of raster text
• Spare Part illustrations are more likely to have hotspots
• ADVICE: Assess suitability for OCR – is it capable of graphic
hot spotting?
• TIP: Data Analysis Steps:
• Determine if characters sufficiently formed to make OCR viable?
• Best Practice: Test a generic subset of data to determine viability
• Use test results to select the best process
25
26. www.dclab.com
OCR (Optical Character Recognition) 2
• Test Passed:
– Convert the whole batch of files using OCR software
• Test Fails:
– There are probably 2 options:
1. Regenerate the required text for the graphic and
include hotspot information
2. Manually Hotspot the raster Data (Overlay)
26
29. www.dclab.com
Introduction - Vector Data
• Vector Data can be in varied formats
• In the majority of cases it is preferable to Raster Data, mainly for
quality reasons
• BENEFIT: Vector Data can be scaled and the quality is
maintained
• True both for hard copy and on a computer display
• The display of a vector file, enhanced display by using anti-
aliasing
• This ability will be software dependent (Viewer)
• Vector Data Sources, the most common:
• CAD Systems
• General Graphics Software
• PDF
• Technical Writing software
29
KNOWLEDGE:
Anti-Aliasing (in computer graphics) a
technique used to add greater realism to
a digital image by smoothing jagged
edges on curved lines and diagonals.
31. www.dclab.com
Poll #3
• What CAD System/s does your company use?
Autodesk
Catia
PTC Creo Parametric (ProE)
Siemens NX - Unigraphics
Solidworks
31
32. www.dclab.com
CAD File Formats – Vector Data
• CAD File Formats
– 3D Data
• CAD Native files
– DWG native file format for AutoCAD data files
• VRML (Virtual Reality Modeling Language)
• IGES
• STEP (Standard for the Exchange of
Product model data)
– ISO 10303 Standard
– 2D Data
• DXF
• DWG
• CGM
• PDF
32
KNOWLEDGE:
Initial Graphics
Exchange
Specification
(IGES)
is a vendor-
neutral file
format that
allows the digital
exchange of
information
among computer
-aided
design (CAD)
systems.
KNOWLEDGE:
Data Exchange File
(DXF)
The format is a tagged
data representation of
all the information
contained in an
AutoCAD® drawing file.
34. www.dclab.com
CAD Data/Takeaways – Vector Data
• Considerations when using CAD files
– File sizes
• 3D files sizes can be challenging
– Select data exchange file formats carefully
• Converted CAD files can contain polylines that increase file
size
• Conversion to a 2D file format, for example CGM will provide
a more suitable file for publishing purposes
– Removal of Extraneous Information
• Layers
• Legends
• Text
– The process needs to be automatic to be cost effective
34
35. www.dclab.com
CAD Data/Takeaways – Vector Data - 2
• Editing
– Polylines can make the file difficult to edit (not Bezier
curves)
• Text can also be sometimes be converted to vector lines
– Text mapping can be an issue
• Font substitution
• Character positioning
• ADVICE: Make sure text does not obscure the
illustration
– Wiring diagrams can be challenging
• Text in tight areas
35
36. www.dclab.com
General Graphics Software – Vector Data
• General Mass Market Graphics Software
– 2D Data
• Adobe Illustrator
• CorelDRAW X7
• Commercial Industry requirements for graphic publishing
are usually satisfied common formats:
• AI (Adobe Illustrator)
• PDF (Portable Document Format)
• DWG
• SVG (Scalable Vector Graphic)
• Specification driven industries may well require the CGM
format e.g. S1000D and ATA
– TIP: CorelDraw X7 does support CGM but not the profiles
required for specification compliance
36
37. www.dclab.com
Graphics Software Takeaways – Vector Data
• Very good choice of export formats
– PDF is now an open format so good potential to use as archive
– SVG another open format
• ADVICE: SVG probably not a good choice for an archive
format
• File sizes
– Very good, usually small and easy to handle
• Good compatibility with publishing software
• ADVICE:
– Do not support the CGM format for compliance with industry
specifications
– Not capable of adding hotspot information fro use in Interactive
Electronic Technical Manuals (IETM’s)
• TIP: To be specification compliant save as a TIFF, not ideal
37
39. www.dclab.com
Specialist Software Takeaways – Vector Data
• Used in all areas of manufacturing
– Including specification's driven Aerospace and Defense
industries
– Support for CGM is a given
• Cost generally higher than general graphics software
– Specific functionality will earn pay back
• Very good choice of import and export formats
• Very good handling of both 2D and 3D files
• Good compatibility with publishing software
– Especially electronic delivery for IETM’s
39
40. www.dclab.com
PDF Format – Vector Data
• PDF (Portable Document Format)
• In 2008 PDF became an ISO standard
– ISO International Standard - ISO 32000-1
• All new releases of the format are governed by ISO
• Interoperability was always the mission for PDF, ISO
governance can only strengthen
this objective
40
KNOWLEDGE:
Portable Document
Format (PDF)
A file format used to
present and exchange
documents reliably,
independent of software,
hardware, or operating
system.
KNOWLEDGE:
ISO (International Organization for
Standardization)
is an independent, non-governmental membership
organization and the world's largest developer of
voluntary International Standards.
41. www.dclab.com
PDF Format – Vector Data 2
• PDF was based on PostScript another Adobe
technology
– Therefore it does support graphic primitives
– Lines, Bezier curves etc.
– Can also contain raster data
• BENEFIT: You can open a PDF file in different graphics
programs and edit them
• ADVICE: Not all PDF files are creating equal, it is
possible older PDF files will not have a vector structure
• TIP: PDF can be a good export option from a CAD
system
41
43. www.dclab.com
Technical Writing Software – Vector Data
• Technical Writing Software can contain
graphics
– Framemaker
– Arbortext Editor
– oXygen
– ETC.
• There are methods of retrieving the data
– The method will vary depending on software
– Raster or Vector Data
43
44. www.dclab.com
CGM Supports Raster & Vector Data
• Another good 2D format to consider is CGM
• CGM (Computer Graphics Metafile)
– An Open Non-Proprietary format
– Actively governed and developed by the CGM Open
Foundation www.cgmopn.org
– Ability to contain both Raster and Vector Data and retain
attributes
– Specifications compliant, S1000D and ATA
– Interoperable
– Can contain metadata, for example can contain hotspot
information
• TIP: Specialist software the best option to maintain
CGM files
44
45. www.dclab.com
Summary
• There are many opportunities to source information and
produce technical graphics
• The number of source and publishing formats are diverse
– Selecting the appropriate format and software can be a
challenge
• The primary considerations should be, quality, re-use and
publishing
– Ideally choose a format that answers all of these criteria
– Software should be capable, a combination of solutions may
be required
• We hope todays webinar has provided information that
you found interesting and above all useful
45
46. www.dclab.com
Q&A
46
Mark Gross
President & CEO,
Data Conversion Laboratory
(718) 307-5711
MarkGross@dclab.com
@dclaboratory
Don Larson
CEO
Larson Software Technology
713-977-4177
don.larson@cgmlarson.com
www.cgmlarson.com
@cgmlarson