• Bulk process
• Long running process
• Mostly sequential
• Once, daily, monthly, yearly, ...
REQUIREMENTS OF A BATCH PROCESS
Large data volume—Batch applications must be able to handle
large volumes of data to import, export, or compute.
Automation—Batch applications must run without user
interaction except for serious problem resolution.
Robustness—Batch applications must handle invalid data
without crashing or
Reliability—Batch applications must keep track of what goes
wrong and when(logging, notification).
Performance—Batch applications must perform well to finish
processing in a dedicated time window or to avoid disturbing any
other applications running simultaneously.
HOW BATCH APPLICATIONS FIT IN TODAY’S
• Every day, large and complex calculations take place to index billions of documents,
using cutting-edge algorithms like MapReduce.
• Even in our event- and notification-driven world, batch applications are still
Batch processing is also used for efficient bulk database updates and automated
transaction processing, as contrasted to interactive online transaction processing
(OLTP) applications. The extract, transform, load (ETL) step in populating data
warehouses is inherently a batch process in most implementations.
Batch processing may also used for converting computer files from one format to
another. For example a batch job may convert proprietary and legacy files to
common standard formats for end-user queries and display.