ETL-PROJECT
PRESENTED BY :-
MONIKA VERMA
ANNUSHRI SHARMA
RATI LODHA
PRESENTED TO:-
SHRADHHA MASIH
WEBLOG DATA
OUTLINE
• Introduction
• Process
• About the project
• Snapshots
• Input data
• Performing validation
• Performing transformation
• Writer
• Output data
ETL:- INTRODUCTION
ETL is short for extract, transform, load:
three database functions that are combined into one tool to
pull data out of one database and place it into another
database.
WHAT IS ETL?
• ETL = Extract – Transform – Load
• Extract
› Get the data from the source system as efficiently as
possible
• Transform
› Perform calculations on the data
• Load
› Load the data into the target storage
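The three steps can be sketched in a few lines of Python. This is only an illustrative sketch, not the tool used in this project: the `sales` and `sales_clean` tables, the sample rows, and the 10% tax calculation are all assumptions made up for the example, and two in-memory SQLite databases stand in for the source and target systems.

```python
import sqlite3

def extract(source):
    """Extract: read every row from the (hypothetical) source table."""
    return source.execute("SELECT name, amount FROM sales").fetchall()

def transform(rows):
    """Transform: tidy the names and add a calculated column.
    The 10% tax rule is an assumption chosen only for illustration."""
    return [(name.strip().title(), amount, round(amount * 1.10, 2))
            for name, amount in rows]

def load(target, rows):
    """Load: write the transformed rows into the target storage."""
    target.execute(
        "CREATE TABLE IF NOT EXISTS sales_clean "
        "(name TEXT, amount REAL, amount_with_tax REAL)"
    )
    target.executemany("INSERT INTO sales_clean VALUES (?, ?, ?)", rows)
    target.commit()

# Source database with some untidy sample data (made up for the example).
source = sqlite3.connect(":memory:")
source.execute("CREATE TABLE sales (name TEXT, amount REAL)")
source.executemany("INSERT INTO sales VALUES (?, ?)",
                   [("  alice ", 100.0), ("BOB", 50.0)])

# Target database that receives the cleaned rows.
target = sqlite3.connect(":memory:")
load(target, transform(extract(source)))

result = target.execute("SELECT * FROM sales_clean ORDER BY name").fetchall()
print(result)
```

Keeping each phase in its own function mirrors the Extract, Transform, Load split: any one phase can be changed without touching the other two.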
PROCESS
ABOUT THE PROJECT:-
• The project is based on WEB-LOG DATA
• LOG FILE - A log file is a file that records
either events that occur while an operating system or
other software runs, or messages between different
users of communication software.
• Logging is the act of keeping a log. In the simplest
case, messages are written to a single log file.
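The simplest case, writing messages to a single log file, might look like the following in Python. It is a small sketch using the standard `logging` module; the file name `app.log`, the logger name `demo`, and the two messages are assumptions made up for the example.

```python
import logging
import os
import tempfile

# Hypothetical log file location, created in a temporary folder.
log_path = os.path.join(tempfile.mkdtemp(), "app.log")

logger = logging.getLogger("demo")
logger.setLevel(logging.INFO)

# One handler, so every message goes to the same single file.
handler = logging.FileHandler(log_path)
handler.setFormatter(logging.Formatter("%(asctime)s %(levelname)s %(message)s"))
logger.addHandler(handler)

logger.info("user logged in")
logger.warning("disk space low")

# Detach and close the handler so the file is flushed to disk.
logger.removeHandler(handler)
handler.close()

with open(log_path) as f:
    lines = f.read().splitlines()

for line in lines:
    print(line)
```

Each call adds one time-stamped line to the file, which is the kind of record an ETL tool later reads as its input.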
ABOUT THE PROJECT:-
• We were motivated to take up this project as our initiative to
learn the ETL process. It covers cleaning a system's
Internet usage log file with the ETL tool
provided to us.
• We went through all phases of the Advanced ETL
Processor tool and generated our cleaned and
formatted data in a spreadsheet as our output.
• We applied multiple built-in functions to our
weblog data.
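The same validate, transform and write flow can be sketched in Python. This is a hedged illustration and not the Advanced ETL Processor's own functions: the slides do not state the log layout, so the Common Log Format pattern, the sample lines, and the column names below are all assumptions. CSV is used as the spreadsheet-friendly output.

```python
import csv
import io
import re
from datetime import datetime

# Assumed layout: Common Log Format (the real project file may differ).
LOG_PATTERN = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>[A-Z]+) (?P<url>\S+) [^"]*" (?P<status>\d{3}) (?P<size>\d+|-)$'
)
FIELDS = ["ip", "timestamp", "method", "url", "status", "bytes"]

def validate(line):
    """Validation: return the parsed fields, or None for a malformed line."""
    match = LOG_PATTERN.match(line.strip())
    return match.groupdict() if match else None

def transform(rec):
    """Transformation: normalise the timestamp and convert numeric fields."""
    ts = datetime.strptime(rec["time"], "%d/%b/%Y:%H:%M:%S %z")
    return {
        "ip": rec["ip"],
        "timestamp": ts.isoformat(),
        "method": rec["method"],
        "url": rec["url"],
        "status": int(rec["status"]),
        "bytes": 0 if rec["size"] == "-" else int(rec["size"]),
    }

def write(records, out):
    """Writer: emit the cleaned records as CSV with a header row."""
    writer = csv.DictWriter(out, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(records)

# Made-up sample input; the second line is deliberately corrupted.
raw_lines = [
    '192.168.1.5 - - [10/Oct/2015:13:55:36 +0530] "GET /index.html HTTP/1.1" 200 2326',
    'this line is corrupted',
    '192.168.1.9 - - [10/Oct/2015:13:56:01 +0530] "GET /missing.png HTTP/1.1" 404 -',
]

cleaned = [transform(r) for r in map(validate, raw_lines) if r is not None]
out = io.StringIO()
write(cleaned, out)
print(out.getvalue())
```

Lines that fail validation are dropped before transformation, so only well-formed records reach the writer.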
Input log file
Starting the project with the different ETL phases:-
Performing Validation :
Performing transformation :
Writing the data through the writer
Writer
OUTPUT OF CLEAN DATA
Thanks!!!!!
