Sunday 28 August 2011

ETL Overview




ETL (Extract, Transform, Load) is a very important process in data warehousing. It involves the following three major steps:-

Extraction:-
This is the first step in the ETL process and involves extracting data from the source systems. Source systems here refer to relational databases, flat files and many other systems such as ERP (for example SAP) and CRM applications. Extraction therefore means fetching the data from different sources and converting it to a single consistent format so that it is ready for the next stage, which is transform.
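
As a rough illustration of the extraction step described above, here is a minimal Python sketch that reads from a flat file (CSV text) and a relational database (an in-memory SQLite database) and converts both into one common record layout. The table name, column names, sample rows and function names are all made up for this example; real source systems will look different.

```python
import csv
import io
import sqlite3

# Hypothetical flat-file content; a real job would open a file on disk.
DEMO_CSV = "id,name\n1,Alice\n2, Bob\n"


def extract_from_flat_file(text):
    """Read CSV text into a list of dicts in the common layout."""
    reader = csv.DictReader(io.StringIO(text))
    return [
        {"customer_id": int(row["id"]), "name": row["name"].strip(), "source": "flat_file"}
        for row in reader
    ]


def extract_from_database(conn):
    """Read rows from a relational source into the same common layout."""
    cursor = conn.execute("SELECT cust_no, full_name FROM customers ORDER BY cust_no")
    return [
        {"customer_id": cust_no, "name": full_name.strip(), "source": "database"}
        for cust_no, full_name in cursor
    ]


def extract_all(csv_text, conn):
    """Combine both sources into one consistently formatted list."""
    return extract_from_flat_file(csv_text) + extract_from_database(conn)


def build_demo_database():
    """Create a small in-memory database that stands in for a source system."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE customers (cust_no INTEGER, full_name TEXT)")
    conn.executemany("INSERT INTO customers VALUES (?, ?)", [(3, "Carol "), (4, "Dave")])
    conn.commit()
    return conn


if __name__ == "__main__":
    for record in extract_all(DEMO_CSV, build_demo_database()):
        print(record)
```

Note that the two sources use different column names (id/name versus cust_no/full_name); the extract functions map both to the same keys so the transform stage only has to deal with one format.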


Transform:-
This is the second step in ETL, in which a number of business rules are applied to the extracted data so that it can be loaded into the target database in the required format. Transform is often the most time-consuming stage in ETL because it requires a lot of processing and calculation. Some of the transformations normally done in this stage are listed below:
• Joining data from multiple sources
• Aggregation
• Calculating new values from the source data values
• Splitting a column into multiple columns
• Normalization and Denormalization
• Cleaning the source data
• Translating source data as per business rules and requirements
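
A few of the transformations listed above can be sketched in plain Python as follows. This is only an illustrative example under assumed data: the record fields (customer_id, name, quantity, unit_price), the sample rows and the rule "skip orders with an unknown customer" are invented for the demonstration and are not taken from any particular tool.

```python
from collections import defaultdict

# Hypothetical extracted data from two sources.
DEMO_CUSTOMERS = [
    {"customer_id": 1, "name": "Alice Smith"},
    {"customer_id": 2, "name": " Bob Jones "},
]
DEMO_ORDERS = [
    {"customer_id": 1, "quantity": 2, "unit_price": 10.0},
    {"customer_id": 1, "quantity": 1, "unit_price": 5.5},
    {"customer_id": 2, "quantity": 3, "unit_price": 4.0},
    {"customer_id": None, "quantity": 1, "unit_price": 99.0},  # dirty row
]


def transform(orders, customers):
    """Join, clean, calculate, aggregate and split in one pass."""
    # Joining: build a lookup so each order can be matched to a customer.
    names = {c["customer_id"]: c["name"] for c in customers}

    totals = defaultdict(float)
    for order in orders:
        customer_id = order.get("customer_id")
        # Cleaning: skip orders whose customer is missing or unknown.
        if customer_id not in names:
            continue
        # Calculating a new value from source values.
        amount = order["quantity"] * order["unit_price"]
        # Aggregation: sum the amounts per customer.
        totals[customer_id] += amount

    result = []
    for customer_id in sorted(totals):
        # Splitting one column (name) into two columns.
        first_name, _, last_name = names[customer_id].strip().partition(" ")
        result.append(
            {
                "customer_id": customer_id,
                "first_name": first_name,
                "last_name": last_name,
                "total_amount": round(totals[customer_id], 2),
            }
        )
    return result


if __name__ == "__main__":
    for row in transform(DEMO_ORDERS, DEMO_CUSTOMERS):
        print(row)
```

In a real warehouse these rules usually live in an ETL tool or in SQL rather than in hand-written loops, but the kinds of operations are the same.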


Load:-
The load step takes the data from the transform step and loads it into the target, which is usually the data warehouse. Depending on the business requirements, this process may overwrite the existing data or may maintain historical data for analysis purposes. The load frequency also varies, from daily to weekly to monthly, as per the requirements.
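
The difference between overwriting the target and maintaining history can be shown with a small Python sketch using an in-memory SQLite database as a stand-in for the warehouse. The table customer_totals, its columns, the keep_history flag and the load dates are assumptions made up for this example.

```python
import sqlite3


def create_target(conn):
    """Create the hypothetical target table if it does not exist yet."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS customer_totals "
        "(customer_id INTEGER, total_amount REAL, load_date TEXT)"
    )


def load(conn, rows, load_date, keep_history):
    """Load transformed rows; either append (history) or overwrite."""
    if not keep_history:
        # Overwrite mode: clear the existing data before loading.
        conn.execute("DELETE FROM customer_totals")
    conn.executemany(
        "INSERT INTO customer_totals (customer_id, total_amount, load_date) "
        "VALUES (?, ?, ?)",
        [(r["customer_id"], r["total_amount"], load_date) for r in rows],
    )
    conn.commit()


def count_rows(conn):
    """Return the number of rows currently in the target table."""
    return conn.execute("SELECT COUNT(*) FROM customer_totals").fetchone()[0]


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    create_target(conn)
    rows = [
        {"customer_id": 1, "total_amount": 25.5},
        {"customer_id": 2, "total_amount": 12.0},
    ]
    load(conn, rows, "2011-08-27", keep_history=True)
    load(conn, rows, "2011-08-28", keep_history=True)
    print("rows with history kept:", count_rows(conn))
    load(conn, rows, "2011-08-29", keep_history=False)
    print("rows after overwrite:", count_rows(conn))
```

With keep_history=True every run adds a new dated snapshot, which is what makes later trend analysis possible; with keep_history=False the target only ever holds the latest load.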



