Astera’s ReportMiner is a template-based data extraction solution that is widely used by enterprises to simplify and automate their data extraction strategies.
In this course on Extracting Unstructured Data, you will learn about the essentials of data extraction through Astera’s ReportMiner. Starting off with the fundamental concepts of Data and its different types, this interactive course will build your way up to creating report models and extracting data professionally from a variety of unstructured source files.
Whether you have used ReportMiner before or have no experience with it, this course will equip you with the solid foundation of the tool so you can get up to speed with ReportMiner. With hands-on practice, you will be able to expertly navigate through ReportMiner’s user Interface, learn about expert best practices, and solve a number of business-oriented use cases through downloadable source files and prebuilt templates.
Data extraction forms an important part of data management and businesses across the world rely on template-based extraction tools to make the most out of their unstructured data gold mines. Anyone willing to learn the basics of data extraction and to expertly use ReportMiner in the future will find this course helpful. Building your confidence with Astera ReportMiner means enhancing your employability skills portfolio so you can position yourself ahead of others and succeed at your work goals.
-
Module 1: Fundamentals of Data
In this module, you will learn about the key concepts related to data, especially business data. You will learn the difference between different categories of data based on its structure: Structured Data, Semi-Structured Data, and Unstructured Data. After learning about the foundational features of data, you will be introduced to the concept of data extraction which is an important domain of data management.
-
Module 2: Basic Concepts to Learn ReportMiner
This module builds the basic concepts required to start using Astera ReportMiner for extracting data from unstructured sources. You will learn the key terminologies we will use when working with the unstructured documents throughout the course. You will start by learning how to group areas of interest in a file into data regions. Afterwards, you will be acquainted with the concept of matching patterns as a means to capture data regions. Once familiar with data regions, you will learn how individual data points within these regions can be identified as data fields.
-
Module 3: ReportMiner's Interface
In this module, you will be acquainted with the user-friendly interface of Astera ReportMiner. You will get a look at the panels and bars available in the tool and the different options present in them to modify the data during extraction. Once familiar with the UI of the tool, you will know where to navigate and which features to use at each step of creating a flexible extraction template using Astera ReportMiner.
-
Module 4: Importing Source Data in ReportMiner
Astera’s Data extraction tool can extract unstructured data residing in source files of different formats. In this module, you will learn about the file formats supported as source in ReportMiner: PDF, Text, Excel, and Delimited file formats. You will also learn how to work with OCR-scanned files and read files stored at a cloud location.
-
Module 5: Let's Build an Extraction Template
Astera’s Data extraction tool can extract unstructured data residing in source files of different formats. In this module, you will learn about the file formats supported as source in ReportMiner: PDF, Text, Excel, and Delimited file formats. You will also learn how to work with OCR-scanned files and read files stored at a cloud location.
-
Module 6: AGL (Beta)
Auto-Generate Layout (Beta) or AGL is a very useful feature recently added to Astera ReportMiner’s UI. AGL and its two sub-features Auto Create Fields (Single-Instance) and Auto Generate Table let you autogenerate extraction templates and expedite your data extraction tasks. In this module, you will learn how to extract either the complete data from a PDF document using the AGL button or just the tabular data using the Auto Generate Table button. After building your extraction template with a single click, you will see how you can fine-tune it using different features, including Auto Create Fields (Single-Instance).
-
Module 7: Exporting Your Data
After creating a report model to extract data from an unstructured document, the next step is to export the extracted data to a destination. Astera ReportMiner supports various destinations for exporting your data. This module will teach you how to export data to an Excel file, a delimited file or a database. You will also learn how to export data into Astera’s Dataflow artifact for further manipulation and cleaning before writing structured data to a final destination.
-
Module 8: Working with Dataflows
Dataflows are the main artifacts of Astera when it comes to data integration. In this module, you will be introduced to dataflows, their significance, and User Interface. You will learn how you can use your extracted data as source in a dataflow and further transform it by designing ETL pipelines.
-
Module 9: Working with Workflows
Workflows in the Astera suite of products are very useful as they enable automation and repeated execution of tasks in ETL jobs. In this module, you will learn about workflows and important workflow task objects: Decision, OR, File System, Run Dataflow, Run Workflow, Send Mail, etc. You will learn how to create a sequence of workflow tasks that will use and transform your extracted data.
4.00 average based on 1 rating
Reviews
-
fsdfsdf