The DataBee Set Extractor application uses an extraction set built by the Set Designer to extract the relevant rows from the source Oracle database. The ROWIDs of the extracted rows are stored in a temporary table within the source schema (or proxy schema). The Set Loader application accesses these extracted ROWIDs and loads the target schema in the destination Oracle database.
The Set Extractor is one component in a suite of applications used to accomplish the task of building a smaller Oracle database from the contents of a larger one. For a step-by-step guide to the DataBee software and the tools and processes involved, please see the DataBee Quick Start Guide. If you are unsure about what an extraction set is and why you might need one, please see the useful
extraction set overview help page.
The Set Extractor Application
As with all DataBee applications, the Set Extractor presents its global functions (loading, saving, exiting etc.) as a
column of buttons in the right hand side of the application window. The tabs on the top left are used to provide areas of major functionality where the currently open extraction set can be used to extract rows from the source Oracle database. Along the bottom of the window is a status line area where the Set Extractor application provides information about its current state.
Command Line Options and Batch Mode
The Set Extractor application can be used in a "batch" mode to automate the extraction of data from a source schema. Please see the detailed discussion on the Set Extractor Command Line Arguments help page.
The Global Functions
- The Bee Logo Button
- The Bee Logo button brings up an About DataBee form which provides information on the Set Extractor version and support contact information. The About DataBee form also contains the license key details and provides information on new upgrades which have been issued subsequent to this version.
- Open Extraction Set
- The Open Extraction Set button reads in an extraction set and configures the Set Extractor to use it. Extraction sets contain the rules that govern which rows are extracted from the source schema. Extraction sets are built by the Set Designer application.
- Clear Extraction Set
- The Clear Extraction Set button clears the existing extraction set information from the Set Extractor application.
- Run Extraction Set
- This is the button that does the work! Once the extraction set has been opened, press this button to extract the ROWIDs in the source source Oracle schema (or schemas) defined in the Rule Controller rules. At the end of the extraction process, the ROWIDs which identify the rows which need to be copied to the target subset schema will have been identified and stored in a temporary table in the source schema (or proxy).
There are three stages to the extraction process. In the first stage the Set Extractor application will acquire all of the ROWIDs to support the Where Clause and All Rows rules defined in the extraction set as well as executing any user defined stage one Command rules. Once the Where Clause and All Rows rules have supplied the initial conditions for the extract, the Set Extractor application will iterate repeatedly through the Table-To-Table rules in stage two and make sure every table has the required rows extracted for it. Stage three, which is rarely used, contains only user defined Command rules which can perform post extraction actions.
- Stop Extraction Set Run
- This button will stop any extracts which are currently in progress. Extracts are completely restartable so you can stop and restart them as required. The Stop Extraction Set Run button can be pressed several times to request increasingly strict levels of halt signal.
- The Restart button re-activates the current extraction set and restarts execution from the place it last stopped. Note that the restart process uses the information contained in the DataBee temporary tables located in the source schema and in order for the extraction set to successfully restart this information must be present.
- Quick Read and Restart
- When developing a set of extraction rules a repeated cycle of "Add a Rule", "Run the Extract", "Add Another Rule" usually takes place. For larger extracts, a complete re-extraction to test a newly added rule can mean a considerable delay. Depending on the type of changes made to the rules it is not always necessary to
completely re-extract. If no rules have been deleted or disabled it is possible for the new rules to be reloaded and used with the currently extracted information. This continuation feature simulates the effect of a complete re-extract
and dramatically speeds up the development process.
- Quick Start Guide
- The Quick Start Guide button launches a help page which will guide you through the process of creating extraction and loader sets for your own databases.
- A button which launches context sensitive help on the Set Extractor Application (this file).
- Closes the Set Extractor application.
- Rules in Set
- This tab displays the rules contained in the currently opened extraction set. The rules cannot be edited here
(that is the function of the Set Designer application) but they can be disabled if required. Please see the specific Rules in Set tab help page for detailed information on this topic.
- Table Statistics
- The Set Extractor application keeps an accurate count as the rows are extracted from the source Oracle
database. This tab shows a continuously updated display of the number of unique rows extracted for each table. Please see the specific Table Statistics tab help page for detailed information on this topic.
- Rule Statistics
- As the Set Extractor iterates through the rules in the extraction set, it acquires the appropriate
rows from the source Oracle database tables to support those rules. The Rule Statistics tab shows a continuously updated count of the number of rows each rule extracted and the rate of extraction. Please see the specific Rule Statistics tab help page for detailed information on this topic.
- Run Statistics
- The Set Extractor is multi-threaded and can execute multiple extraction rules in parallel. The Run Statistics tab provides information on the rules which are currently running as well as summary statistics for the entire extraction set run. Please see the specific Run Statistics tab help page for detailed information on this topic.
- Misc. Setup
- This tab provides a variety of configuration options for extraction sets. Please see the specific
Misc. Setup tab help page for detailed information on this topic.