In this section: |
How to: |
Reference: |
A source allows you to map inbound data to PMF elements, called datapoints, that will become elements in measures, such as Actual, Target, and so on.
There are currently three types of sources available:
All loaded data from a loadable, user-entered, or generated source can be wiped out or deleted in a single operation. This is useful when you have loaded data that is invalid. It is a simple way to delete all the data.
Note: It may take PMF a moment to purge all of the data.
You can view the lineage for all datapoints for any loadable, user-entered, or generated source by selecting the Lineage tab in the New Source or Edit Source panel. Lineage shows the progress of data through PMF, starting from the external data harvested into datapoints, through any derived datapoints, and finally, all terminal points in measures.
The following image shows an example of the Lineage tab for a loadable source.
The lineage tab automatically displays the entire lineage. You can click the Collapse All button to hide the entire lineage.
PMF keeps track of each load that is executed for each source in the system, regardless of whether you loaded it manually or the load was called by the scheduler. This data is stored in a special logging section of the PMF data mart.
The History tab on each loadable, user entered, or generated source displays the history of all loads that have been logged, as shown in the following image.
The history of the source shows:
The total count of mismatches for generated sources should always be zero.
The total count of gaps in data discontinuity for generated sources should always be zero.
Errors are unlikely for generated sources.
How to: |
Reference: |
Loadable sources in PMF:
Data harvesting is the process of taking data from a source and processing it into information that is then loaded into one or more datapoints needed by the PMF source.
Note: The result of any data harvesting operation is always numeric.
Typical data harvesting actions are:
In PMF, all data harvesting is done across a potential Cartesian cross product of dimensional intersection. Nearly every load involves a fairly high degree of sparsity against this cross product, but in most cases, it also involves creating multiple records that follow a particular pattern against the source data.
In many cases, depending on the degree of granularity of the inbound loadable data from the source, PMF requests the data to be aggregated, even when loaded at the lowest level of its dimensionality. This happens when the physical data table or view contains more detailed records that go to a lower logical level than the source requests. Some examples of this include the following:
In the examples listed, the PMF loader will be aggregating records from the source inbound during the load process.
Loadable Sources are primary sources for data in PMF. PMF treats them as first generation in any lineage, along with user-entered datapoints and generated datapoints.
Loadable data is treated as updated on the date of load, but is effective as of the time dimension linkages for the data.
To set up a new Loadable source:
The New Source panel opens.
The File Picker allows you to look at all of the available metadata files in your currently configured WebFOCUS app path. The icons to the left of each file name let you explore the contents of the file to make sure that it is the correct one. To select the file to use, click the name once and close the File Picker.
When switching the datapoint type to Count, set the format to D20.
When switching the datapoint type to Distinct Count, set the format to D20.
If switching to User Defined from a datapoint type of Count or Distinct Count, change the format to D20.2.
Specify filters to be used by clicking the Filters button and specify the WebFOCUS code-based formula for harvesting the field by clicking the Code button.
PMF creates the following template code:
SRC_DATA_COLnn/D20.2 MISSING ON = MISSING ;
where:
nn is the one up datapoint number within the Source. The left side of the code should not be changed. The right side must be syntactically correct FOCUS code.
WHERE (M_NAME EQ 'COGS')
Your datapoints should look similar to the following image.
Note: You can save your work-in-progress at any time. PMF will not be able to actually harvest data into the datapoints that were set up until the needed harvesting details and dimensionality have been specified.
Note: The fields you select for dimensional linkages must contain values that match up to those you loaded for the dimension keys in the Dimension Loader. PMF will alert you if there is an issue.
To update a loadable source:
If the name of the Master File or other physical connection information used by a loadable source changes, you can adapt those changes into the source in PMF.
In this section: |
How to: |
Reference: |
User-entered sources in PMF:
User-entered sources let you collect data from groups of end users.
The New Source panel opens.
For example, if you are collecting input from HR staff for HR metrics, you could create a user-entered source called HR Input, and in that source, you could define the datapoints to be collected from that group.
For each datapoint you define, you can also define the numeric validation format. The PMF input facility will enforce that data format during collection, as shown in the following image.
Note: You can save your work and leave the session at any time. If all the steps are not completed, the Source panel will mark this source as incomplete. End users will not be able to enter data until the source is complete, and incomplete components do not participate in recalculation.
If data entry is already displayed, click the Refresh button to refresh the contents in the Enter Data tab.
In this tab, PMF displays the rows that are ready to receive data from input and allows you to enter data.
Click the Save button within the Data Entry tab to save the data you entered into the individual datapoints on the Data Entry tab.
User-entered data differs from standard loaded data in the following ways:
As a result, you need to schedule recalculation more frequently.
Note: User-entered data is treated as updated on the date of entry, and the downstream datapoints and measure copies are treated as loaded on the day they were scheduled to update.
You can update user-entered sources at any time. PMF adjusts existing user-entered data for the datapoints in a user-entered source as follows:
In this section: |
How to: |
Reference: |
Generated sources in PMF enable you to:
Generated sources enable PMF to automatically create sample data for your models. They should be used in the following situations:
Note: Data that is generated should not be treated as real performance data. PMF cannot distinguish between generated and performance data, so use generated sources only for non-production work.
The New Source panel opens.
If you choose to specify the rules that PMF should use to generate data, the following options are available:
Specifies the decimal format of the data generated:
Examples of typical decimal formats are: D12.2, I8, D20.6, and I32.
Controls how PMF will calculate the sample values:
The lowest and highest number for the range of possible values PMF will generate. The numbers will be formatted using the mask you entered in the Decimal Format field.
Note: You can save your work and leave the session at any time. If minimum necessary entries are not set up to generate data, PMF will mark the generated source as Incomplete. Incomplete sources and their datapoints do not participate in recalculation. You need to complete the source to allow its datapoints to participate, or you will not be able to get the data published using measures.
Note: If you are setting up a trained generated source, you do not have to specify dimension links because they will be inherited from the source.
The following option is available:
Controls the amount of data PMF generates by letting you focus the data on dimensional choices:
You can specify any datapoint to train from a loadable datapoint, user-entered datapoint, derived datapoint, or another generated datapoint.
Tip: When generating random data for generated datapoints, PMF will wipe all existing data before regenerating it using your new rules. Generally, with generated datapoints, the Preview tab is useful for new data you are planning to generate into an empty datapoint, or when changing rules. If your rules have not changed, and data is showing up on Preview as 100% added and deleted, you should not regenerate data.
To update a generated source:
Generated sources are primary sources for data in PMF. As such, PMF treat these source types as first generation in any lineage, along with loadable and user-entered datapoints.
Generated sources are used when you lack real data to prove that your model works, or they are needed for a demonstration. Once you are ready to use a working model with real data, generated sources are no longer necessary to feed your metrics.
To promote the generated source:
You can preview the data that PMF will generate by clicking the Preview tab.
The Preview tab generally shows rows that are new, or will be updated or deleted.
How to: |
The Load Now panel lets you automatically refresh the entire PMF cube from Source data. It runs the following:
Note: If any Dimensions or Sources are incomplete, the Load Now operation will fail. But this is one way to quickly show you what Dimensions, Sources, Derived Datapoints, or Metrics you still need to complete.
To perform a Load Now operation:
The Load Now panel opens.
A confirmation dialog box opens.
|
WebFOCUS |