Import and Edit of Observed Data
Last updated
Last updated
A generic tool for handling of observed data within the Open Systems Pharmacology Suite is the formerly known PKExcelImporter. It is used in both applications (PK-Sim® and MoBi®) for importing observed data from e.g. Microsoft Excel® files with following prerequisites:
A file contains one or several sheets with data tables.
Column headers are in the first non-empty row.
Units are given in the second header row or as part of the column header (at the end) enclosed in brackets (For example Time [h] would be interpreted as column name Time [h] and unit h).
LLOQ values (= values below Lower Limit Of Quantification) must be preceded by a "<", e.g. "<0.2", where 0.2 is the LLOQ value. In case of different LLOQ values in one Observed Data vector the largest of those LLOQ values is used as LLOQ value.
The LLOQ value is stored at the data column and is not editable. All LLOQ values are stored as LLOQ/2 (= 0.1 in the example) to display them in the middle of the unknown range 0, LLOQ. In charts for such data a dotted line y=LLOQ is shown.
File Selection Dialog
In the Parameter Identification those LLOQ values can be handled differently (see ).
To import data you should do the following:
Select the input file (see ).
Specify the column mapping (see ).
You can continue importing data sheets/data files by adding or changing the column mapping or selecting another input file.
Enter all required meta data and set unit information.
Complete the transfer of the imported data sheets to the calling application by confirming your settings.
Click on the Observed Data button to start the import component and specify the the excel file to be imported.
Both excel file formats (xls and xlsx) are supported and it is not required to have Microsoft Excel® installed on your computer.
After selection of the file to be imported, a split window appears (see screenshot below). The left hand side shows a preview of the imported data file using the current mapping, each data table can be found in a separate tab. The right hand side window displays the mapping of imported column identifiers with the predefined data types. This mapping is performed automatically upon import but can be overridden by adjusting the controls. The preview of the imported data displays the first one hundred lines of each imported sheet.
An estimate of the number of data tables upon import using the current mapping is given in brackets in the Import button. This helps the user to judge whether the specified mapping produces the desired number of data tables. The Import All button is used to import multiple sheets at the same time.
Clicking on the Preview Original Data button allows the user to quickly review the original data. This might be useful in case explanatory data that is needed to perform mapping gets trimmed out during the import process. Also, in the preview of the original data, specific subsets of data can be selected for import.
The mapping table on the right in the Import Observed Data window shows the automatically generated mapping of the columns of the source sheet to the targets columns. Automatic mapping of source columns onto the target columns takes the following criteria into account:
Equality of names.
The target column has the same name as the source column.
The target column supports the unit of the source column.
If several target columns match the above criteria, the ones that have not been used in mapping are preferred to avoid multiple mapping.
If no matching target column can be found, proceed as for meta data information on table level.
The mapping of source and target columns can be changed manually by using the buttons on the right hand side of each target column cell.
A source column can only be mapped to a target column if the data types are compatible. This means, for example, that you cannot map a source column of data type 'date' to a target column of data type 'number'. Source columns of data type text can be mapped to all target column data types.
The icons to the left of each target entry in the mapping dialog have the following meaning:
It might be more effective to enter meta data information for a column during the mapping process, especially if you are using the multiple mapping feature.
The meta data will then be used for all columns which will be created out of this mapping.
Using Group By in the mapping By mapping a source column on a target column using, the data sheet will be split into every distinct value of that source column. This results in multiple import tables. The label of each resulting import table contains the source column name and the respective value of the source column in that group. If used for grouping, a target column will appear as meta data in the following.
Meta Data Mapping If meta data are requested for importing tables you can also map source columns onto such meta data fields. Then the source data gets split in the same way as for a group by mapping and the meta data fields are filled out with the distinct values of the source column.
In both, PK-Sim® and MoBi®, observed data can be organized in folders in the Building Block explorer. Observed data can be grouped into subfolders and shifted among folders by drag/drop or by using meta data to automatically group observed data during the import into a specific subfolder.
To import a single data sheet you have to click on the Import button. If you want to import several data sheets in one file, click on the Import All button (see Import All) button. The number of currently imported tables is shown in brackets in the imports tab page caption.
You can go through each source sheet, map the columns and import the sheet as new import table. That way you would collect several import tables which can be transferred to the calling application later on.
Each required target column must be mapped onto at least one source column to enable the import buttons.
Meta data are additional information that the calling application might request of the user. There can be meta data requested for an imported table or for each imported column (see below for an example).
For a column there can be multiple dimensions defined. Each dimension can have multiple units.
If no unit information is found in the source column, the default unit is automatically set but must be explicitly confirmed.
A new tab page is created for each imported data file and you can enter meta data for tables or columns, set unit information or just view the imported data (see Imported Table Tab Page Screenshot). Changes to the error type or to units can be made in this view and are directly reflected in the chart.
To complete the import of data tables to the calling application press the OK button.
The new editing window can be accessed through double clicking the observed data in the building block view or through the context menu.
Deselecting Sheets You can deselect a complete source sheet from being imported by closing the tab page (clicking the button). This can increase clarity and has a direct influence on the Import All button (see Import All).
The predefined data types are time, concentration and error of concentrations and are available from a drop down menu. Similarly, imported data can be classified as meta data. Meta data is additional information on the imported data that applies to one or more data repository. The following meta data categories are available from the dropdown menu: molecule, species, organ, compartment, study ID, gender, dose, route and patient ID. For further information on handling and entering meta data see, “Entering Meta Data”. Units can be specified after clicking on .
The icon indicates that meta data are requested.
The icon indicates that meta data are requested which are not entered right now.
The icon indicates that unit information are requested.
The icon indicates that unit information are requested which are not explicitly entered right now.
The icon indicates that meta data and unit information are requested.
The icon indicates that the data will be split into several tables by distinct values of source column (see Using Group By in the mapping).
A column unit can be set in the mapping dialog or by selecting Set Unit from the context menu of a column in the imported table tab page (see Imported Table Tab Page Screenshot).
On the left hand side you can see all meta data of the currently selected table and their columns. You can enter the requested information directly into this area or select Edit Meta Data from the context menu of a column header.
To set a unit for a column of an imported table you can select Set Unit from the context menu of a column header.
Missing User Input All required meta data and units need to be defined before finalizing the import. Each table in which meta data and/or unit information is missing, is labelled by a icon preceding the table name. Use the page select button to get a list of all tables and identify those with missing information.
Deselecting Tables You can deselect an imported table from being transferred by closing its tab (clicking the button).
Once a repository of observed data is imported, it can be manipulated by adding new data points, numerically changing data points or changing meta data. Changes are reversible through and will be tracked in the project history. Numerically changing a value is reflected in real time in the preview graph below and will result in moving the data point in the data grid to the new settings