Programmer Guide

Question

Q:

What is the target database type?

Answer 1

A:

DFsqlload supports four popular SQL server platforms - Oracle, PostgreSQL, MS SQLServer and MySQL. Your target database will need to be one of these four types. Once this is known, DFsqlload is directed to create a set of relational tables for a given database type using the -flavor option. If the target database type is Oracle, then this option is not required as DFsqlload assumes Oracle by default. Otherwise,

Target is a PostgreSQL database - use -flavor postgresql
Target is a MySQL database - use -flavor mysql
Target is a MS SQLServer database - use -flavor mssql
Target is an Oracle database - omit this option, or use -flavor oracle

Answer 2

A:

The default behavior of DFsqlload is to preserve data typing. The older version of DFsqlload that was shipped with DFdiscover 3.7 and 3.7.001 did not preserve data typing and all user fields were converted to type VARCHAR. If you want to retain this behavior, then you will need to use the -type option or the applications you have written that use any of the tables created by these older versions of DFsqlload will probably not work. If you are writing new applications, but want your old applications to work as well, you can get DFsqlload to create both typed and untyped tables in your target relational database.

To preserve Data Typing - omit this option or use -type typed To provide both typed and untyped tables - use -type both To provide just untyped tables - use -type untyped

Answer 3

A:

Relational tables rely on other relational tables when a code has a corresponding label and it is the label that you want to report. Rather than create a separate table for every coded variable, DFsqlload offers two options which may be used together or separately depending on the specific requirements. The -coding option controls the inclusion of label data. By default, you get just the codes. You can create a column for the code and a column for the label corresponding to that code using the option -coding both. If all you want is the labels, use the option -coding label. The other way to get value/label into your relational tables is to create the optional DFCODING table. This is done using the -table dfcoding option. This optional table contains all codes and labels for all DFdiscover fields with type CHOICE. See the DFsqlload reference page for more details.

Answer 4

A:

Use the -table dfsubjectalias option to request creation of the optional DFSUBJECTALIAS table containing two columns, DFpid and DFalias. Thereafter it is an easy SQL join statement to include the subject alias together with the subject id.

The DFSUBJECTALIAS table is also created by default if subject aliases are defined when loading of all tables is requested.

Answer 5

A:

How missing codes are handled depends upon the -type option. If the target tables are untyped, then by default, codes are output in the relational tables as is. To get the labels corresponding to a missing value code, use the option -missing label. If the target tables are typed, then all missing codes are converted to NULL and logged as such, regardless of how the -missing option is used. If the -table dfnullvalue option is used, a record of each substitution is created in the optional DFNULLVALUE table.

Answer 6

A:

By default, DFsqlload creates date columns using the correct data type for the target system. A string representation of a date can be output in a separate column using the option -date both. If just the string representation is required, use the -date untyped option. Partial dates (i.e., dates where the day or month are missing) are imputed by default, according to the rules specified in the DFdiscover setup. If imputed dates are not desired, you can turn it off using the -noimpute option, in which case any partial dates will be converted to NULL.

Answer 7

A:

DFsqlload writes extensive logging information. When DFsqlload encounters a problem, the problem is written to stderr by default, unless overridden with the -q option. In either case, problems are logged in the DFsqlload log file for a given run. If DFsqlload encounters problems with DFdiscover data, it replaces the problem data with a NULL value, writes the substitution to the log file and optionally creates a record for the substitution in the DFNULLVALUE table. The following identifies typical problems and how they are handled by DFsqlload.

Any field that is blank or contains only white space (space, tab) will be converted to a NULL. These substitutions are not logged.
All missing value codes are converted to NULL if the target tables are typed (the default). This is applied consistently, even if the missing value code happens to be legal for some field types.
Any value wider than the storage width defined for the field in the DFdiscover schema is converted to NULL.
If a field has a format defined in the DFdiscover schema, values are checked for adherence to the format and are converted to NULL if they do not conform.
Invalid dates are converted to NULL.
If date imputation is not defined in the DFdiscover schema, partial dates are converted to NULL. Any imputed dates that are not legal dates are also converted to NULL.
If a check or choice box contains an undefined code, it is converted to NULL.

If you use the -d drfname option, a .drf file will be created containing a reference to each DFdiscover record having one or more non-blank substitutions to NULL.

Complete records will be rejected if the following conditions are encountered.

the record does not contain the correct number of fields
any of the DFdiscover fields are blank or invalid
a record with the same keys has already been imported

These cases will also appear in the .drf file if the -d drfname option is used.

Answer 8

A:

Yes. DFsqlload will ignore them.

Answer 9

A:

DFsqlload recreates the tables it needs each time it is run. Be careful when changing DFsqlload program options from run to run. SQL tables created from a previous run are not recreated if they are not required by the current run.

Answer 10

A:

DFsqlload will drop the table (and all of your changes) and recreate it from the current DFschema and data in the DFdiscover study database.

Answer 11

A:

If there have been no changes to the data definitions, the data is dropped from the SQL table and reloaded from DFdiscover. This occurs even if there have been no data changes for that DFdiscover plate since the last time DFsqlload was run.

Description	Limit	Comments
DFdiscover Study Number	`1`-`999`	The suggested range for study numbers is 1-249 as study numbers of 250-255 are reserved for DFdiscover test and validation studies (e.g. ATK = 254). With the appropriate software license, study numbers 256-999 are available for defining EDC studies.
Plate Number	0-501, 510, 511	Plates 501 and 511 are reserved by DFdiscover for Query Reports and can not be re-defined at the user level. Plate 0 references the new record queue. Plate 510 is reserved by DFdiscover for Reason records.
Visit/Sequence Number (barcoded)	0-511
Visit/Sequence Number (first data field)	`0`-`65535`	Any data field representing the visit/sequence number must be defined in the database as field #6 using DFdiscover schema numbering.
Site Number	`0`-`21460`	This limit applies to the site number only. A subject identifier is concatenated to the site number to obtain the subject ID.
Subject ID Number	`0`-`281474976710655`	For subject IDs that are composed of site # + ID #, this limit applies to the concatenated value of the two. This field could contain 15 digits at maximum.
Any numeric value	`-2147483647`-`2147483647`	Any numeric field, except the subject ID field, can contain 10 digits at maximum, which include any leading sign and decimal point. This limit applies to the following DFdiscover field types, which have a base numeric value: numeric, visual analog scale (VAS), choice codes, check codes.
Query Use	0 = none 1 = external 2 = internal
Query Type	0 = none 1 = Q&A (clarification) 2 = refax (correction)
Query Category Code	`1`=missing, `2`=illegal, `3`=inconsistent, `4`=illegible, `5`=fax noise, `6`=other, `21`=missing page, `22`=overdue visit, `23`=edit check missing page, `30-99`=user-defined problem type
Query Status	`0`=pending review, `1`=new, `2`=in unsent report, `3`=resolved, NA, `4`=resolved, irrelevant, `5`=resolved, corrected, `6`=in sent report
Query Detail Field	max 500 characters
Query Note Field	max 500 characters
Missed Data Log Explanation Field	max 500 characters
Default Date Format	YY/MM/DD
Validation Level (system)	0-7	Level 0 represents new, not yet entered, records
Validation Level (user)	1-7	A user cannot assign a validation level of 0 to a data record.
Maximum Data Record Length	`16384` ASCII (4096 UNICODE) characters	This is the maximum length that the system can accept and includes 55 characters of overheard maintained by the system. Therefore, the length of data record available for user-defined fields is 55 characters less.
Maximum DFsas Record Length	2048 characters	DFsas is unable to process input files for SAS® greater than this size.

ccc-yymmdd	The naming convention for Query Reports is a zero padded 3 digit site ID (if the site ID is greater than 999, it will be a 4 digit number), followed by a dash, and then the date on which the report was created. For example: 055-150815 would identify a Query Report created for site 55 on Aug 15, 2015. Thus you can tell from a Query Report name, both whom it was created for and when. Although this is helpful it does have one disadvantage, namely, it means that you cannot create more than one Query Report per day for an individual clinical site. If you do, the earlier one will be over-written. However, since Query Reports are sent to the clinical sites, and this isn't something you should do more than about once a week, and certainly not more than once a day, this approach to naming Query Reports works. Query Reports remain under `QC` until they are sent to the clinical sites. Thus any reports that you see at this level have not been transmitted to the study clinics.
`QC/sent`	Query Reports are moved to this directory after they have been successfully sent to their respective clinical sites.
`QC/internal`	DF_QCreports is also able to create named, internal Query Reports, which can include subjects from more than one site. These reports are written directly to this directory.
QC_LOG	This is a plain ASCII file that lists any error messages reported during the last execution of DF_QCreports.
`QC_NEW`	This is a plain ASCII file that lists the Query Reports created by the most recent execution of DF_QCreports.
SENDFAX.log	This is a plain ASCII file in which DF_QCfax records the success or failure of its attempts to fax Query Reports to the clinical sites.
SENDFAX.qup	This is a plain ASCII file that lists all Query Reports which have been queued up for transmission to the clinical sites.
other files	Occasionally, DF_QCreports might be halted in mid-execution, by a power failure, or some other problem. In such cases it may not have time to remove the temporary files that it creates in the process of generating the Query Reports. These files generally contain a process ID#; for example you might see files that look like this `18273_N.refax`, `18273_N.qalist`. These temporary files can, and should, be removed.

Heading	Description
Usual Name	the file name that is usually given to files having this format. Some files are kept at the DFdiscover directory level while others are kept separately with each study directory.
Type	one of: "clear text" or "binary". Clear text files can be reviewed with any text editor.
Created By	the name of the DFdiscover program(s) that create and modify this file. If you need to edit the contents of the file, use the program listed here.
Used By	the name of the DFdiscover program(s) that reference or read this file.
Field Delimiter	how fields within a record are delimited. Typically, the delimiter is a single character.
Record Delimiter	how records within the file are delimited. Typically, the delimiter is a single character.
Comment Delimiter	how comments within the file are delimited. If comments are not permitted within the file, "NA" is indicated.
Fields/Record	the expected number of fields per record. If the number of fields varies across records, the minimum number is given, followed by a `+`.
Description	a detailed description of the meaning of each field.
Example	one or more example records from the file.

Usual Name	`filename.drf`
Type	clear text
Created By	DFexport.rpc, DFmkdrf.jnl, DFexplore
Used By	DFexplore
Field Delimiter	`\|`
Record Delimiter	`\n`
Comment Delimiter	#
Fields/Record	3+
Description	Data records have the common fields and characteristics defined in Table 2.3, “DFdiscover Retrieval File field descriptions”.
Example	This is an example of a DRF listing all primary records having one or more queries attached to them. The DRF filename `ext_qcs.150813.drf` was created on August 13, 2015 and has the description `external qc notes.150813`. 99001\|1\|2 99001\|1011\|9 99002\|1\|2 99002\|22\|5 99002\|1011\|9 99004\|1\|2

Field #	Contains	Description
1	subject/case ID	this is a number in the range of `0`-`281474976710655` used to uniquely identify each subject/case in the study database
2	visit/sequence number	a number in the range of `0`-`21460`, that uniquely identifies each occurrence of a visit number for a subject ID
3	plate number	a number in the range of 1-501 that uniquely identifies the plate to be included in the DRF.
4	image identifier	the value in this optional field is the unique identifier of the CRF image identified by the keys in the first 3 fields. This field is used to identify a specific instance of a CRF when there are one or more secondaries having the same key fields.
5	optional text	this field can be used to provide a short descriptive message to DFexplore users. The text is displayed in the message window at the bottom of DFexplore when the data record is selected in the DFexplore record list.

	Note
	This directory must not be used to store any other files. Extraneous files will be deleted when CRF images are published from a development study to its production study, or when reverting a development study to the current state of its production study.

Field #	Contains	Description
1 (`DFSTATUS`)	record status	Enumerated value from the list `1`=`final`, `2`=`incomplete`, `3`=`pending`, `4`=`FINAL`, `5`=`INCOMPLETE`, `6`=`PENDING`, `0`=`missed`
2 (`DFVALID`)	validation level	Enumerated value from the list 1, 2, 3, 4, 5, 6, 7
3 (`DFRASTER`)	raster name	the fax image id from which this data record was derived, if there was a fax image. The image id is always in the format `YYWW/FFFFPPP` or `YYWWRFFFFPPP`, where `YY` is the year (minus the century) in which the image id was created, `WW` is the week of the year, `FFFF` is a sequential base 30 value, in the range 0001-ZZZZ, representing the fax arrival order within the week, and `PPP` is the page number within the fax (also see `pages`). If the image id contains a slash (`/`) in the 5th character position, then the image id is for a fax. Pre-pending this image id with the value of the `PAGE_DIR` variable for the study creates a unique pathname to the file containing the fax image. If the image id contains `R` in the 5th character position, then the image id is for a raw data entered record, and in fact there is no image.
4 (`DFSTUDY`)	study number	the DFdiscover study number (must be constant across all records for the same study). Legal limit is `1`-`999`.
5 (`DFPLATE`)	plate number	the plate number as identified in the barcode of the CRF. Within each data file, this field has a constant value. Legal limit is 1-501.
6 (`DFSEQ`^[a])	visit/seq number	the visit or sequence number of this occurrence of a plate for a subjects id. Legal limit is 0-511 if defined in the barcode, and `0`-`65535` if defined as the first data field on the page.
7	subject ID	the subject identifier. Legal limit is `0`-`281474976710655`. subject identifiers are composed of site # + subject ID. This limit applies to the concatenated value of the two, where the site # legal limit is `0`-`21460`.
8 to N-3	data	data fields from the corresponding CRF page
N-2 (`DFSCREEN`)	record status	Enumerated value from the list `1`=`final`, `2`=`incomplete`, `3`=`pending`. This record status mimics the status recorded in the first field except that there is no distinction between primary and secondary.
N-1 (`DFCREATE`)	creation date	the creation date and time stamp in the format `yy/mm/dd hh:mm:ss`. The creation date and time stamp are added to the record the first time it is validated to a level higher than 0. It is never modified after initial addition to the record.
N (`DFMODIFY`)	last modification date	the last modification date and time stamp in the format `yy/mm/dd hh:mm:ss` The initial value for this field is the same as the creation date. Each time any data within the record is modified, the modification stamp is updated. It is not updated if only the record's validation level has changed.
^[a]Field 6 has this name only if the field is defined to be in the barcode; otherwise, the name is user-defined.

Usual Name	`plt###.dat`
Type	clear text
Created By	DFserver.rpc
Used By	DFserver.rpc
Field Delimiter	`\|`
Record Delimiter	`\n`
Comment Delimiter	NA
Fields/Record	11
Description	If a CRF is reported missing, and is not expected to ever arrive (e.g. because the subject missed a visit, or refused a test), it can be registered in the study database using DFexplore. These records have a status of `lost` and are added to the study database file with the same plate number, `plt###.dat`, similar to actual data records. The contents of the record fields are described in Table 2.7, “Missed record field descriptions”.
Example	Here is an example of a missed data record 0\|7\|000/0000000\|099\|4\|1\|0123\|1\|subject was on vacation\|92/01/10 10:23:23\|92/01/10 10:23:23\|

Field #	Contains	Description
1 (`DFSTATUS`)	record status	Enumerated value - always contains a `0` for missed records, which is equivalent to the record status `lost`
2 (`DFVALID`)	validation level	Enumerated value from the list 1, 2, 3, 4, 5, 6, 7
3 (`DFRASTER`)	image name	always contains `0000/0000000` for missed records. This is simply a placeholder value as there is no CRF image.
4 (`DFSTUDY`)	study number	the DFdiscover study number (must be constant across all records for the same study)
5 (`DFPLATE`)	plate number	the plate number
6	visit/seq number	the visit or sequence number
7	subject ID	the subject identifier
8	reason code	Enumerated value - the reason that the CRF was missed, selected from the following list: `1`=Subject missed visit, `2`=Exam or test not performed, `3`=Data not available, `4`=Subject refused to continue, `5`=Subject moved away, `6`=Subject lost to follow-up, `7`=Subject died, `8`=Terminated - study illness, `9`=Terminated - other illness, `10`=Other reason
9	reason text	an additional, optional explanation as to why the CRF was missed
10 (`DFCREATE`)	creation date	the creation date and time stamp in the format `yy/mm/dd hh:mm:ss`
11 (`DFMODIFY`)	last modification date	the last modification date and time stamp will always be the same as the creation date and time stamp as missed records are never modified once created (changes, if necessary, are made by deleting the existing missed record and creating another one)

Usual Name	`DFqc.dat`
Type	clear text
Created By
Used By	DFserver.rpc, DFexplore, DFbatch, DF_CTqcs, DF_ICqcs, DF_ICrecords, DF_PTcrfs, DF_PTqcs, DF_QCreports, DF_QCsent, DF_QCstatus, DF_QCupdate
Field Delimiter	`\|`
Record Delimiter	`\n`
Comment Delimiter	NA
Fields/Record	22
Description	The query data file, DFqc.dat, is maintained by the study server in the same manner as the CRF data files, DFin.dat and plt.dat. All query records have exactly the same format, across all DFdiscover studies. Each query record has 22 fields, of which 5 (fields 4-8) are the keys needed to uniquely identify each record. Multiple queries are permitted per field provided their category codes are unique. As for study data files, you should always use DFexport.rpc* to export the query database before using it. For this purpose, DFqc.dat has been assigned a DFdiscover reserved plate number of 511. Table 2.9, “`DFqc.dat` field descriptions” describes each field in a query record.
Example	An example of a newly added query that is to be included in the next Query Report sent to the originating site (the line break is for presentation purposes only): 1\|1\|0000/0000000\|20\|10\|999\|2100101\|6\|21\|000000\|0\|\|1. Sample collection date \|11/10/05\|1\|2\|\|\|user1 06/05/27 11:35:57\|user1 06/05/27 11:35:57\|\|1\| An example of a query that was sent out in a Query Report and has now been resolved: 5\|4\|0000/0000000\|20\|397\|2030\|1415412\|11\|14\|060424\|31\|1\|A2. Whole Blood 1ml ID # \|\|1\|2\|\|\|brian 06/03/23 12:57:20\|brian 06/03/23 12:57:20\|barry 06/05/11 14:10:04\|1\|

Usual Name	`DFreason.dat`
Type	clear text
Created By	DFserver.rpc
Used By	DFserver.rpc
Field Delimiter	`\|`
Record Delimiter	`\n`
Comment Delimiter	NA
Fields/Record	12
Description	Data fields may require that a change to the value in the field be supported by a reason for the change. This reason information is recorded in . The contents of the record fields are described in Table 2.11, “Reason for data change record field descriptions”.
Example	Here is an example of a reason data record 1\|3\|000/0000000\|254\|4\|1\|0123\|9\|phone\|information provided by physician over the phone\|nathan 03/01/10 10:23:23\|nathan 03/01/10 10:23:23 It indicates that field 9 was changed because of the reason "information provided by physician over the phone".

Field #	Contains	Description
1 (`DFSTATUS`)	record status	valid status codes include: 1=approved, 2=rejected, 3=pending
2 (`DFVALID`)	validation level	Enumerated value from the list 1, 2, 3, 4, 5, 6, 7. This is the record's last validation level when the reason was created or modified.
3 (`DFRASTER`)	image name	always contains `0000/0000000` for reason for data change records. This is simply a placeholder value as there is no CRF image.
4 (`DFSTUDY`)	study number	the DFdiscover study number (must be constant across all records for the same study)
5 (`DFPLATE`)	plate number	the plate number
6 (`DFSEQ`)	visit/seq number	the visit or sequence number
7 (`DFPID`)	subject ID	the subject identifier
8 (`DFRSNFLD`)	reason field number	the data entry field number this reason note refers to. This numbering does not correspond directly to the DFdiscover schema numbering, but instead is offset by 3, so for example, the ID field which is always database field number 7 will always report the reason field number as 4. This is identical to the behavior of the field number reported in queries.
9 (`DFRSNCDE`)	reason code	an optional coding of the reason for data change. Although this code field contains textual data, it should be possible to use it as a categorical variable. The code will typically come from the first field of the `REASON` lookup table, if it is defined.
10 (`DFRSNTXT`)	reason text	required text that provides the reason for the data change. The maximum length of this field is 500 characters.
11 (`DFRSNCRT`)	creation date	the creator, creation date and time stamp in the format `name yy/mm/dd hh:mm:ss`. This field is completed when the reason for data change is first created.
12 (`DFRSNMDF`)	last modification date	the modifier, last modification date and time stamp in the format `name yy/mm/dd hh:mm:ss`. The initial value for this field is the same as the creation date. Each time the reason note is modified, the modification stamp is updated.

Usual Name	`DFin.dat`
Type	clear text
Created By	DFserver.rpc
Used By	DFserver.rpc
Field Delimiter	`\|`
Record Delimiter	`\n`
Comment Delimiter	NA
Fields/Record	5+
Description	Each initial data record, created by the ICR software when it scans a CRF page, is passed to the study database server which appends it to this file. These are called "new records". All new records have the common fields and attributes identified in Table 2.13, “`DFin.dat` field descriptions”. The first 5 fields are added by the process that ICRed the page. The 4^th and 5^th fields were read from the barcode at the top of each CRF page. Dependant upon the success of the ICR algorithm, subsequent fields in each new record may or may not be filled in. If a fax image is particularly difficult to read, it is possible that only the first five fields will appear in the new record. The first three fields, in addition to being delimited, are also in fixed column positions. The record status begins in the first column and is one column wide. The validation level begins in the third column and is one column wide. The raster name begins in the fifth column and is 12 columns wide. Columns two and four contain the field delimiter.
Example	Here is an example of a new data record from 0\|0\|9145/0045001\|099\|4\|1\|0123\|\|92/01/01\|\|\| This record has new record status, has not yet been validated, contains the data from fax image $(PAGE_DIR)/9145/0045001, and belongs to study 99, plate 4, visit 1, and subject ID 123.

Usual Name	`YYMM.jnl`
Type	clear text
Created By	DFserver.rpc
Used By	DF_ATfaxes, DF_ATmods, DF_WFcrfs, DF_WFqcs
Field Delimiter	`\|`
Record Delimiter	`\n`
Comment Delimiter	NA
Fields/Record	11+
Description	The study database server adds a record to the current study database journal file, each time that a data record is written to the study database, including when: a new record arrives in the study database from the incoming daemon an existing record is modified in DFexplore the validation level of an existing record is modified in DFexplore the status of an existing record is modified in DFexplore an existing record is deleted in DFexplore a query is added, modified, resolved, or deleted in DFexplore a record is imported into the study database using DFimport.rpc the database is structured by DFsetup Separate journal files are kept for each study, and within a study, a new journal file is created for each month. The naming scheme for journal files is `YYMM.jnl` where `YY` is the last two digits of the year (e.g. 92) and `MM` is the two digit month of the year, January being 01 and December being 12. The fields within each journal record are defined as described in Table 2.15, “`YYMM.jnl` field descriptions”.
Example	Following is an example of a complete journal record. The line breaks are for presentation purposes only. 980312\|132426\|valid1\|d\|1\|1\|9807/0047008\|254\|5\|24\|99001\|ABC\|06/07/97\| 171\|097\|172\|096\|2\|055.1\|121.5\|2\|\|1143\|00\|\|\|\|1\|1\|\|1\|1\| 98/03/12 13:24:26\|98/03/12 13:24:26\|

Field #	Contains	Description
1	date stamp	A date stamp, in `YYMMDD` format, identifying when the data record was written to the database
2	time stamp	a time stamp, in `HHMMSS` format, identifying when the data record was written to the database. Hours are reported in 24-hour notation.
3	username	the username of the person who wrote the record to the database. This is the login name of the user who modified the record.
4	record type	Enumerated value - indicates the type of the journal record. Possible values are: `d` for data record, `q` for query record, `r` for reason record, `s` for the beginning of a setup restructuring, and `S` for the end of a setup restructuring.
5-11	record keys	fields 5 through 11 contain the first 7 fields from the data record.
12+	record data fields	the remaining data fields (8 to the end of the record) follow.

Usual Name	`DFaudit.db`
Type	binary
Created By	DFserver.rpc, DFauditdb
Used By	DFserver.rpc, DFedcservice, DF_SBhistory
Field Delimiter	NA
Record Delimiter	NA
Comment Delimiter	NA
Fields/Record	NA
Description	All journal records are stored additionally in a sqlite database. The database contains table DFaudit and indexes. Columns in the DFaudit table match the output from DFaudittrace: CREATE TABLE IF NOT EXISTS dfaudit ( dftype TEXT NOT NULL, -- 1 type: N=new, C=changed field, D=deleted dfdate INTEGER NOT NULL, -- 2 date: yyyymmdd dftime TEXT NOT NULL, -- 3 time: hhmiss dfuser TEXT NOT NULL, -- 4 user dfsubject INTEGER, -- 5 subject dfvisit INTEGER, -- 6 visit dfplate INTEGER NOT NULL, -- 7 plate dffieldid INTEGER, -- 8 unique field id: data(=0), query(>0), reason(<0) dffieldchange INTEGER, -- 9 changed field: data(0, unique id), query(field#), reason(field#) dfstatus INTEGER, -- 10 data, query and reason status dflevel INTEGER, -- 11 validation level dfmaxlevel INTEGER, -- 12 maximum validation level reached dfcategory TEXT, -- 13 missed record code, query category, reason code dfuse TEXT, -- 14 missed record text, query usage, reason text dfoldval TEXT, -- 15 old value dfnewval TEXT, -- 16 new value dffieldnum INTEGER, -- 17 field number dffielddesc TEXT, -- 18 field description dfoldlbl TEXT, -- 19 old coded field label dfnewlbl TEXT, -- 20 new coded field label dfuid INTEGER) -- 21 internal use for database restructure CREATE INDEX IF NOT EXISTS dfidx_date ON dfaudit (dfdate, dfsubject) CREATE INDEX IF NOT EXISTS dfidx_user ON dfaudit (dfuser) CREATE INDEX IF NOT EXISTS dfidx_fuid ON dfaudit (dfuid) CREATE INDEX IF NOT EXISTS dfidx_fid ON dfaudit (dffieldid) CREATE INDEX IF NOT EXISTS dfidx_fnum ON dfaudit (dffieldnum) CREATE INDEX IF NOT EXISTS dfidx_keys ON dfaudit (dfsubject, dfvisit, dfplate) CREATE INDEX IF NOT EXISTS dfidx_vst ON dfaudit (dfvisit) CREATE INDEX IF NOT EXISTS dfidx_plt ON dfaudit (dfplate) CREATE INDEX IF NOT EXISTS dfidx_sta ON dfaudit (dfstatus, dflevel) CREATE INDEX IF NOT EXISTS dfidx_lvl ON dfaudit (dflevel) DFserver.rpc opens `DFaudit.db` in read-write mode. Whenever a record entry is added to journal file, it will be appended to DFaudit table. DFedcservice opens `DFaudit.db` in read-only mode when requesting history. DFauditdb converts existing journal files to `DFaudit.db`.

Field #	Description	Required	Maximum Size
1	site ID	yes	5 digits
2	Contact	yes	30 characters
3	Name	yes	40 characters
4	Address	no	80 characters
5	Fax	no	4096 characters
6	Attributes including Start Date, End Date, Enroll Target, Protocol Effective Date (x5), Protocol Version (x5) This field contains zero of more semi-colon ( `:` ) delimited pairs, where each pair is a keyword and value where the keyword is from the list `country`, `beginDate`, `endDate`, `enroll`, `protocol1`, `protocol1Date`, `protocol2`, `protocol2Date`, `protocol3`, `protocol3Date`, `protocol4`, `protocol4Date`, `protocol5`, `protocol5Date`, `testSite`.	no	4096 characters
7	Telephone (site)	no	30 characters
8	Investigator	no	30 characters
9	Telephone (investigator)	no	30 characters
10	Reply to email address	no	80 characters
11+	Subject ID ranges	yes	30 digits,1 space

Usual Name	`plt###.ndx`
Type	binary
Created By	DFserver.rpc
Used By	, DFshowIdx
Field Delimiter	NA
Record Delimiter	NA
Comment Delimiter	NA
Fields/Record	NA
Description	Every study data file has a corresponding index file that the study server uses to track the current status and location of each record in the data file. The index entry for a particular data record includes the value of the key fields id, plate, and visit/sequence number, the record status, the validation level, the offset of the beginning of the record into the data file, and the length of the data record. When searching for a data record by keys, it is much more efficient for the database server to search the index file for matching keys and then use the offset and length to extract the data record from the data file. Each time an existing data record is modified or a new record is added, a new entry is made at the end of the index file for the new, modified copy of that record, and the status of the old index entry (if there was one) is changed to indicate that it has been superseded by a new entry. Index and data files are sorted (on id and visit/sequence number) by the study database server, each time the server exits. Before sorting, an index file has `M` sorted entries at the top of the file, and `N` unsorted entries at the bottom of the file. When searching, the unsorted index entries are searched first in a linear fashion and then, if necessary, a binary search is performed on the sorted entries. The first 32 bytes of an index file are header information, consisting of four 4-byte numbers that identify attributes of the file as a whole, as described in Table 2.18, “`plt###.ndx` header bits”, followed by 16 bytes of padding. Before the study database server exits, it checks this header information of each index file. If the number of unsorted entries and the number of pending deletes are both 0, then the file is already sorted and does not need further attention. The 32 bytes of header information are followed by the actual index entries. Each index entry is 32 bytes in size and is described in Table 2.19, “`plt###.ndx` record bits”. Each index entry contains 8 bytes for the subject ID, 2 bytes for the visit number, 2 bytes for the plate number, 4 bytes for the offset into the data file, 2 bytes for the record length, a status byte and 13 bytes of padding. The status byte encodes three pieces of information: the record status (equivalent to the numeric value of the first byte of the data record), the record validation level (equivalent to the numeric value of the third byte of the data record), and the status of the index entry. This is encoded as illustrated in Figure 2.1, “Encoding for bits within the status byte”. Figure 2.1. Encoding for bits within the status byte The record status contains the same values as those allowed in the record status field of the data record, namely 0 through 7 (binary 111). Similarly, the validation level will take on the same values as those allowed in the validation level of the data record, again 0 through 7. The index status contains the value 2 if this is a new index entry, 1 if the index entry has been superseded by a newer one, and 0 otherwise. The size of all index files should always be a multiple of 32 bytes.

First Byte	Last Byte	Contains
1	4	magic number indicating that this is an index file. Should always have the fixed value 0xdf6464df.
5	8	the number of sorted index entries
9	12	the number of unsorted index entries
13	16	the number of pending deletions
17	32	reserved for future use

Size	Contains
8 bytes	subject ID
2 bytes	visit/sequence number
2 bytes	plate number
4 bytes	offset in bytes from beginning of data file to first byte of record (0 based)
2 bytes	length of record in bytes
1 byte	status bits
13 bytes	reserved for future use

Usual Name	`DFedits`
Type	clear text
Created By	DFsetup
Used By	DFexplore
Field Delimiter	NA
Record Delimiter	NA
Comment Delimiter	`#`
Fields/Record	NA
Description	This file contains the edit checks that are defined for this study. The edit check language is fully described in Edit checks.
Example	edit SetInit() { if ( dfblank( init ) && !dfblank( init[,0,1] ) ) init = init[,0,1]; } edit AgeOk() { number age; if ( !dfblank( p001v03 ) && !dfblank( p001v04 ) ) { age = ( p001v03 - p001v04 ) / 365.25; ...

Usual Name	`DFccycle_map`
Type	clear text
Created By	DFsetup
Used By	DF_QCupdate
Field Delimiter	`\|`
Record Delimiter	`\n`
Comment Delimiter	`#`
Fields/Record	2+
Description	This table describes the conditional cycle map file structure and provides an example. It does not describe all of the syntax and rules related to this feature. Usage instructions for all 4 conditional maps is fully described in Study Setup User Guide, Conditional Maps. The file contains one or more specifications, each consisting of a condition definition followed by one or more actions to be applied if the condition is met. Entries in the file have the general appearance of: IF\|Visit List\|Plate\|Field\|Value AND\|Visit List\|Plate\|Field\|Value [+-~]\|List of conditional cycles Each condition may be followed by one or more action statements. Each of these statements begins with: '+' to indicate that the cycles are required, '-' to indicate that the cycles are unexpected, or '~' to indicate that the cycles are optional, when the condition is met. There is no limit to the number of condition/action entries that may be included but the order in which the conditions appear may be important, because in the event of a conflict, the action specified by the last entry, applicable to each cycle, is the action that will be applied. This point is illustrated in the following example.
Example	IF\|0\|1\|22\|6 +\|2,5,8 -\|3,6,9 ~\|4,7,10 IF\|0\|1\|22\|5 AND\|0\|9\|13\|>0 AND\|0\|9\|36\|!1 +\|11 IF\|1\|3\|9\|~^A -\|11 This example, consists of 3 conditional specifications. They are applied in the order in which they are defined. The first specification indicates that, if field 22 on plate 1 at visit 0 equals 6, then cycles 2, 5 and 8 are required; cycles 3, 6 and 9 are not expected; and cycles 4, 7 and 10 are optional. The second specification indicates that, if field 22 on plate 1 at visit 0 equals 5, and field 13 on plate 9 at visit 0 is greater than zero, and field 36 on plate 9 at visit 0 is not equal to 1, then cycle 11 is required. The third specification indicates that, if field 9 on plate 3 at visit 1 begins with the capital letter "A", then cycle 11 is not expected. If both conditions 2 and 3 are met cycle 11 will be considered unexpected because, when a conflict occurs, the last condition wins.

Usual Name	`DFcvisit_map`
Type	clear text
Created By	DFsetup
Used By	DF_QCupdate
Field Delimiter	`\|`
Record Delimiter	`\n`
Comment Delimiter	`#`
Fields/Record	2+
Description	This table describes the conditional visit map file structure and provides an example. It does not describe all of the syntax and rules related to this feature. Usage instructions for all 4 conditional maps is fully described in Study Setup User Guide, Conditional Maps. The file contains one or more specifications, each consisting of a condition definition followed by one or more actions to be applied if the condition is met. Entries in the file have the general appearance of: IF\|Visit List\|Plate\|Field\|Value AND\|Visit List\|Plate\|Field\|Value [+-~]\|List of conditional visits Each condition may be followed by one or more action statements. Each of these statements begins with: '+' to indicate that the visits are required, '-' to indicate that the visits are unexpected, or '~' to indicate that the visits are optional, when the condition is met. There is no limit to the number of condition/action entries that may be included but the order in which the conditions appear may be important, because in the event of a conflict, the action specified by the last entry, is the action that will be applied. This point is illustrated in the following example.
Example	IF\|0\|1\|22\|6 +\|10-19 -\|20-29 ~\|30 IF\|0\|1\|22\|5 AND\|0\|9\|13\|>0 AND\|0\|9\|36\|!1 +\|40 IF\|1\|3\|9\|~HIV -\|40 This example, consists of 3 conditional specifications. They are applied in the order in which they are defined. The first specification indicates that, if field 22 on plate 1 at visit 0 equals 6, then visits 10 to 19 are required, visits 20 to 29 are unexpected, and visit 30 is optional. The second specification indicates that, if field 22 on plate 1 at visit 0 equals 5, and field 13 on plate 9 at visit 0 is greater than zero, and field 36 on plate 9 at visit 0 is not equal to 1, then visit 40 is required. The third specification indicates that, if field 9 on plate 3 at visit 1 contains the literal string "HIV", then visit 40 is not expected. If both conditions 2 and 3 are met, visit 40 will be considered unexpected because, when a conflict occurs, the last condition wins.

Usual Name	`DFcplate_map`
Type	clear text
Created By	DFsetup
Used By	DF_QCupdate
Field Delimiter	`\|`
Record Delimiter	`\n`
Comment Delimiter	`#`
Fields/Record	2+
Description	This table describes the conditional plate map file structure and provides an example. It does not describe all of the syntax and rules related to this feature. Usage instructions for all 4 conditional maps is fully described in Study Setup User Guide, Conditional Maps. The file contains one or more specifications, each consisting of a condition definition followed by one or more actions to be applied if the condition is met. Entries in the file have the general appearance of: IF\|Visit List\|Plate\|Field\|Value AND\|Visit List\|Plate\|Field\|Value [+-~]Visit List\|List of conditional plates Each condition may be followed by one or more action statements. Each of these statements begins with: '+' to indicate that the plates are required, '-' to indicate that the plates are unexpected, or '~' to indicate that the plates are optional, at the specified visits, when the condition is met. There is no limit to the number of condition/action entries that may be included but the order in which the conditions appear may be important, because in the event of a conflict, the action specified by the last entry, applicable to each plate, is the action that will be applied. This point is illustrated in the following example.
Example	IF\|0\|1\|22\|6 +10,20\|50,51 -10,20\|40,41 ~10,20\|15 IF\|0\|1\|22\|5 AND\|0\|9\|13\|>0 AND\|0\|9\|36\|!1 +91-95\|16 IF\|1\|3\|9\|yes -91\|16 This example, consists of 3 conditional specifications. They are applied in the order in which they are defined. The first specification indicates that, if field 22 on plate 1 at visit 0 equals 6, then at visits 10 and 20: plates 50 and 51 are required, plates 40 and 41 are not expected, and plate 15 is optional. The second specification indicates that, if field 22 on plate 1 at visit 0 equals 5, and field 13 on plate 9 at visit 0 is greater than zero, and field 36 on plate 9 at visit 0 is not equal to 1, then at visits 91-95 plate 16 is required. The third specification indicates that, if field 9 on plate 3 at visit 1 contains exactly the string "yes", and nothing more, then plate 16 is not expected at visit 91. If both conditions 2 and 3 are met plate 16 will be considered unexpected at visit 91, but required at visits 92-95, because, when a conflict occurs, the last condition wins.

Usual Name	`DFcterm_map`
Type	clear text
Created By	DFsetup
Used By	DF_QCupdate
Field Delimiter	`\|`
Record Delimiter	`\n`
Comment Delimiter	`#`
Fields/Record	1+
Description	This table describes the conditional termination map file structure and provides an example. It does not describe all of the syntax and rules related to this feature. Usage instructions for all 4 conditional maps are fully specified in Study Setup User Guide, Conditional Maps. The file contains one or more specifications, each consisting of a condition definition followed by one action to be applied if the condition is met. Entries in the file have the general appearance of: IF\|Visit List\|Plate\|Field\|Value AND\|Visit List\|Plate\|Field\|Value A or E Each condition is followed by either the letter 'A' (abort all follow-up), or 'E' (early termination of the current cycle). The termination date is defined as the visit date of the visit that triggered the condition, specifically the visit specified in the `IF` statement.
Example	IF\|0\|1\|22\|6 A IF\|6\|1\|22\|5 AND\|6\|9\|13\|>0 AND\|6\|9\|36\|!1 E This example, consists of 2 conditional specifications. The first specification indicates that, if field 22 on plate 1 at visit 0 equals 6, then all follow-up terminates as of the visit date for visit 0. Visits scheduled to occur before this date are still expected, but visits scheduled following this date are not. The second specification indicates that, if field 22 on plate 1 at visit 6 equals 5, and field 13 on plate 9 at visit 6 is greater than zero, and field 36 on plate 9 at visit 6 is not equal to 1, then the current cycle terminates, i.e. the cycle in which visit 6 is defined; with the termination date being the visit date of visit 6. Any visits in this cycle (or in previous cycles) that were scheduled to occur before the termination date are still expected, but visit within this cycle scheduled following this date are not. On termination of a cycle, subject scheduling proceeds to the next cycle in the visit map, if there is one.

Usual Name	`DFCRFType_map`
Type	clear text
Created By	DFsetup
Used By	DFbatch, DFprintdb, DFimport.rpc, DFexport.rpc, DFcmpSchema DFcmpSchema
Field Delimiter	`\|`
Record Delimiter	`\n`
Comment Delimiter	NA
Fields/Record	2
Description	Each record in the CRF type map has two fields, an acronym or short form (1^st field), and a descriptive label (2^nd field). The CRF type acronym and label appear in the Import CRFs dialog. The CRF type label appears in the DFexplore Preferences dialog. This file is optional. If it does not exist, CRF Types are not used for this study.
Example	PAPER\|Print Version CHINESE\|Chinese ENGLISH\|English SWEDISH\|Swedish FRENCH\|French SPANISH\|Spanish

Usual Name	`DFcrfbkgd_map`
Type	clear text
Created By	DFsetup
Used By	DFprintdb
Field Delimiter	`\|`
Record Delimiter	`\n`
Comment Delimiter	NA
Fields/Record	3
Description	Each record in the CRF background map has three fields: the visit number with the background to be repeated (1^st field), the list or range of visit numbers where the background will be repeated (2^nd field), and an optional comment field (3^rd field). This file is optional. If it does not exist, the default CRF will be displayed for visits not tagged during the Import CRFs step. If no default CRF has been imported, the CRF background will be blank for untagged visits.
Example	10\|11,13-14,16-17,19-20\|Quarterly visits 12\|15,18\|Annual visits 22\|25,28\|Annual lab work

Programmer Guide

Preface

1. Getting Help

2. Conventions

Chapter 1. Introduction

1.1. About This Guide

1.2. DFdiscover Programming Limits

Chapter 2. DFdiscover Study Files

2.1. Introduction

2.2. DFdiscover study directories

2.3. Study File Permissions

2.4. Format used to describe files

2.5. DFdiscover Retrieval Files (DRF)

2.6. The study data directory

2.6.1. Temporary data files

2.6.2. Plate data files

2.6.3. Query data files

2.6.4. Reason for change data files

2.6.5. Newly arrived data file

2.6.6. Journal files

2.6.7. Index files

2.7. The study ecsrc directory

2.8. The study lib directory

2.9. The study lut directory

2.10. The study work directory

Chapter 3. Shell Level Programs

3.1. Introduction

3.2. User Credentials

3.2.1. Good Password Management

3.2.2. Order of Evaluation

3.3. Organization of Reference Pages

3.4. Alphabetical Listing

DFaccess.rpc

Synopsis

Description

Options

Exit Status

Examples

DFattach

Synopsis

Description

Attaching Documents

Permissions

Database Actions

Options

Exit Status

Examples

DFaudittrace

Synopsis

Description

Options

Exit Status

Examples

DFbatch

Synopsis

Description

Options

Exit Status

See Also

DFcompiler

Synopsis

Description

Options

Exit Status

Examples

DFdisable.rpc

Synopsis

Description

Options

Exit Status

Examples

See Also

DFenable.rpc

Synopsis

Description

Options

Exit Status

Examples

See Also

DFencryptpdf

2.6. The study `data` directory

2.7. The study `ecsrc` directory

2.8. The study `lib` directory

2.9. The study `lut` directory

2.10. The study `work` directory

Usual Name	`DFedits.bin`
Type	binary
Created By	DFsetup, DFcompiler
Used By	DFexplore
Field Delimiter	NA
Record Delimiter	NA
Comment Delimiter	NA
Fields/Record	NA
Description	This file contains an internal, binary equivalent of the edit checks stored in the plain text `DFedits` file. This binary format contains no external references to other included files and is a more compact representation that can be more efficiently transmitted to and interpreted by DFexplore clients. The `DFedits.bin` file is created when Publish is selected in DFsetup's edit checks dialog. This file is the only edit checks file used by DFexplore.

Usual Name	`DFfile_map`
Type	clear text
Created By	DFsetup
Used By	DFserver.rpc
Field Delimiter	`\|`
Record Delimiter	`\n`
Comment Delimiter	NA
Fields/Record	4
Description	The file map lists all of the unique plate numbers used in a particular study database. The study server uses this file at start-up to determine which database files it may need to initialize and allocate file descriptors for. In addition to the listed plate numbers, the study server also allocates file descriptors for the new records file () and the query data file (). The format of this file is one record for each plate defined in the study setup. Each record has 4 fields. The first field of the record is the plate number, leading zero-padded to three digits. The second field is a textual description of the plate from the user-supplied definition in DFsetup. The textual part is displayed in the plate selection dialogs that can be found in various study tools. The third field identifies how the visit/sequence number key field is coded for that plate. A code of `1` means that the visit/sequence number is in the barcode; a code of `2` means that it is the first data field on the data entry screen for that plate. Any other code is an error. The fourth field indicates whether the arrival of this plate signals early termination of study follow-up for the subject, as would for example the arrival of a death report. A code of `1` appears if the plate signals early termination, otherwise the code is `2`. The records in the file map do not have to be sorted in increasing plate number order as the file is internally sorted by the study database server at start-up time.
Example	001\|Dosage of Study Drug (DOST-2)\|2\|2 002\|Concomitant Medication (COM)\|1\|2 003\|Written Consent (CONS-w)\|1\|2 005\|Diagnosis (DSM-III-R)\|1\|2 006\|Psychiatric History (PSH)\|1\|2 200\|Death Report\|1\|1

Usual Name	`DFlogo.png`
Type	PNG
Created By	NA
Used By	DFexplore
Field Delimiter	NA
Record Delimiter	NA
Comment Delimiter	NA
Fields/Record	NA
Description	A study logo is a small PNG file, maximum dimension is 64 pixels tall and 128 pixels wide. The study logo is displayed in the top-left corner of report output in DFexplore, in the data entry screen header of DFweb, in the studies list displayed during login, and in several DFadmin dialogs. If no study logo is available, the study name is written to the header of each report output. The study logo must be created outside of DFdiscover - there is no interface for creating it. Once the file is created, it must be copied to the study folder and saved as `lib/DFlogo.png`.

Usual Name	`DFlut_map`
Type	clear text
Created By	DFsetup
Used By	DFexplore, DFbatch
Field Delimiter	`\|`
Record Delimiter	`\n`
Comment Delimiter	NA
Fields/Record	2
Description	Lookup tables are used to select results from a list of pre-defined values and insert them into DFdiscover data fields. Lookup tables can also be defined for the query, query note, and reason fields to allow users to select pre-specified text for these fields. Lookup tables are described in Lookup tables. A study setup may use multiple lookup tables and so the lookup table map is used to associate simple table names with more complex and lengthy full pathnames of the actual lookup tables. Each record in the lookup table map has a lookup table name, followed by a `\|`, and a filename containing the lookup table. DFdiscover will search for the lookup table filename first in the `$(STUDY_DIR)/lut` directory and if that fails in the `/opt/dfdiscover/lut` directory. The table name is a symbolic name that can be referenced in edit checks. The special table names `QC`, `QCNOTE` and `QCREPLY` are used to associate a lookup table with the detail, note and reply fields, respectively, in the DFexplore Field > Add/Edit/Reply Query dialogs. The special table name `REASON` is used to associate a lookup table with the reason code and text for data changes.
Example	QC\|DF_QClut QCNOTE\|DF_QCnotelut QCREPLY\|DF_QCreplylut REASON\|DF_Reasonlut MEDS\|meds.dict COSTART\|costart.dict WHO\|who.dict

Usual Name	`DFpage_map`
Type	clear text
Created By	DFsetup
Used By	DF_QCreports, DFexplore
Field Delimiter	`\|`
Record Delimiter	`\n`
Comment Delimiter	NA
Fields/Record	3
Description	This file is optional for a study. If defined, it must be located in `$(STUDY_DIR)/lib/DFpage_map`. The page map allows one to specify non-default labels, to identify visits and CRF pages, on Query Reports and in DFexplore's subject binder view. If a page map file is not defined, queries and pages in the binder are identified by the visit number and plate number keys of the CRF. The first record in a page map contains only a single field which specifies a text label to be used as a title over the queries. The default label is: PLATE SEQNO This label appears at the top of the Fax/Refax List and Question & Answer List sections of the Query Reports. The remaining records describe the text labels to be used in place of the default plate and visit number identifiers. The first field is the plate number, the second field is the visit/sequence number, and the third field is the substitution to be used for that plate and visit/sequence combination. The third field can contain `%P` and `%S` which are replaced with the plate number and the visit/sequence number fields from the query, respectively. It may also contain parts of the plate number and/or sequence number fields by using the notation `%{n.P}`, `%{P.n}`, `%{n.S}`, or `%{S.n}`. `%{n.P}` which returns the leading n digits of the plate number while `%{P.n}` returns the trailing n digits of the plate number. Similarly, `%{n.S}` and `%{S.n}` produce n leading and trailing digits of the sequence number. The third field may also contain the notation `%#` which is replaced with the value stored in field `n` of the data record matching the specified plate and sequence number. Additionally, when using the `%#` notation, and for data fields that have a data type of choice or check, it is possible to request that the reported value be decoded by using the `%n:d` notation. This substitutes the label associated with the value, instead of the value itself. ^[a] When a Query Report is created, those queries whose plate and visit numbers match the first and second fields, will be identified on the Query Report by the label which appears in the third field. If either the first field or second field contains a ``, all values for that field that have not yet matched a previous rule will use the format in the third field. For more information, see Study Setup User Guide, Page Map*.
Example	PAGE (PLATE-SEQ) 018\|001\|MED VISIT1 \|001\|VISIT1 (%P-%S) 025\|\|TERM (%P-%S) \|\|(%P-%S) Note how the last rule catches all plate and visit/sequence pairs that were not previously matched.
^[a]To implement `%#` or `%n:d`, the data record must be available. This is always true for Query Reports. However, in DFexplore only the currently visible data record is available at any moment. The result is that in the subject binder, for any record other than the current record, use of `%#` or `%n:d` is substituted with `???`. This limits the usefulness of this notation in DFexplore.

Usual Name	`DFqcproblem_map`
Type	clear text
Created By	DFsetup
Used By	DFbatch, DFprintdb, DFimport.rpc, DFexport.rpc, DFcmpSchema, DF_QCupdate, DF_XXkeys, DF_stats, DFsas, DFexport
Field Delimiter	`\|`
Record Delimiter	`\n`
Comment Delimiter	NA
Fields/Record	4
Description	Each record in the Query category code map has four fields, the category code (1^st field), the descriptive label (2^nd field), the auto-resolve value (3^rd field), and the sort value (4^th field). The query category code appears verbatim in any data field which has been assigned a query with that code. The pre-defined categories are assigned integer codes from 1-6 and 21-23 (inclusive). User-defined categories must be assigned integer codes ranging from 30-99 (inclusive). Category codes must be unique. The problem labels must have a length ranging from 1-20 characters (inclusive), must be unique, and are case insensitive. The auto-resolve field may be set to No (0) or Yes (1). With auto-resolve set to Yes, an outstanding query with this category code will be automatically resolved if a new reason is added to the corresponding data value. The sort value may be set to any integer value between -2147483648 and 2147483647. Category types are sorted in ascending order by sort value, then by code. The query category code map lists all query category codes in the study. A query with one of these types can be entered into any data field by selecting the desired type from the category code list available under Field > Add Query... in DFexplore. When a new study is started, the file is created. By default, the file is populated with the following: 1\|Missing\|1\|0 2\|Illegal\|1\|0 3\|Inconsistent\|1\|0 4\|Illegible\|1\|0 5\|Fax Noise\|1\|0 6\|Other\|1\|0 21\|Missing Page\|0\|0 22\|Overdue Visit\|0\|0 23\|EC Missing Page\|0\|0
Example	1\|Missing\|1\|0 2\|Illegal\|1\|0 3\|Inconsistent\|1\|0 4\|Illegible\|1\|0 5\|Fax Noise\|1\|0 6\|Other\|1\|0 21\|Missing Page\|0\|0 22\|Overdue Visit\|0\|0 23\|EC Missing Page\|0\|0 30\|Clinical QC\|1\|1 50\|Needs Review\|0\|1 37\|Refuted\|1\|2

Usual Name	`.DFreports.dat`
Type	clear text
Created By	DFexplore
Used By	DFexplore
Field Delimiter	`\|`
Record Delimiter	`\n`
Comment Delimiter	`#`
Fields/Record	9+
Description	Users create and save report history lists (including options) in much the same way that they create and save task lists. The reports history list for ail study users is stored in this file in the study `lib` directory. Each history list is represented by a single record in this file. Each user may have more than one history list. History lists are created in DFreport by executing the desired reports and then saving the history list. There is no text-based interface to this file or its contents.

Usual Name	^[a]
Type	clear text
Created By	DFsetup
Used By	DFsetup
Field Delimiter	`\n`
Record Delimiter	`\n\n`
Comment Delimiter	NA
Fields/Record	2+
Description	This file is very much like the study schema, `DFschema` - database schema, but instead of variable definitions it simply catalogs all of the variable styles used in DFsetup to define new variables. Like `DFschema` the file is generated/updated automatically by DFsetup whenever a save is required for the study definition. Note that the style schema lists only field types that are defined by the style itself. It omits those definitions that are specified at the variable level.
Example	%S 254 %N 51 %I 1 %T int SimpleNumber %A optional %I 2 %T date SimpleDate 1940 0 %A optional %W 8 %F dd/mm/yy %I 3 %T string SimpleString %A optional %I 4 %T choice SimpleChoice %A optional %L $(choices) %l 0 %c 0
^[a]This file is present for users and programs that require backwards compatibility and are not able to read the JSON study definition. This file may be removed in a future release.

Code	Relevance	Meaning
S	Study	DFdiscover study number
a	Study	production study number
b	Study	development study number
B	Study	year in which study database was defined
e	Study	Run edit checks in View mode
N	Study	the total number of user-defined plates in the setup (this excludes plate 0 (new records), plate 510 (reasons for data change), and plate 511 (queries). In `DFschema.stl` file this represents total number of styles defined in the study
U	Study	DFdiscover version used to create setup
u	Study	DFdiscover version used on last setup change
Y	Study	reason is required for data change for specified fields (0=per field), for all fields (2=always), or for no field (1=never). These values are followed by the value for the 'Only if changing a non-blank value' setting; 0=no, 1=yes.
Z	Study	field description maximum length (25 or 40)
M	Study	number of new patients to be listed in patient binder
m	Study	enable Start New Subject dialog
P	Plate	plate number
n	Plate	total number of fields per plate; this includes the 6 DFdiscover default fields present on all records (i.e., DFstatus, DFcreate, DFmodify, etc.)
p	Plate	plate label
E	Plate	is the sequence number predefined (code `1`) or is it the first data field (code `2`)?
R	Plate	plate arrival trigger. This tag identifies an executable shell script or program, located in the study ecbin or DFdiscover ecbin directories, which is executed on plate arrival.
t	Plate	does plate trigger early termination; "simple" = plate does not trigger early termination, "term" = plate triggers early termination
I	Field/Style	the field order number or the style index in the `DFschema.stl` file
i	Field	the number that uniquely identify fields within a study
V	Field/Style	alias
v	Field/Style	name
D	Field/Style	description
g	Field/Style	the minimum validation level after which any changes to the data value will require a reason. This code is optional, or may be present and have the value 0, in which case a reason for a data change is never required. If a minimum validation code between 1-7 is present, it will be followed by the value for the 'Only if changing a non-blank value' setting; 0=no, 1=yes.
h	Field/Style	field visibility
L	Field/Style	legal values; where `$(ids)` has been used as a legal range definition for patient id variables, the literal `$(ids)` will be reported.
A	Field/Style	field optionality
F	Field/Style	format
H	Field/Style	help message
W	Field/Style	maximum number of stored characters
w	Field/Style	maximum number of displayed characters
J	Field/Style	edit check(s) on field entry
K	Field/Style	edit check(s) on field exit
j	Field/Style	edit check(s) on plate entry
k	Field/Style	edit check(s) on plate exit
c	Field/Style	code value and label definition for no choice
C	Field/Style	code value and label definition
s	Field/Style	number of fields to skip and condition
T	Field/Style	a compound value containing, in order: data type, one of string, int, date, choice, check, time style name (which may contain spaces) for dates only, the cutoff year for dates only, the imputation method for dates only, one of NonSched or VisitDate
r	Field/Style	field's module name, instance number and description.
X	Field/Style	is field constant (0=no, 1=yes) and constant value (if constant)
y	Study/Plate/Field/Style	value for a custom property (tag)
z	Study/Plate/Field/Style	tag for a custom property, useful to replace a standard name

Usual Name	`DFserver.cf`
Type	clear text
Created By	DFadmin
Used By	DFserver.rpc
Field Delimiter	`=`
Record Delimiter	`\n`
Comment Delimiter	`#`
Fields/Record	2
Description	Each study database server is configured by this file. The study server configuration keywords and their meanings are given in the Table 2.43, “`DFserver.cf` configuration keywords”.
Example	STUDY_NUMBER=254 STUDY_NAME=DFdiscover Acceptance Test Study STUDY_DIR=/opt/studies/val254 PAGE_DIR=$(STUDY_DIR)/pages WORKING_DIR=$(STUDY_DIR)/work PRINTER=hp4000 THRESHOLD=500000 STUDY_URL=http://www.ourstudy.com/index.html LOCAL_CACHE=1 VERSION_STRICT=0 AUTO_LOGOUT=5\|30 ADD_IDS=1\|\| VERIFY_IDS=0\|