S/CDISC/Library Resource
CDISC stands for Clinical Data Interchange Standards Consortium. As the name indicates, the organization engages with the clinical data community to create and maintain data standards for clinical development, so that information and records can be planned, created, stored, used, and exchanged with greater consistency [1].
The stdcdisc module includes basic functions for processing data already prepared according to CDISC standards.
Please note that the code on this page is not executed when the documentation is built. Functions and options may be updated as needed. Please visit the API reference page for the latest information on this module.
CDISC Foundational Standards
CDISC standards are used throughout the medicinal product development process, including nonclinical and clinical data collection, processing, reporting, analysis, and more. CDISC foundational standards include:
Data collection:
Clinical Data Acquisition Standards Harmonization (CDASH),
Clinical Data Acquisition Standards Harmonization Implementation Guide (CDASHIG),
Data Tabulation:
Standard for Exchange of Nonclinical Data (SEND),
Standard for Exchange of Nonclinical Data Implementation Guide (SENDIG),
Study Data Tabulation Model (SDTM),
Study Data Tabulation Model Implementation Guide (SDTMIG),
Analysis Data Model (ADaM),
Questionnaires, Ratings and Scales (QRS), etc.
Select CDISC Version
The CDISC standards SEND, SDTM, and ADaM are required for electronic study data submitted to CDER and CBER at the FDA.
A full list of FDA-supported versions of CDISC models and IGs is available online [2].
Please note that whether the FDA supports a given data model standard or IG version is determined by
comparing the “study starting date” with the “Date Support Begins” and “Date Support Ends” entries in the data standards catalog [3].
Please see the FDA technical conformance guide and the study data standards catalog for more details [4].
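The date comparison described above can be sketched in a few lines of Python. This is only an illustration: the catalog rows below are hypothetical placeholders (not taken from the FDA catalog), the function name `is_supported` is made up for this example, and treating both boundary dates as inclusive and a missing end date as "still supported" are assumptions that should be checked against the catalog itself.

```python
from datetime import date
from typing import Optional


def is_supported(study_start: date,
                 support_begins: date,
                 support_ends: Optional[date]) -> bool:
    """Return True if study_start falls inside the support window.

    Assumptions: both boundaries are inclusive, and a missing end date
    (None) means support has not ended.
    """
    if study_start < support_begins:
        return False
    return support_ends is None or study_start <= support_ends


# Hypothetical catalog rows for illustration only, NOT real FDA entries.
catalog = [
    {"standard": "Standard X", "version": "1.0",
     "begins": date(2015, 1, 1), "ends": date(2020, 12, 31)},
    {"standard": "Standard X", "version": "2.0",
     "begins": date(2019, 1, 1), "ends": None},
]

study_start = date(2021, 6, 1)
supported = [row["version"] for row in catalog
             if is_supported(study_start, row["begins"], row["ends"])]
print(supported)  # ['2.0']
```

With these placeholder rows, a study starting on 2021-06-01 falls after the end of support for version 1.0, so only version 2.0 remains.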
Resources
The development of data standards is necessary for using data efficiently. While the standards are continuously improving and evolving, implementing them takes considerable time and effort.
Some must-have documents to start with include:
CDISC model main documents
CDISC model IGs
CDISC model conformance rules
regulatory agencies' technical conformance guides
regulatory agencies' validation rules (conformance rules and business rules)
community white papers
Access CDISC Library Resource
Steps:
To access the CDISC Library and its documents, follow the steps here to create a cdiscID: https://www.cdisc.org/cdiscid-sign-insign-instructions
Save the key to a file, for example “/user/home/name/cdisc.txt”.
Use the code below to initialize access:
from mtbp3.stdcdisc.lib import accessLib
clib = accessLib("/user/home/name/cdisc.txt")
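Before passing the path to accessLib, it can help to confirm that the key file exists and is not empty. The sketch below is independent of mtbp3: `read_key` is a hypothetical helper, and it assumes the file holds nothing but the key on a single line. How accessLib itself reads the file is not shown here. The demo uses a temporary file with a dummy value rather than a real key.

```python
import tempfile
from pathlib import Path


def read_key(path: str) -> str:
    """Read a key from a text file and fail early with a clear message.

    Hypothetical helper; assumes the file contains only the key.
    """
    key_file = Path(path)
    if not key_file.is_file():
        raise FileNotFoundError(f"Key file not found: {path}")
    key = key_file.read_text(encoding="utf-8").strip()
    if not key:
        raise ValueError(f"Key file is empty: {path}")
    return key


# Demo with a temporary file and a dummy value (not a real key).
with tempfile.TemporaryDirectory() as tmp:
    demo_path = Path(tmp) / "cdisc.txt"
    demo_path.write_text("not-a-real-key\n", encoding="utf-8")
    demo_key = read_key(str(demo_path))

print("key file looks usable")
```

On a shared machine it is also worth restricting the file's permissions so that other users cannot read the key.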
Get CT Packages
CDISC Controlled Terminology (CT) is maintained and distributed as part of the National Cancer Institute (NCI) Thesaurus [5]. CT can also be downloaded from the NCI FTP site in multiple file formats (Excel, text, odm.xml, pdf, html, and OWL/RDF) [6].
To get a list of the newest CT packages available in the CDISC Library:
clib.get_ct_list()
print(clib.ct_list_titles)
Output:
['ADaM', 'CDASH', 'COA', 'DDF', 'Define-XML', 'Glossary', 'MRCT', 'Protocol', 'QRS', 'QS-FT', 'SDTM', 'SEND', 'TMF']
Some notes about these titles:
DDF: digital data flow.
MRCT: multi-regional clinical trials
COA: clinical outcome assessment. This is an archived version. COA is part of QRS supplement supporting FDA COA qualification program [7].
QRS: questionnaires, ratings and scales. This is an archived version. QRS SDTM CT is currently included in SDTM CT.
QS-FT: questionnaire and functional test. This is an archived version. This set was merged into COA [8].
To show package effective date:
print(clib.ct_list[['Title','PkgSeries','Effective']])
Output:
         Title  PkgSeries   Effective
0         ADaM         58  2024-09-27
1        CDASH         58  2024-09-27
2          COA         21  2015-03-27
3          DDF         58  2024-09-27
4   Define-XML         58  2024-09-27
5     Glossary         58  2024-09-27
6         MRCT         58  2024-09-27
7     Protocol         58  2024-09-27
8          QRS         23  2015-09-25
9        QS-FT         19  2014-09-26
10        SDTM         58  2024-09-27
11        SEND         58  2024-09-27
12         TMF         58  2024-09-27
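The archived packages noted earlier stand out in this table because their effective dates are older than the rest. As a rough sketch, the rows above can be copied into plain Python tuples and split by whether they carry the most recent effective date. This is a heuristic for this particular listing, not a rule defined by CDISC, and it does not call mtbp3.

```python
from datetime import date

# Rows copied from the output above: (Title, PkgSeries, Effective).
packages = [
    ("ADaM", 58, "2024-09-27"),
    ("CDASH", 58, "2024-09-27"),
    ("COA", 21, "2015-03-27"),
    ("DDF", 58, "2024-09-27"),
    ("Define-XML", 58, "2024-09-27"),
    ("Glossary", 58, "2024-09-27"),
    ("MRCT", 58, "2024-09-27"),
    ("Protocol", 58, "2024-09-27"),
    ("QRS", 23, "2015-09-25"),
    ("QS-FT", 19, "2014-09-26"),
    ("SDTM", 58, "2024-09-27"),
    ("SEND", 58, "2024-09-27"),
    ("TMF", 58, "2024-09-27"),
]

# Most recent effective date across all packages.
latest = max(date.fromisoformat(eff) for _, _, eff in packages)

# Heuristic: packages not on the latest effective date are treated as older.
older = [title for title, _, eff in packages
         if date.fromisoformat(eff) < latest]
print(latest.isoformat())  # 2024-09-27
print(older)               # ['COA', 'QRS', 'QS-FT']
```

The three titles it flags are the same ones described as archived in the notes above.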
To get the newest Protocol CT package information:
import json
clib.get_ct_package('Protocol')
print(json.dumps(clib.ct_package['Protocol']['package_info'], indent=2))
Output:
{
"description": "CDISC Controlled Terminology for Protocol is the set of CDISC-developed or CDISC-adopted standard expressions (values) used with data items within CDISC-defined PROTOCOL datasets.",
"effectiveDate": "2024-09-27",
"label": "Protocol Controlled Terminology Package 58 Effective 2024-09-27",
"name": "Protocol CT 2024-09-27",
"source": "Protocol Controlled Terminology developed by the CDISC Terminology Team in collaboration with the National Cancer Institute's Enterprise Vocabulary Services (EVS)",
"registrationStatus": "Final",
"version": "2024-09-27"
}
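Fields in this package information can be turned into typed values for later checks. The sketch below works on a subset of the JSON shown above, pasted in as a string, so it does not call mtbp3. The regular expression assumes the label always contains the text "Package <number>", which holds for this example but is not a documented guarantee.

```python
import json
import re
from datetime import date

# Subset of the package_info output shown above.
raw = """
{
  "effectiveDate": "2024-09-27",
  "label": "Protocol Controlled Terminology Package 58 Effective 2024-09-27",
  "name": "Protocol CT 2024-09-27",
  "registrationStatus": "Final",
  "version": "2024-09-27"
}
"""

info = json.loads(raw)

# Parse the ISO date string into a date object.
effective = date.fromisoformat(info["effectiveDate"])

# Assumption: the label contains "Package <number>".
match = re.search(r"Package (\d+)", info["label"])
package_number = int(match.group(1)) if match else None

is_final = info["registrationStatus"] == "Final"
print(effective, package_number, is_final)  # 2024-09-27 58 True
```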
To get the newest Protocol CT package in DataFrame format:
title = 'Protocol'
clib.get_ct_package(title)
clib.get_ct_codelists_df(title)
print(clib.ct_package[title]['ct_df'][['conceptId', 'name', 'preferredTerm']].head())
Output:
conceptId name preferredTerm
0 C179587 Biological Sample Attribute Terminology CDISC Protocol Biological Sample Attribute Ter...
1 C201266 Biomarker Attribute Terminology CDISC Protocol Biomarker Attribute Terminology
2 C201267 Biomarker Category Value Set Terminology CDISC Protocol Biomarker Category Value Set Te...
3 C142191 Clinical Study Attribute Terminology CDISC Protocol Entities Clinical Study Attribu...
4 C139020 Clinical Trial Attribute Terminology CDISC Protocol Entities Clinical Trial Attribu...
To get the newest Glossary CT package in DataFrame format:
title = 'Glossary'
clib.get_ct_package(title)
clib.get_ct_codelists_df(title)
print(clib.ct_package[title]['ct_df'][['conceptId', 'name', 'group', 'preferredTerm', 'definition']].head())
Output:
conceptId name group preferredTerm definition
0 C67497 CDISC Glossary CDISC Glossary Terminology The terminology of the Clinical Data Interchan...
1 C80442 CDISC Glossary Premarket Device Notification 510(k). Premarket Notification (PMN) required ...
2 C42610 CDISC Glossary Abbreviation A set of letters that are drawn from a word or...
3 C71733 CDISC Glossary Biological Absorption The process by which medications reach the blo...
4 C156638 CDISC Glossary Accelerated Approval Regulatory mechanism by which new drugs meant ...