Filename conventions

  • SEX_database_model_subset_attrib0_attrib1_...

    • database: TRUVEN, UCM or any other
      all caps
    • model: ZBL, NN, CNN etc
      no underscores
    • subset:
      use BASIC for no subset
  • Tabular files larger than 200MB have the extension '.dat'
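
The filename pattern above could be assembled with a small helper such as the sketch below. The function name build_stem, its parameters, and the sex code "F" in the usage line are hypothetical; the conventions do not list the allowed sex values, and no file extension is appended because only the '.dat' case is specified.

```python
def build_stem(sex, database, model, subset="BASIC", attribs=()):
    """Join the filename parts with underscores (hypothetical helper).

    Enforces only the rules stated in the conventions:
    database is all caps, model has no underscores,
    and subset defaults to BASIC when there is no subset.
    """
    if not database.isupper():
        raise ValueError(f"database must be all caps: {database!r}")
    if "_" in model:
        raise ValueError(f"model must not contain underscores: {model!r}")
    return "_".join([sex, database, model, subset, *attribs])


# "F" is an assumed sex code, used here for illustration only.
stem = build_stem("F", "TRUVEN", "ZBL")
```

Rejecting underscores inside the model name keeps the stem unambiguous when it is split back into its parts on "_".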

Column-name conventions

  • No capitalization
  • varnames: auc, poslr, neglr, fpr, tpr, sensitivity, specificity
  • zbl_risk is the column name of our estimated risk
  • Other column names:
    • age
      not age_at_screening
  • vartype_attrib
    • auc_pfsa (correct)
    • pvalue_final_cb (correct)
    • neglr (correct)
      NOT neg_LR or LR_neg etc
  • varname_cb is the confidence interval for varname
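
A minimal sketch of how the column-name rules above might be checked before a table is written. The names check_columns and PREFERRED are hypothetical, and only the rules stated in this section are covered (no capitalization, age instead of age_at_screening).

```python
# Disallowed names mapped to the preferred name (taken from the list above).
PREFERRED = {"age_at_screening": "age"}


def check_columns(names):
    """Return a list of problems found in the given column names."""
    problems = []
    for name in names:
        if name != name.lower():
            problems.append(f"{name}: contains capital letters")
        elif name in PREFERRED:
            problems.append(f"{name}: use {PREFERRED[name]}")
    return problems


ok = check_columns(["auc_pfsa", "pvalue_final_cb", "neglr", "zbl_risk", "age"])
bad = check_columns(["neg_LR", "age_at_screening"])
```

Here ok is an empty list, while bad reports the capital letters in neg_LR and points age_at_screening to age.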

Output files

  • AUC stats
  • Patient_id
  • importance files
  • spectrum data