In the free-response ROC (FROC) paradigm (Barnes, Sones, and Tesic 1985) the observer TBA. FROC data consists of mark-rating pairs, where a mark a location that was considered suspicious for presence of a localized lesion and the rating is the confidence level. By adopting a proximity criterion, each mark is classified by the investigator as a lesion localization (LL) - if it is close to a real lesion - or a non-lesion localization (NL) otherwise.
The observer gives a single rating to each ROI, denoted \({{R}_{k}}\). The rating can be an integer or quasi- continuous (e.g., 0 – 100), or a floating point value, as long as higher numbers represent greater confidence in presence of one or more lesions in the ROI.
ROI-level-normal ratings are stored in the NL
field and ROI-level-abnormal ratings are stored in the LL
field.
One can think of the ROI paradimg as similar to the FROC paradigm, but with localization accuracy restricted to belonging to a region (one cannot distinguish muttiple lesions within a region). Unlike the FROC paradigm, a rating is required for every ROI.
An example simulated ROI dataset is included as dataset04
.
str(dataset04)
#> List of 8
#> $ NL : num [1:5, 1:4, 1:200, 1:7] -Inf -Inf 1 -Inf -Inf ...
#> $ LL : num [1:5, 1:4, 1:100, 1:3] 4 5 4 5 4 3 5 4 4 3 ...
#> $ lesionNum : int [1:100] 1 1 1 1 1 1 1 1 1 1 ...
#> $ lesionID : num [1:100, 1:3] 1 1 1 1 1 1 1 1 1 1 ...
#> $ lesionWeight: num [1:100, 1:3] 1 1 1 1 1 1 1 1 1 1 ...
#> $ dataType : chr "FROC"
#> $ modalityID : Named chr [1:5] "1" "2" "3" "4" ...
#> ..- attr(*, "names")= chr [1:5] "1" "2" "3" "4" ...
#> $ readerID : Named chr [1:4] "1" "3" "4" "5"
#> ..- attr(*, "names")= chr [1:4] "1" "3" "4" "5"
dataset04$NL[1,1,1,]
#> [1] -Inf -Inf -Inf -Inf -Inf -Inf -Inf
mean(dataset04$NL[,,1:50,])
#> [1] -Inf
dataset04$NL[1,1,51,]
#> [1] -Inf -Inf -Inf -Inf -Inf -Inf -Inf
dataset04$lesionNum[1]
#> [1] 1
dataset04$LL[1,1,1,]
#> [1] 4 -Inf -Inf
x <- dataset04$LL;mean(x[is.finite(x)])
#> [1] 3.678532
Examination of the output reveals that:
This is a 5-treatment 4-reader dataset, with 100 non-diseased cases and 100 diseased cases.
For treatment 1, reader 1, case 1 (the 1st non-diseased case) the 4 ratings are -, -, -, -, -, -, -. The mean of all ratings on non-diseased cases is -.
$lesionNum
field is 1.The ratings of the 2 ROI-level-abnormal ROIs on this case are 4. The mean rating over all ROI-level-abnormal ROIs is 3.6785323.
An Excel file in JAFROC format containing simulated ROI data corresponding to dataset04
, is included with the distribution. The first command (below) finds the location of the file and the second command reads it and saves it to a dataset object ds
.
fileName <- system.file(
"extdata", "includedFrocData.xlsx", package = "RJafroc", mustWork = TRUE)
ds <- DfReadDataFile(fileName)
ds$dataType
#> [1] "FROC"
The DfReadDataFile
function automatically recognizes that this is an ROI dataset. Its structure is similar to the JAFROC format Excel file, with some important differences, noted below. It contains three worksheets:
Truth
worksheet - this indicates which cases are diseased and which are non-diseased and the number of ROI-level-abnormal region on each case.
CaseID
and 40 abnormal cases (labeled 51-90).LesionID
field for each normal case (e.g., CaseID
= 1) is zero and there is one row per case. For abnormal cases, this field has a variable number of entries, ranging from 1 to 4. As an example, there are two rows for CaseID
= 51 in the Excel file: one with LesionID
= 2 and one with LesionID
= 3.Weights
field is always zero (this field is not used in ROI analysis).
FP
(or NL
) worksheet - this lists the ratings of ROI-level-normal regions.
ReaderID
= 1, ModalityID
= 1 and CaseID
= 1 there are 4 rows, corresponding to the 4 ROI-level-normal regions in this case. The corresponding ratings are . The pattern repeats for other treatments and readers, but the rating are, of course, different.CaseID
is represented in the FP
worksheet (a rare exception could occur if a case-level abnormal case has 4 abnormal regions).TP
(or LL
) worksheet - this lists the ratings of ROI-level-abnormal regions.
CaseID
= 1-50 in the TP
worksheet.CaseID
in the TP
worksheet is 51, which corresponds to the first abnormal case.CaseID
in the Truth
worksheet there were two entries with LesionID
= 2 and 3. These must match the LesionID
’s listed for this case in the TP
worksheet. Complementing these two entries, in the FP
worksheet for CaseID
= 51, there are 2 entries corresponding to the two ROI-level-normal regions in this case.TP
and FP
worksheets is always 4.Barnes, Gary T., Richard A. Sones, and Mike M. Tesic. 1985. “Digital Chest Radiography: Performance Evaluation of a Prototype Unit.” Journal Article. Radiology 154: 801–6.