I would love a bit more information on how the measures in the output of this analysis are derived and how one should interpret them. Is there a source that I have maybe overlooked?
Is there a cut off where the average overlap/extent ratio might be said to indicate good or very good inter-rater reliability in terms of segmentation? e.g., is the below considered high?
Average overlap/extent ratio: 0.8182
Overall average overlap/extent ratio: 0.8182
Thanks for your help,