To see the full schema and try examples yourself, check out our API documentation.
Our API returns a document_classification field which indicates the most likely classification of the document. The possible values are HUMAN_ONLY, MIXED, and AI_ONLY. We also provide a probability for each classification, which is returned in the class_probabilities field. The keys for this field are human, ai or mixed. To get the probability for the most likely classification, the predicted_class field can be used. The class probability corresponding to the predicted class can be interpreted as the chance that the detector is correct in its classification. I.e. 90% means that 90% of the time on similar documents our detector is correct in the prediction it makes. Lastly, each prediction comes with a confidence_category field, which can be high, medium, or low. Confidence categories are tuned such that when the confidence_categoryfield is high 99.1% of human articles are classified as human, and 98.4% of AI articles are classified as AI.
Additionally, we highlight sentences that been detected to be written by AI. API users can access this highlighting through the highlight_sentence_for_ai field. The sentence-level classification should not be solely used to indicate that an essay contains AI (such as ChatGPT plagiarism). Rather, when a document gets a MIXED or AI_ONLY classification, the highlighted sentence will indicate where in the document we believe this occurred.