Skip to main content
Dryad

TMC-Tongue: A standardized tongue image dataset with pathological annotations for AI-assisted TCM diagnosis

Data files

Jan 06, 2026 version files 2.34 GB

Click names to download individual files

Abstract

This dataset contains 21 disease categories that can be used for target detection in tongue diagnosis. The categories are jiankangshe (Healthy Tongue), botaishe (Tongue with peeling coating), hongshe (Red tongue), zishe (Purple tongue), pangdashe (Chubby tongue), shoushe (Thin tongue), hongdianshe (Red dot tongue), liewenshe (Cracked tongue), chihenshe (Dentate tongue), baitaishe (White coating tongue), huangtaishe (Yellow coating tongue), heitaishe (Black coating tongue), huataishe (Smooth coating tongue), shenquao (renal depression), shenqutu (renal protrusion), gandanao (Hepatobiliary depression), gandantu (Hepatobiliary protrusion), piweiao (spleen and stomach depression), xinfeitu (heart and lung protrusion), xinfeiao (heart and lung depression), corresponding numerical order is 0-19.

Among them, there are 5594 images in the training set, 572 images in the validation set, and 553 images in the test set. Contains three annotation formats: coco/. txt/. XML, which can be used for experiments using relevant object detection algorithms through configuration files.