Assessment of Caelifera (Orthoptera) Representation in BOLD Systems: A DNA Barcode Gap Analysis with Emphasis on India
Dinesh Udapi
Department of Studies in Zoology, Chronobiology Laboratory, University of Mysore, Manasagangothri, Mysuru–570006, Karnataka, India.
H. Channaveerappa
L 126, Sankranthi, Hebbal, Mysuru–570017, Karnataka, India.
Shakunthala Venkat *
Department of Studies in Zoology, Chronobiology Laboratory, University of Mysore, Manasagangothri, Mysuru–570006, Karnataka, India.
*Author to whom correspondence should be addressed.
Abstract
Background: DNA barcoding has become an important tool in biodiversity informatics, supporting species identification, taxonomic assessment, and the evaluation of reference-library coverage through standardised genetic markers. The mitochondrial cytochrome c oxidase subunit I (COI) gene is widely used as the principal barcode region for animal taxa. However, barcode representation remains uneven across taxonomic groups and geographic regions.
Aims: This study assessed the representation of Caelifera (Orthoptera) in BOLD Systems, with particular emphasis on India, and identified taxonomic, geographic, and metadata gaps in barcode coverage.
Study Design: A database-based barcode gap analysis was conducted using publicly available DNA barcode records.
Place and Duration of Study: Data were retrieved from the BOLD Systems V4 public data portal and the NCBI GenBank Nucleotide database during 14-15 May 2026.
Methodology: Family-level taxonomic searches were conducted for ten Caelifera families. India-specific records were retrieved using geographic filters, and the India-focused dataset DS-ININD20 was additionally queried. Downloaded TSV files were analysed using Python 3.13 and the pandas library.
Results: Globally, 46,136 Caelifera specimens were retrieved, representing 1,714 species and 2,354 BINs across nine families. Acrididae accounted for 84.3% of global records. India contributed 89 specimens, representing 0.19% of global records, and 40 species across four families. Within Acrididae, 33 of approximately 285 known Indian species were represented, indicating an estimated 88.4% barcoding deficit. Six of ten queried families had no Indian records, and 89.9% of Indian Caelifera records lacked province- or state-level metadata.
Conclusion: The results indicate substantial quantitative, taxonomic, and metadata gaps in Indian Caelifera barcode representation. Targeted, taxonomically verified national barcoding efforts are needed to improve reference-library completeness and support biodiversity documentation and applied identification.
Keywords: DNA barcoding, Caelifera, Acrididae, Orthoptera, BOLD Systems, Barcode Index Number, COI-5P, India, barcode gap analysis, biodiversity informatics, metadata completeness