Data from: Identification and classification of ion-channels across the tree of life: Insights into understudied CALHM channels
Data files
May 09, 2026 version files 35.60 MB
-
human_ion_channels_orthologs.full_length.fasta
35.02 MB
-
human_ion_channels.domains.fasta
190.54 KB
-
human_ion_channels.full_length.fasta
381.69 KB
-
README.md
1.23 KB
Abstract
The ion channel (IC) genes encoded in the human genome play fundamental roles in cellular functions and disease and are one of the largest classes of druggable proteins. However, limited knowledge of the diverse molecular and cellular functions carried out by ICs presents a major bottleneck in developing selective chemical probes for modulating their functions in disease states. The wealth of sequence data available on ICs from diverse organisms provides a valuable source of untapped information for illuminating the unique modes of channel regulation and functional specialization. However, the extensive diversification of IC sequences and the lack of a unified resource present a challenge in effectively using existing data for IC research. Here, we perform integrative mining of available sequence, structure, and functional data on 419 human ICs across disparate sources, including extensive literature mining by leveraging advances in large language models to annotate and curate the full complement of the “channelome”. We employ a well-established orthology inference approach to identify and extend the IC orthologs across diverse organisms to above 48,000. We show that the depth of conservation and taxonomic representation of IC sequences can further be translated to functional similarities by clustering them into functionally relevant groups, which can be used for downstream functional prediction on understudied members. We demonstrate this by delineating co-conserved patterns characteristic of the understudied family of the Calcium Homeostasis Modulator (CALHM) family of ICs. Through mutational analysis of co-conserved residues altered in human diseases and electrophysiological studies, we show that evolutionary-constrained residues significantly alter gating functions. Thus, by providing new tools and resources for performing large comparative analyses on ICs, this study addresses the unique needs of the IC community and provides the groundwork for accelerating the functional characterization of dark channels for therapeutic intervention.
Dataset DOI: 10.5061/dryad.4xgxd25pc
Description of the data and file structure
This dataset includes the curated human ion channel sequences, both the full length and the pore domain sequences, along with a file with fasta sequences for all their identified orthologs. Please refer to the study: https://doi.org/10.7554/eLife.106134.1 for further details.
Files and variables
File: human_ion_channels.domains.fasta
Description: The pore domain sequences for the 343 pore-domain containing human ion channel sequences.
File: human_ion_channels.full_length.fasta
Description: The full-length sequences for the 419 human ion channel sequences.
File: human_ion_channels_orthologs.full_length.fasta
Description: Sequences for all the identified orthologs for the pore-containing human ion channel sequences.
Code/software
A custom annotation pipeline was used to mine, annotate, and curate the presented list of human ion channels. Orthologs were detected using the KinOrtho pipeline.
