ORFanBase is a comprehensive database of orphan genes and taxonomically restricted genes (TRGs) for investigators to explore the properties of these exceptional coding sequences in a comparative fashion, not limited to a handful of study groups, but globally across all domains of life and even viruses.

Sequences with limited taxonomic distribution which are also functionally essential to their species, represent a pattern wholly unexplored. We hope that through ORFanBase researchers would be able to find a rigorous analytical mathematical model for essential genes and proteins, so that we can estimate their probability of changing under known evolutionary processes.

ORFanBase Input Pipeline
ORFan gene of the Month

Chromosome 17

TP53 (Tumor Protein P53) is a Protein Coding gene. Diseases associated with TP53 include Li-Fraumeni Syndrome and Osteogenic Sarcoma. Among its related pathways are PI3K/AKT activation and Cell Cycle, Mitotic. Gene Ontology (GO) annotations related to this gene include DNA-binding transcription factor activity and protein heterodimerization activity. An important paralog of this gene is TP73.
p53 gene
Top ten Genomes mined
Genomes Percentage of Orphans
Features & Highlights

The ORFan Mine is a pipeline that finds the all ORFan genes for a given Genome. The non redundant database of NCBI is mined using a proprietary algorithm to curate all the ORFans genes of the given species.


ORFanID is a web-based software engine that identifies ORFan genes from the genomes of specified species or from a given list of DNA sequences. The scope of the search for orphan genes can be defined by the selection of the taxonomy level of interest. Detectable homologous sequences are found for candidate gene in the NCBI databases. From these findings the ORFanID engine identifies and depicts orphan genes. Results may be viewed and analyzed graphically for the purpose of scientific research and inquiry