Bhi07G000020 (gene) Wax gourd

Name: Bhi07G000020
Type: gene
Organism: Benincasa hispida (Wax gourd)
Description: THUMP domain-containing protein 1
Location: chr7 : 1417763 .. 1428152 (+)
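The location string above packs a sequence ID, a start, an end, and a strand into one field. The following is a minimal sketch of how it could be split and how the genomic span might be computed. The function names are hypothetical, and the 1-based inclusive coordinate convention is an assumption rather than something stated on this page.

```python
import re

def parse_location(text):
    """Split a string like 'chr7 : 1417763 .. 1428152 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

seqid, start, end, strand = parse_location("chr7 : 1417763 .. 1428152 (+)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span covers 10390 bases on the forward strand of chr7.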
The following sequences are available for this feature:

Gene sequence (with intron)

CCAACATAATTTTTTTTGTTACCATTCTTTAATAGAAACTTTCTTGATATTTTTACAAATAAAATTTTAAAATTTGCTATTCGTCAAGCCCACCCTTTGAAAATTGGAGCAAATAGCTAAAGTTGGTCCGGACATTCCCTGAAAAATGTTTTCGGCCTGTTGGGCTTCGGACTTGAGCGTTGAGCCTGAGTTTGGACCTGCCATGGGCCAGAGCCGCTCAGTGATGAGACTGCCGGATTACCGTCGGTTATCGCGGCCGTAGGTTTTGTCATCGGAAGATCATCCGCCAAATCAAAACTACACACAGCTAAATGGCAGAGAAAGAAGAACAGATCAATGGCTGCCACCTCAAACCTGTAGCAGAAGCCTCAAAACCGACAGAGAATGTCCCCGAAACCGAATGGAAAACGATGACGCCATGGGAGCAGCACTCTGCTGTCATAAGCATCCCTCGGTTCGATTACAATGCACCGTCTGCGCTTCTTCAACGTTGCCAGTCTGGATTCCTCATTACATGCAGTATCAGTGAGAATCTCTTCTACAATTGTTTGCAGTCTATTTTTTAGTACCGTATTTTTCTTATTCGGTTCTTTCACTGTGCTTTGCAATCGATTGGTCGCTATTTTGATTACCGTCCAAAGAAAAAAGATGCTCGTAAGTTGGTATTTTTGGTTACCGTAGTGCGAATTAACTTCAACTTTCAGGCTTTGTTTACATGGTCCTTTTTATGGGAAAGATTTTAAGGCCCTGAACACTTGCTCTTCCAATAGTGTTCTTTCATTTGGGCTGTGAACTTTCCACTTCATTGGGATGGTTGCCTGGCTAATTTTGTAATCTTTAGAATCTGCCCTTTTGTGATACGGGCCGAATTTTATGGCAGTCTGATCTTTTTTTTAGTAATATATGGTTTCTGATTGGAGAGGAATAATAGAGATTTTAAATATCTTGGTAGTTCTTCGGAGTATGCCCTTTCATTTTCCATTTTTATTTTATTTTTGTATTTTGGTGTGTTTTTCCTCTCCCCTCCTCTTTTAGATGTTCATATCTGAATGATGCCTTTCAAGTCAATTGAGTTAGCATGGAATTTCTTACTTCAAAATAATAAAGAGTGAGAAAATGGACTATAATTCTTTCCTTTGTTTTGTCAGAGAGGGAGAAGAGTGCCACAAAAGAAGCTATCTCCATCCTTGAAAAGGTTATGTTTTTGTACTTTGATTGAATTTGCTACACGTCATGCATTTTAGGCTTCATGGCTTTATGTCTTTATCTAATGAGACTTTCTGAAGTTTAATTGGTGAACTACCGAGTTGAATCTAGGGTGTTTGGAGTACTAATTCTGGCCATTTTTATTTGATCTGGATCCGCTATAATTGCCCTGGAAATGGCCTGAGACTTTATACTTGAATTTAGCAGTCTTAGCACGAGAAACAATTGCTGAAAATTTGATTAAAATTGGGTTTATATATTTTCTATTGCTTGCTTGAGAGCAGTTACAATAGTTATATAAATCATTAGATCTAACACAACCATGAAAGGACTTAGAAATCTAATTAGTTTTTAGAATCCTAGTCACTCTAATATCCTGAAATAATAATAATAATTATTACATAAAATTTTATGACTGAATTGCTTTTGATGTTCAACAAACCTACATATTGGGAACCTGCCGCATGGTCATTATCAAAAAGAGGAGATTTCCTCAACCACTTCTTGGGACAAGCCCATCCCACTTTGTTCAAAGTCCGAAAATCTCTTCTACCGAACTATACTTGCTAAAGGAAAGGCTGTCATAATCAACCTCTTCATCGAAGTCAATGGACGAGGCAACATGTTTCATGAAATTAAGGGTTGACATGGAACAAATCTGACCTTTTCTTAAGATAGTATTATGGTGAGGCCCCCAATCACGATTCAGTATACTTGGAGGGGCAAGAAGGGAAATTTGCCTACAGCAAGTTATTTAGGTGAGGGAGGAAATCACTATAGAATGTTGGGCTGAA
CGTTGGGTAGTATATATGTAGGCGTAGTACTGTTGATCGTCTTTATGCTTTTTGTTAGCTAAGAAGTTGAATGGGAGAGAGTTTAGCCCTCCTTAATCAATTGGGTATTGTTTTGTATTGTCTTGGGGCACCATCTTCTTCTATGTATCAATATAGTGAGGAGATACCTTACATATTGATATTAGAGTATGTTCATCGCACAACTTGAAAAAATGGCCTTACAACATTTAGATTAACGAATTGATTTGTTTGACCAGGAGATGATGAGCATGTAAGGCCCCAAGTGATAAAAACATATGTGCAAAGGGCAAAAAGGGAAAAGAGGTAGAAAATATTGATATCAGAGTATGTTCATCACACAACTTGAAAAAATGGCCTTACAACATTTAGATTAACGAATGGATTTGTTGGACCAGGAGATGATGAGCATGTAAGGCCCCAAGTGATAAAAACCTATGTGCGAAGCGCAAAAAGGGAAAAGAGGTACAAAATCAGTTAGGGGATCTTTCTGATATTCTCTACCAGCTCGTTCATAGCCTGGTCCAATTTGGGCAACCTCTGGACCTCTTCACGTACCACATCAATGTACGTTCTAACCCTTCAATGCATTTTTTCGAGTCGAATTGTCCCATTTTCTTAGATTTTCCCAGAATACCACTCTGATACTAATGTAAGGACTCCTCCAAGAGAGTCACAATATATTGATAAGATCACAAAATACAAAGTTCACACCAGCTTGAGAGGATGGAGCTTTCTCGTTATGAGATTATCACCAGCTAAATACGAAACATAACATAACCCCAAAAGACATCACTAACCAAATATAAACAACTACTAACAACAGTACAAGCCACTAAGGGGGTGAATTACCACTTCTACCCCTCCTAACATAAGTATTAATGATTGGGGGCCTAACATACAGCTATATCAAATCAGGTAGATTTGGATGGAGAGTAGTCTTATCAGGTCGGGTAGTTTGATAGGAAGAGCATATCTTTGTTTGTGAGAGGAGATTAATGGTATGATAGCACTAGATACCAAGCATCCCTGAATGTGACAGGGGTAATAAGCAAATCGCCTAAAATGTCCACTACCCATTGTTAAAGACTCTTTTCTGAGACCATTTCAAACAATTGAGAATGGTCTTTTTCTGTTATTACCAAGAATCTTCTATTCCTCTCTTTCCTAAAGCCGGTTCCCAACACTTTCTTCCTCATGTTACACTTAAAGGTTGTTTTTGATGTCATGATGTCATGTTTAACATGACTGAGGCACATCATTATTGTGAAAGAGTTGAGGGGTTGATTTATGAGAGGAGAAGCGTGGCAGGGGAGAGGGGGAAAGGCAAGGAGGGTTGAAAGGAGAGTGAAGAATGTGCCATCCTTTTCCGTGTTTTGTTTGCTTCACGATGTGTTTTGTTTTTGTCATGAGTATTTTGTGAGAGGAGAAGCGTGTTGAAAGGAGAGTGAAGAATGCGCCATCCTTTTCCGTGTTTTGTTTGCTTCACGATGTGTTTTGTTTTTGTCATGAGTATTTTGTGAGAGGAGAAGCGTGTTGAAAGGAGAGTGAAGAATGCGCCATCCTTTTCCGTGTTTTGTTTGCTTCACGATGTGTTTTGTTTTTGTCATGAGTATTTTAGGAAGCCTTATCTCTTATTCATTGAGTTTTATGTTGGAATGCATGTTCTGTCTGTCGTTCAAAATACATTAAAAGATAATTGGTTTGAAACTTTTATGTCAAGACTGATGATAAGCTCTTTATTAGCCATAACATTGTTTTTTATTTTTTATTTTTTTAAGAACCAAGAGCATTATTTTCTAGAATGATGATGATTGTTATTATTTTTATTTTAAATCATCATCTACTGCTTCTTTTCTATGAAACTTTTGAATACAGAAACCCTATTGAGGAAGCCGAATTGGAACTTTAATGTCAAGACTGATGACTACTGGATTTTAGGCGCTCTCTCTCCGAAATTCACGATCCAAAAAAGAACAAGT
TTTATCATTTAACTGTAACAACGTATCAACACATTAATATTCAAGTATAAGAGTCTTGTACTTGGGTTTTTGTTGTACAGTTCGGATATTTCATTTATCAAGAAAATTGTTTCTTATAAAAAAAAAAAAAAAAAAAAAATCAGTTATAAGAGAAGCTTTGCTCACTCTCATTTTCTTCTGCCCAACAATATTATGTACTTAATTTTTGTTGTGTTTCTTGCAGTATATTCAGTACTTCAGTAGCTCTATGCCAGAAACTTTGACGGTATCTGATGAAAATAAAACTTCTAAAAGGAGGAAAGTTTGTACAGGGGACGTTGATCCCAGAAGTGATGAAGGAGTGGAAAGGAGTACTGGTAAGAATTAATCTGCTGAAGTGATAGTAATAATAACTTTCCGTGGCCAATATACTTTGAAGTTGCAGTTTTTGGTTGTTGTACCTCCACTAATTTCTATTTTGTATTGTTGCATTTGACCTTGATGACATCAGATGAACATGCTGGAACTTCTTTGATTTCTACGAAGAGTGAGGCAAAAGTAGAGAAATGTTCTCCTATTTCACTAGTGAAGTTGACGCGGAGTGGCTTGCTTTTGTTTACTTTTATCAAGAATATCTCTCCTGATACTGTTTATATTGTCAAAGACATAATTCAGTGTCTGGAAGCAGGGACTTTGAAGTCACCCGCGTAAGTAGTTTTCTCATTCATTACCATTTCAATTAAGAATAGGTTGAATAAGCTTACTATTTTTTGCACGTGTTTGACATTTTTTTTTTGCTTGCATTTGCTAGAGAAGTATATAGTCATTAAGGCTTTCATTCGGTACTTCATACTTTTCCAAGTATTTCTTGGAGCAAATAACTTTGGAAATTTCTTGTTTTAATGGTTGTATTATTTTTATCACTACTATTTGCTGATTGTTTTTTCTTAAGAGAGGATAAACATTTTCATTGATGAAATTAAATAAAGAGGTAAAAATTCCAAACACCTAAAGGTGGATTACATGAAAGTTTGCCAGTCGGCCATTAAAGTAGAGAGACTAATGAGTAAAATGGTGTTTACATTTACACCAATACAAAGCATAAAAAAGAATAAAATCTATAATATAATCAAAGGAAGCCACCATTTTTTTACCCATTTGAGTCCAAGCACACTGTTAGGAATCAAAGTAGTATGAGATGTCTTTGTCCCACATTGGTTAGAATGGGATGACCAATGTGGTACTTAAGTGGCTTGGTTCTCCCACCCCAACAGCTAGCTTTTGGGATGTGGTTCTCCAAGGTGCTTAAGTACCTAATAATGGTATCAGAGCCAGTTGTCGACGTTCCGGAGTGGAACCGACGGAGGGTTCTGACCGGGTGTGGCCGAGTGAAGTACCATCGGAGTAGCCGACGAGACGTCGGCTTTCTGGGGATGGGGTATTGTTAGGAACAAGGTAGTATGGGATGTCTTTGTCCCACATTGGTTAGAATGGGATGACCAATGTGGTACTTAAGTGGCTTGTGGTGGTACTTAAGTGGCTTGGCTCTCCCACCCCAATAGCTAGCTTTTGGGATGTGGTTCTCCAAGGTGTGCTTAAGTACCTAACATACACTTCGAGCACCTAGTCTATTGTCCAGTGTTGTGTTGCCTGGTGTCTTGGGCATGCTTGTCCAGGGTGTGTTTGTCTATGGCCGCATGTCCAAATCATCCTTAACCACCTTATATTCATGTCTTGTAAATAGTTTAGCAAGTTGTTTTCATTTCATGTTATTTTCGTTGTACGTGACTCCTCGACTTTTTGATGTGCTAGCTTGCCGGTAGTGAAATTGTGCAAAATGTTCTATAAGGGAACCCATTGGTCAACCCTCGACAAGTATTGTTGTACTTTTTGCAATCAAAGATCTCTTGCAATTTACGGTCTACAATGCTTTCTTTGATGATGTCTTCAATGCTTTCTCTCACGTTTTATAAAGGCCATTTTCTCTCGAAATGTGAGTAGAGGCTAGACCATACAATGAA
GTTGTTAGCAACTTTCGATTTTTACATGACTGACTTTCTCATCGGATTAGAGCAATCCCCCTTCCTGAGTTATAAAGTTTGCGACTATGACAAGAGGTATGAGCTTAGCTAGCTCTTTGGCTGGAAAAAGCAATTATCTAAGCCGCAGGCTGAATGACTTACTGCGGTTGAATATCTATGGCTATATCTGAGGGAAGTGCCGACGAAGTGAGATATATTCAAACTCGACTTGAATAACTCGGCTCTTTGGCTGGAAAAGACTGCAATCTTTTTGACAAAAAGCAATTATCTAAGCTGCAGGCTGAATTACTTACTGAGATTGAATATCTATGGCTATATCTGAGGGAAGTGCCGACGAAGTGAGATATATTCAAACTCGACTTGATGGGTTAAAAGAAAAAGTCGGGGAAATTGATGTGTTCAATGCCCGAATGGATGGCCTACCAATAAAAGAATTGACGTTGAGGATCGATTCTTTATAACACAAAGCTAAACGTCCCTGCTAGTTTTGAATGTGGTTATGGTTCCATGAGCTTGTCTCACTATGAGAGCGCTAGTTAGAGATATAGTAGGATTATTAAATAGAATATTGATATGGTTATTTGAAGGAGCATATTAGTAAATAACTAGTAAATTTGTTAGAACTTTTAGTTATAAACAGAGGAAGTGGATTGGGGTAAGATGTGAAGAATGTTTTGGGATTTCCTTTGTTGGGAATTTGGAGAGACTAGCCCTCTCGAAAATCAATGGTATATTGTAGTTTCTTCGTTGATATTGCAATATATATTTTTATCTTTAGTGTTCCTTGTATTTCTTGATTGTTCTTGTTAAGAGGTATCCTAACACTCACACATGGAAGAACGTGTCGAAGATCTAAACAATTTTCAAAGGATCATGTTGAAGTTGTTCAATGATCCGACTGAAGATTTTAGAGCAACGATAAAAGCCATCAAAGATAAGATGATCGATATGAAGACTTAGGTTAATTTGATGATGCACAGAGTGAGAAATCAAACTCCGAATTAGGCATACATCATGCCCAACAAGTTCAAGATCCCGAAGCGCTAAGCTTTCAATAGGAACTATTATGCTAAAGAACTTGAGAACTTCATCTTTGGCATGGAACAATACTTCAAAGCGAGTGGAACTGATTCAGAAGAAACAAGGCTACTTTGGCCTCCATTCATCCCTTTGACGATGCGAACCTATGGTGGAGGTCCAAAGTTAACAACATTCACAATGGTTTATGTACCATCGATACTTGGAGAGATCTTAGGAAAGAACTAAGGGCTTAGTTCTTCCTCGAGAATGTTAAACTCATTGCAAGAAGAAAGTTATAGGAACTCAAACATATAGGAACCATCAAGGACTATGTGAAACAGTCCCAGCTATTATGCTGGACATTCAGGACATATTTGAGAATTTACAATGGTGAACAAGCTCCCCAAAAGAAAATTCAAAAGTGTGGGACTTGATTAGTTCCTCTCAAGAGAGATGAAGATTGATGGCCAAGTGAACTAGTTCCTACCATTAAGGAGCTTGACTTGGACATCTCCTAATTTTAGTAAGTTGACTGGAGTGGGCCTAAAGAATAGGATCATAGTTTCATCTTCTTACCAAGTCAACTTGCTAAAGCTTCCCAACCAGTCCCTCTTTGTAAGAACGTGGTTGGCACATATATAGTGTATAAAATGGGGATTTTCCCTTTGGCCGTATGGTGTGCTTGTGTGCTTATGTTCTAAGGGGCTTTCCTTTGTATCAAGTCTTGAGGATCCATGGGACGAACTAGGCTAAACTAAGAAAACAATTTTTGTGTCCTCTCTACGAAAGGGAGTCTTCTCTACTAAGACAAGGTGAGGGATTTGAGAAATCCCAACACACTTCTCCATCTCTCATCTGAGTCAACATGGAGTTGAGGGATTCCATGAGGGAGAGTCTTCTGTGTGAAAAAGAGTTTAGTGTCACGCCTCATGAAAATGGTCCTTCAAATTGTGCCACAA
CATGACCCAAGACTCCTTTAGCAAGGCCAAGGAGGAGTCAAGATCCATCAGATGGCATCCTTGAGTCGCCATGCATGGCTCACCCGTGTGCCCGTGTTCCAGGCGCACGCCCATCATAGACACCCAAGACGCCCACATGCCCAGGCATAGGCATGCACAGGCATGCCAAGACACGATATGCCCACATAGGCGCCCCATTGAGCAGCCATGTGTGCAAGGATGCACACAACCATACGTGCAGATGTGCGCAAGGCCGGACACACAACCGTGCGTGCGGCCAAGCGAGTGGCCTTGCCCGATGTGTGCGGCTATGCGCGTGCACGCAGCGAAGGCACACGGCCAAGCACACGCCCTCTGCCCTAGTCGACCGCACCGCCCAACGCCCACACCCACGCCTGCCAACATGCTAGAGCTTTGGAAGATTCTAGAACCTCCTAGACAGTCCCAGCGCCCATGTCTATGTCCGATAAGCAACTGGAAAGCTCGAGGTGCTGCTAGACGTGTCTGGAAGAGCCCAGACAACACCAAAATGTGCCACACTGCTCACCCATGGGCTAGACAGTGCTAGAAGGCCCCAGAGTGCTCGAGAAGGCCTCAGACTAGTATGGAAGCCTCTAGAACGTGCTAGACAAGTCCAAACCCTACAATATCTATCAAGTACTAGAATGTACTAGAAATGTCTAGGGTGGTCAAGATAGGTCTAGAATGTTCTATATAAGGCTTGAATGTACTAAGATACATTGAGAAGGCTCTAGAGAGAGCCATAACCGACCCTCTAAGGGCGGTTTAGAAGCCCCTAAGTCAGTAGGGCTTGTCCATAAATAGCCCAAGTGGGACTCATACTATTTCATGTGATAGTTAGTAGGGCTTGTCCATAGTATGAAAAGATCCATGTACCAAGGTGCCATTATGGACTCGATTTTATGAAATTAGTAGTGTATTTTGAACATAGAGCAACTATGAAATTAGAGTCGAAGAAACCCTATTTCATGAAATTACTTACATTCATTTGTCTGCACGGACAAAATCTCTGATTTTGTTCATTCCTCTCTCCAAAGAAACCAAAGAAACCAAAGAAAGGTTAAGTTGAGAGTAAGCCATGATAACTTCTTTGCACTGTCGAAGGGATAGACCCATAAAAACAGTAGCTTCAAGACAAAATGAAAATGTACAACAATTAGGTAACTGGTTATTTTCTTTGTGCAGTTGGTGCCATCGCATATTCCCCATCCAAGCTACTTGCTGCTTGAATGAAAACGATCTCCACGCGGTTGTGTCAAAGCTTGTTCTTCATTTCATGAAGGATAAAGGAAACATTCTTTCACAGCCTGTAAAGGTAAAGCTTCGTCATTTTCATTAGTGAGTGAAGGTGTTTTCCTGGTTTTTTTCTATCCTTTTCATTAGGCAGTGAAGCTTGGATAATTCATGAGAGATAGATGGCAAGCTCCAGGCCCTTATTTTTCTTTCTTTTCATCGAATTTTTGACATTTATTTACATCTTTATGTAGTTTGCAGTAGGGTACAACAGAAGAGGAATTGAAGAGACTGAGATGAAGAAGACTTCCAAAGATAGTTCTGGTGCTAATGTTATGCTGGGGCGCGATAAATGCTTTAGCATCGTGGCTGCTGCCGTGAAAGATGTGGTCTCGAATGCCATTGTAGATTTGAAATCTCCAGAGGTGAATGGATTACTGTTAATGGCACCTTGGTACATGGATCTTTTCATACTATTTCATGTGATAGTTTTGTTTTGTGCTTATTGATTATCTTACAGCTCTGCATCCTTGTTGAGTTGCTTCCTGTTTCTGGGTTGCCTCTTGAATCATTGGTGGTGGGGGTGTCGGTTCTTCCAAGCAATCTTGTTACTACGAAGCCTCGACTTTGCATCAAAGCTTTGACTTCTGATACCAAGGCAAAGAGTTGAAGGCATAAATCAAAATTTTCAATGGACCAAATATTAGACTGTTAATCTCACGAATGGATAAGGTTGGAAAGCTTCA
AGAATTATGGAACAAATGCTAGAGTGAAGATTTTGAAGAAAGAGGACGATTTCCCTCAATCAACTCAATGCAGTCAATATTATTGACAGTTGCACAATCTATCGAACAGATCTTGTTCAAGTAAGGTCGTCAGTAAACATTATTACTTTTAATGGAGATTTGTAATTTATCTTGTTTTGTATATAACTTCAACGGACCATTTCGAGTTGCACAATGCTAGTCCCTAGGATAGATGTTCGACACCTGACAAAAATATTACTCGAGTTTTGCACTTACTTATTGGGGTAACAAATACTTTGGCATAAATTTTAGGGGTTGTATCAATTTAGATCATAAACTTATATAAGTGAATTGATTTAGATTATTTGTTAATG

mRNA sequence

CCAACATAATTTTTTTTGTTACCATTCTTTAATAGAAACTTTCTTGATATTTTTACAAATAAAATTTTAAAATTTGCTATTCGTCAAGCCCACCCTTTGAAAATTGGAGCAAATAGCTAAAGTTGGTCCGGACATTCCCTGAAAAATGTTTTCGGCCTGTTGGGCTTCGGACTTGAGCGTTGAGCCTGAGTTTGGACCTGCCATGGGCCAGAGCCGCTCAGTGATGAGACTGCCGGATTACCGTCGGTTATCGCGGCCGTAGGTTTTGTCATCGGAAGATCATCCGCCAAATCAAAACTACACACAGCTAAATGGCAGAGAAAGAAGAACAGATCAATGGCTGCCACCTCAAACCTGTAGCAGAAGCCTCAAAACCGACAGAGAATGTCCCCGAAACCGAATGGAAAACGATGACGCCATGGGAGCAGCACTCTGCTGTCATAAGCATCCCTCGGTTCGATTACAATGCACCGTCTGCGCTTCTTCAACGTTGCCAGTCTGGATTCCTCATTACATGCAGTATCAAGAGGGAGAAGAGTGCCACAAAAGAAGCTATCTCCATCCTTGAAAAGTATATTCAGTACTTCAGTAGCTCTATGCCAGAAACTTTGACGGTATCTGATGAAAATAAAACTTCTAAAAGGAGGAAAGTTTGTACAGGGGACGTTGATCCCAGAAGTGATGAAGGAGTGGAAAGGAGTACTGATGAACATGCTGGAACTTCTTTGATTTCTACGAAGAGTGAGGCAAAAGTAGAGAAATGTTCTCCTATTTCACTAGTGAAGTTGACGCGGAGTGGCTTGCTTTTGTTTACTTTTATCAAGAATATCTCTCCTGATACTGTTTATATTGTCAAAGACATAATTCAGTGTCTGGAAGCAGGGACTTTGAAGTCACCCGCTTGGTGCCATCGCATATTCCCCATCCAAGCTACTTGCTGCTTGAATGAAAACGATCTCCACGCGGTTGTGTCAAAGCTTGTTCTTCATTTCATGAAGGATAAAGGAAACATTCTTTCACAGCCTGTAAAGTTTGCAGTAGGGTACAACAGAAGAGGAATTGAAGAGACTGAGATGAAGAAGACTTCCAAAGATAGTTCTGGTGCTAATGTTATGCTGGGGCGCGATAAATGCTTTAGCATCGTGGCTGCTGCCGTGAAAGATGTGGTCTCGAATGCCATTGTAGATTTGAAATCTCCAGAGCTCTGCATCCTTGTTGAGTTGCTTCCTGTTTCTGGGTTGCCTCTTGAATCATTGGTGGTGGGGGTGTCGGTTCTTCCAAGCAATCTTGTTACTACGAAGCCTCGACTTTGCATCAAAGCTTTGACTTCTGATACCAAGGCAAAGAGTTGAAGGCATAAATCAAAATTTTCAATGGACCAAATATTAGACTGTTAATCTCACGAATGGATAAGGTTGGAAAGCTTCAAGAATTATGGAACAAATGCTAGAGTGAAGATTTTGAAGAAAGAGGACGATTTCCCTCAATCAACTCAATGCAGTCAATATTATTGACAGTTGCACAATCTATCGAACAGATCTTGTTCAAGTAAGGTCGTCAGTAAACATTATTACTTTTAATGGAGATTTGTAATTTATCTTGTTTTGTATATAACTTCAACGGACCATTTCGAGTTGCACAATGCTAGTCCCTAGGATAGATGTTCGACACCTGACAAAAATATTACTCGAGTTTTGCACTTACTTATTGGGGTAACAAATACTTTGGCATAAATTTTAGGGGTTGTATCAATTTAGATCATAAACTTATATAAGTGAATTGATTTAGATTATTTGTTAATG

Coding sequence (CDS)

ATGGCAGAGAAAGAAGAACAGATCAATGGCTGCCACCTCAAACCTGTAGCAGAAGCCTCAAAACCGACAGAGAATGTCCCCGAAACCGAATGGAAAACGATGACGCCATGGGAGCAGCACTCTGCTGTCATAAGCATCCCTCGGTTCGATTACAATGCACCGTCTGCGCTTCTTCAACGTTGCCAGTCTGGATTCCTCATTACATGCAGTATCAAGAGGGAGAAGAGTGCCACAAAAGAAGCTATCTCCATCCTTGAAAAGTATATTCAGTACTTCAGTAGCTCTATGCCAGAAACTTTGACGGTATCTGATGAAAATAAAACTTCTAAAAGGAGGAAAGTTTGTACAGGGGACGTTGATCCCAGAAGTGATGAAGGAGTGGAAAGGAGTACTGATGAACATGCTGGAACTTCTTTGATTTCTACGAAGAGTGAGGCAAAAGTAGAGAAATGTTCTCCTATTTCACTAGTGAAGTTGACGCGGAGTGGCTTGCTTTTGTTTACTTTTATCAAGAATATCTCTCCTGATACTGTTTATATTGTCAAAGACATAATTCAGTGTCTGGAAGCAGGGACTTTGAAGTCACCCGCTTGGTGCCATCGCATATTCCCCATCCAAGCTACTTGCTGCTTGAATGAAAACGATCTCCACGCGGTTGTGTCAAAGCTTGTTCTTCATTTCATGAAGGATAAAGGAAACATTCTTTCACAGCCTGTAAAGTTTGCAGTAGGGTACAACAGAAGAGGAATTGAAGAGACTGAGATGAAGAAGACTTCCAAAGATAGTTCTGGTGCTAATGTTATGCTGGGGCGCGATAAATGCTTTAGCATCGTGGCTGCTGCCGTGAAAGATGTGGTCTCGAATGCCATTGTAGATTTGAAATCTCCAGAGCTCTGCATCCTTGTTGAGTTGCTTCCTGTTTCTGGGTTGCCTCTTGAATCATTGGTGGTGGGGGTGTCGGTTCTTCCAAGCAATCTTGTTACTACGAAGCCTCGACTTTGCATCAAAGCTTTGACTTCTGATACCAAGGCAAAGAGTTGA

Protein sequence

MAEKEEQINGCHLKPVAEASKPTENVPETEWKTMTPWEQHSAVISIPRFDYNAPSALLQRCQSGFLITCSIKREKSATKEAISILEKYIQYFSSSMPETLTVSDENKTSKRRKVCTGDVDPRSDEGVERSTDEHAGTSLISTKSEAKVEKCSPISLVKLTRSGLLLFTFIKNISPDTVYIVKDIIQCLEAGTLKSPAWCHRIFPIQATCCLNENDLHAVVSKLVLHFMKDKGNILSQPVKFAVGYNRRGIEETEMKKTSKDSSGANVMLGRDKCFSIVAAAVKDVVSNAIVDLKSPELCILVELLPVSGLPLESLVVGVSVLPSNLVTTKPRLCIKALTSDTKAKS
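The protein sequence above appears to be the standard-genetic-code translation of the CDS. The sketch below shows how that relationship could be checked. It assumes the standard nuclear code (NCBI translation table 1), and the `translate` helper is a hypothetical name, not part of any tool referenced on this page.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (assumption: NCBI translation table 1 applies to this nuclear gene).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 15 codons of the CDS listed above.
print(translate("ATGGCAGAGAAAGAAGAACAGATCAATGGCTGCCACCTCAAACCT"))
```

Running `translate` on the full CDS string and comparing the result with the listed protein would confirm whether the two records agree.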
BLAST of Bhi07G000020 vs. TAIR10
Match: AT1G09290.1 (unknown protein)

HSP 1 Score: 304.7 bits (779), Expect = 7.4e-83
Identity = 173/322 (53.73%), Positives = 214/322 (66.46%), Query Frame = 0

Query: 30  EWKTMTPWEQHSAVISIPRFDYNAPSALLQRCQSGFLITCSIKREKSATKEAISILEKYI 89
           E +T+TPWEQHS++ISIPRFDY APS+LL    SGFL+TC+IKREKSATKE +SIL KYI
Sbjct: 28  EAETLTPWEQHSSIISIPRFDYKAPSSLLHHSHSGFLVTCNIKREKSATKEVMSILGKYI 87

Query: 90  QYFSSSMPETLTVSDENKTSKRRKVCTGDVDPRSDEGVERSTD---EHAGTSLISTKSEA 149
                  PE L     +  SK++KVC  + +   ++ V    D   E      +     A
Sbjct: 88  GSMHEEKPEVL----NSTASKKQKVCAQETEEGGEKTVPLENDALQETGENPNVEDLKLA 147

Query: 150 KVEKCSPISLVKLTRSGLLLFTFIKNISPDTVYIVKDIIQCLEAGTLKSPAWCHRIFPIQ 209
             E  S +SLVKLT+SGLLLFTF    SP+T  IV  + Q +E+G LK+P WCHRIFP+Q
Sbjct: 148 NEEHNSLMSLVKLTKSGLLLFTFPVENSPNTTNIVSRVFQSMESGALKAPIWCHRIFPVQ 207

Query: 210 ATCCLNENDLHAVVSKLVLHFMKDKGNILSQPVKFAVGYNRRGIEETEMKKTSKDSSGAN 269
           ATC L E +L   VSKLV  F+ DK N LS+PVKFA GY RRG EET+  K  KD+S   
Sbjct: 208 ATCGLTEKELRETVSKLVQRFVNDKDNTLSKPVKFAAGYQRRGAEETK-GKIRKDASDVL 267

Query: 270 V---MLGRDKCFSIVAAAVKDVVSNAIVDLKSPELCILVELLPVSGLPLESLVVGVSVLP 329
           V   +L R KCF  VAA VKD+V +++VDLKSPELC+LVELLP+S +   S V  VSVLP
Sbjct: 268 VQCPLLDRIKCFETVAAGVKDIVPDSVVDLKSPELCVLVELLPLSRISSGSFVAAVSVLP 327

Query: 330 SNLVTTKPRLCIKALTSDTKAK 346
             LV+TKP+LCIK L  ++K K
Sbjct: 328 HRLVSTKPKLCIKPLVPESKHK 344
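Each HSP header reports identity and positive counts as fractions of the alignment length. As a small sketch, the percentages can be recomputed from the raw counts to check that they agree with the printed values. The regular expression and function name are illustrative assumptions, and the pattern tolerates both spellings of "Positives" seen in such output.

```python
import re

# Matches the statistics line of an HSP; "Posi?tives" accepts the misspelled variant too.
STATS = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")

def hsp_percentages(line):
    """Return (percent identity, percent positives) recomputed from the raw counts."""
    m = STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return 100 * ident / ident_len, 100 * pos / pos_len

line = "Identity = 173/322 (53.73%), Postives = 214/322 (66.46%), Query Frame = 0"
identity, positives = hsp_percentages(line)
print(f"{identity:.2f} {positives:.2f}")
```

For the TAIR10 hit, the recomputed values round to 53.73 and 66.46, matching the figures in the header.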

BLAST of Bhi07G000020 vs. TrEMBL
Match: tr|A0A0A0KQX3|A0A0A0KQX3_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G273440 PE=4 SV=1)

HSP 1 Score: 543.9 bits (1400), Expect = 2.6e-151
Identity = 295/347 (85.01%), Positives = 310/347 (89.34%), Query Frame = 0

Query: 1   MAEKEEQINGCHLKPVAEASKPTENVPETEWKTMTPWEQHSAVISIPRFDYNAPSALLQR 60
           MAE EEQ NGCHLKP AEA K  ENV ETE K MTPWEQHSAVISIPRFDYNAPSALL R
Sbjct: 1   MAETEEQNNGCHLKPEAEAFKRAENVAETERKMMTPWEQHSAVISIPRFDYNAPSALLHR 60

Query: 61  CQSGFLITCSIKREKSATKEAISILEKYIQYFSSSMPETLTVSDENKTSKRRKVCTGDVD 120
           CQ+GFLITC+IKREKSATKEAISIL+KY+QYF+SSM ETL VSDEN+TSKRRKV + DVD
Sbjct: 61  CQTGFLITCTIKREKSATKEAISILQKYVQYFNSSMSETLVVSDENETSKRRKV-SEDVD 120

Query: 121 PRSDEGVERSTDEHA-GTSLISTKSEAKVEKCSPISLVKLTRSGLLLFTFIKNISPDTVY 180
            RS  G E STDEHA  TSLISTKSEAKVEKCSPISLVKLTRSGLLLFTF K+ISPDTVY
Sbjct: 121 HRS-VGGESSTDEHAKETSLISTKSEAKVEKCSPISLVKLTRSGLLLFTFTKDISPDTVY 180

Query: 181 IVKDIIQCLEAGTLKSPAWCHRIFPIQATCCLNENDLHAVVSKLVLHFMKDKGNILSQPV 240
           IVKDI+Q LEA TLKS AWCHRIFPIQATC LNENDL  VVSKLVLHFM DKGNILS PV
Sbjct: 181 IVKDIMQSLEARTLKSLAWCHRIFPIQATCSLNENDLQGVVSKLVLHFMNDKGNILSHPV 240

Query: 241 KFAVGYNRRGIEETEMKKTSKDSSGANVMLGRDKCFSIVAAAVKDVVSNAIVDLKSPELC 300
           KFA+GYNRRGIEETEMKKT +DSSG NV+LGRDKCFSIVAAAVK VVS+AIVDLKSPELC
Sbjct: 241 KFAIGYNRRGIEETEMKKTFEDSSGVNVILGRDKCFSIVAAAVKGVVSDAIVDLKSPELC 300

Query: 301 ILVELLPVSGLPLESLVVGVSVLPSNLVTTKPRLCIKALTSDTKAKS 347
           +LVELLPVSGLP  S VVGVSVL +NLVTTKPRLCIKALTSDTKAKS
Sbjct: 301 VLVELLPVSGLPSGSSVVGVSVLSNNLVTTKPRLCIKALTSDTKAKS 345

BLAST of Bhi07G000020 vs. TrEMBL
Match: tr|A0A1S3CD85|A0A1S3CD85_CUCME (uncharacterized protein LOC103499095 OS=Cucumis melo OX=3656 GN=LOC103499095 PE=4 SV=1)

HSP 1 Score: 503.1 bits (1294), Expect = 5.1e-139
Identity = 277/335 (82.69%), Positives = 293/335 (87.46%), Query Frame = 0

Query: 13  LKPVAEASKPTENVPETEWKTMTPWEQHSAVISIPRFDYNAPSALLQRCQSGFLITCSIK 72
           +KP AEASK  ENV ETE KTMTPWEQHSAVIS+PRFDYNAPSALL RCQSGFLITC+IK
Sbjct: 4   VKPEAEASKRAENVAETERKTMTPWEQHSAVISLPRFDYNAPSALLHRCQSGFLITCTIK 63

Query: 73  REKSATKEAISILEKYIQYFSSSMPETLTVSDENKTSKRRKVCTGDVDPRSDEGVERSTD 132
           REKSATKEAI ILEKY+QYFSSSM ETL +SDEN+TSKRRKV + D+D  S  G ER+TD
Sbjct: 64  REKSATKEAIFILEKYVQYFSSSMTETLVISDENETSKRRKV-SEDIDHIS-VGGERNTD 123

Query: 133 EHA-GTSLISTKSEAKVEKCSPISLVKLTRSGLLLFTFIKNISPDTVYIVKDIIQCLEAG 192
           EHA  TSLISTKSEAKVEKCSPISLVKLTRSGLLLFTF K+ISPDTVYIVKDI+Q LEA 
Sbjct: 124 EHAKETSLISTKSEAKVEKCSPISLVKLTRSGLLLFTFTKDISPDTVYIVKDIMQSLEAR 183

Query: 193 TLKSPAWCHRIFPIQATCCLNENDLHAVVSKLVLHFMKDKGNILSQPVKFAVGYNRRGIE 252
           TLKS AWCHRIFPIQATC LNENDL  VVSKLVLHFMKDKGNILS PVKFAVGYNRRG+E
Sbjct: 184 TLKSLAWCHRIFPIQATCSLNENDLQGVVSKLVLHFMKDKGNILSHPVKFAVGYNRRGME 243

Query: 253 ETEMKKTSKDSSGANVMLGRDKCFSIVAAAVKDVVSNAIVDLKSPELCILVELLPVSGLP 312
                    DSSGANV+LGRDKCFSIVAAAVK VVS+ IVDLKSPELC+LVELLPVSGLP
Sbjct: 244 ---------DSSGANVILGRDKCFSIVAAAVKGVVSDVIVDLKSPELCVLVELLPVSGLP 303

Query: 313 LESLVVGVSVLPSNLVTTKPRLCIKALTSDTKAKS 347
             S VVGVSVL +NLVTTKPRLCIKALTSD KAKS
Sbjct: 304 PGSSVVGVSVLSNNLVTTKPRLCIKALTSDAKAKS 327

BLAST of Bhi07G000020 vs. TrEMBL
Match: tr|A0A2R6P3P4|A0A2R6P3P4_ACTCH (Checkpoint protein like OS=Actinidia chinensis var. chinensis OX=1590841 GN=CEY00_Acc32858 PE=4 SV=1)

HSP 1 Score: 374.0 bits (959), Expect = 3.6e-100
Identity = 205/331 (61.93%), Positives = 246/331 (74.32%), Query Frame = 0

Query: 28  ETEWKTMTPWEQHSAVISIPRFDYNAPSALLQRCQSGFLITCSIKREKSATKEAISILEK 87
           E E   M PWEQHS VISIPRFDYNAP++LL    SGFL+TC IKREKSATKEAISILEK
Sbjct: 11  EGEKLGMKPWEQHSGVISIPRFDYNAPASLLHHSHSGFLVTCPIKREKSATKEAISILEK 70

Query: 88  YIQYFSSSMPETLTVSDENKTSKRRKVCTGDVDPRSDEGVE-----RSTDEHAG-----T 147
           ++  F+    E+L  SD N  +KRRK+C G+VD      +E      S+ + +G     +
Sbjct: 71  FVGLFNIGSSESLKSSDANVVAKRRKICMGEVDGECPTSIESKDAAASSVDSSGKLLEDS 130

Query: 148 SLISTKSEAKVEKCSPISLVKLTRSGLLLFTFIKNISPDTVYIVKDIIQCLEAGTLKSPA 207
            + STKS   VE+    SLVKLTRSGLLLFT ++N SPD V +V  I   LE+G+L SP 
Sbjct: 131 CMSSTKSNTNVERSPTFSLVKLTRSGLLLFTALENDSPDLVDVVSKIFCSLESGSLSSPL 190

Query: 208 WCHRIFPIQATCCLNENDLHAVVSKLVLHFMKDKGNILSQPVKFAVGYNRRGIEETEMK- 267
           WCHRIFPIQATCCLNE +LHA VSKLV+ F+ DK N L+QP+KFAVGYNRRG+EETEMK 
Sbjct: 191 WCHRIFPIQATCCLNEKELHATVSKLVIQFVNDKRNKLAQPIKFAVGYNRRGMEETEMKI 250

Query: 268 --KTSKDSSGANVMLGRDKCFSIVAAAVKDVVSNAIVDLKSPELCILVELLPVSGLPLES 327
              +SKDS     +L R+KCFSIVAAA+KDVVS+++VDLKSPEL +LVE+LP+SGLP  S
Sbjct: 251 SRNSSKDSKPL-ALLDRNKCFSIVAAALKDVVSDSVVDLKSPELSVLVEMLPLSGLPNGS 310

Query: 328 LVVGVSVLPSNLVTTKPRLCIKALTSDTKAK 346
           LVV VSVLP NL+TTKPRLCI+ L SDTKA+
Sbjct: 311 LVVAVSVLPCNLITTKPRLCIRPLVSDTKAR 340

BLAST of Bhi07G000020 vs. TrEMBL
Match: tr|A0A1Q3CS49|A0A1Q3CS49_CEPFO (Uncharacterized protein OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_26456 PE=4 SV=1)

HSP 1 Score: 363.2 bits (931), Expect = 6.4e-97
Identity = 202/328 (61.59%), Positives = 250/328 (76.22%), Query Frame = 0

Query: 28  ETEWKTMTPWEQHSAVISIPRFDYNAPSALLQRCQSGFLITCSIKREKSATKEAISILEK 87
           E E K M  WEQHS VISIPRFDYNAPS+LLQ   SGFLITC+IKREKSATKEAISIL+K
Sbjct: 9   EEESKEMKAWEQHSRVISIPRFDYNAPSSLLQHSHSGFLITCTIKREKSATKEAISILQK 68

Query: 88  YIQYFSSSMPETLTVSDENKTSKRRKVCTGDVDPRSDEGVE--RSTDEHAGTSLISTKSE 147
           Y  +F++    +  V  +    KRRK+CT +V  + D  +E   + DE  G S   + S 
Sbjct: 69  YFGFFNNDSSTSFGVDGD---GKRRKICTEEVGGKRDNVLESINNADELNGLSKDDSWSS 128

Query: 148 AKVEKCSPI-----SLVKLTRSGLLLFTFIKNISPDTVYIVKDIIQCLEAGTLKSPAWCH 207
            + +K +PI     SLVKLTRSGLLLFTF + ISP+T+ IV +I Q LE+G+LKSP WCH
Sbjct: 129 VETDK-NPITDYVLSLVKLTRSGLLLFTFPREISPETIDIVSNIFQSLESGSLKSPLWCH 188

Query: 208 RIFPIQATCCLNENDLHAVVSKLVLHFMKDKGNILSQPVKFAVGYNRRGIEETEMKKTSK 267
           RIFPIQATC LNE +L AVVS +VL F+ DK N L++P+KFAVGYNRRGIEET+MK +++
Sbjct: 189 RIFPIQATCRLNERELRAVVSNIVLRFVNDKQNKLARPIKFAVGYNRRGIEETQMKISNE 248

Query: 268 DSSGANV--MLGRDKCFSIVAAAVKDVVSNAIVDLKSPELCILVELLPVSGLPLESLVVG 327
           ++ G+ +  +L RDKCF +VA+AVKDVVS+A VDLKSPEL ILVELLP+SG+P ESLVV 
Sbjct: 249 NTVGSELFTLLNRDKCFGVVASAVKDVVSDATVDLKSPELAILVELLPLSGVPKESLVVA 308

Query: 328 VSVLPSNLVTTKPRLCIKALTSDTKAKS 347
           VSVLP NLV+TKPRLCI+AL S+T A++
Sbjct: 309 VSVLPLNLVSTKPRLCIRALVSNTNARN 332

BLAST of Bhi07G000020 vs. TrEMBL
Match: tr|A0A2P5AJJ5|A0A2P5AJJ5_9ROSA (THUMP domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_348860 PE=4 SV=1)

HSP 1 Score: 354.4 bits (908), Expect = 3.0e-94
Identity = 198/336 (58.93%), Positives = 239/336 (71.13%), Query Frame = 0

Query: 33  TMTPWEQHSAVISIPRFDYNAPSALLQRCQSGFLITCSIKREKSATKEAISILEKYIQYF 92
           +M PWEQH  VISIPRFDYNAPS+LLQ   SGFLITC+IKREKSATKEA+SIL KY+  F
Sbjct: 45  SMKPWEQHGGVISIPRFDYNAPSSLLQHSHSGFLITCTIKREKSATKEAMSILGKYVGSF 104

Query: 93  SSSMPETLTVSDENKTSKRRKVCTGDVDPR----------SDEGVERSTDEHA------- 152
           ++   ETL  SDEN  SKRRK C  D+D            +D+  ++S +          
Sbjct: 105 TTCCKETLDNSDENLVSKRRKTCMEDLDSECIHDNESKAAADDSEDKSVESDVLFIITGN 164

Query: 153 ---GTSLISTKSEAKVEKCSPISLVKLTRSGLLLFTFIKNISPDTVYIVKDIIQCLEAGT 212
              G SL S  +    E+   +SLVKLTRSGLLLFTF +  S D V+IV +IIQ LE+  
Sbjct: 165 HLKGNSLSSPNTNTTEERNHVLSLVKLTRSGLLLFTFARGTSFDPVHIVSNIIQSLESAG 224

Query: 213 LKSPAWCHRIFPIQATCCLNENDLHAVVSKLVLHFMKDKGNILSQPVKFAVGYNRRGIEE 272
             SP WCHRIFPIQATCCL+E +L  VVSKLV+ FM D+ N L+ P+KFA+GYNRRGIEE
Sbjct: 225 SSSPQWCHRIFPIQATCCLDEKELRVVVSKLVIEFMNDRQNKLAMPLKFAIGYNRRGIEE 284

Query: 273 TEMKK---TSKDSSGANVMLGRDKCFSIVAAAVKDVVSNAIVDLKSPELCILVELLPVSG 332
           TEMK    TS  S+G+N +L R+ CFSIVAAAVKD V +++VDL+SPEL +LVELLP+SG
Sbjct: 285 TEMKNPKDTSHVSAGSN-LLDRNSCFSIVAAAVKDAVPDSVVDLRSPELSVLVELLPLSG 344

Query: 333 LPLESLVVGVSVLPSNLVTTKPRLCIKALTSDTKAK 346
           +P  SLVV VSVL  NLV+TKPRLC+KAL S+ KAK
Sbjct: 345 IPNGSLVVAVSVLSQNLVSTKPRLCVKALASNVKAK 379

BLAST of Bhi07G000020 vs. NCBI nr
Match: XP_011655096.1 (PREDICTED: uncharacterized protein LOC101219243 isoform X1 [Cucumis sativus] >KGN50822.1 hypothetical protein Csa_5G273440 [Cucumis sativus])

HSP 1 Score: 543.9 bits (1400), Expect = 4.0e-151
Identity = 295/347 (85.01%), Positives = 310/347 (89.34%), Query Frame = 0

Query: 1   MAEKEEQINGCHLKPVAEASKPTENVPETEWKTMTPWEQHSAVISIPRFDYNAPSALLQR 60
           MAE EEQ NGCHLKP AEA K  ENV ETE K MTPWEQHSAVISIPRFDYNAPSALL R
Sbjct: 1   MAETEEQNNGCHLKPEAEAFKRAENVAETERKMMTPWEQHSAVISIPRFDYNAPSALLHR 60

Query: 61  CQSGFLITCSIKREKSATKEAISILEKYIQYFSSSMPETLTVSDENKTSKRRKVCTGDVD 120
           CQ+GFLITC+IKREKSATKEAISIL+KY+QYF+SSM ETL VSDEN+TSKRRKV + DVD
Sbjct: 61  CQTGFLITCTIKREKSATKEAISILQKYVQYFNSSMSETLVVSDENETSKRRKV-SEDVD 120

Query: 121 PRSDEGVERSTDEHA-GTSLISTKSEAKVEKCSPISLVKLTRSGLLLFTFIKNISPDTVY 180
            RS  G E STDEHA  TSLISTKSEAKVEKCSPISLVKLTRSGLLLFTF K+ISPDTVY
Sbjct: 121 HRS-VGGESSTDEHAKETSLISTKSEAKVEKCSPISLVKLTRSGLLLFTFTKDISPDTVY 180

Query: 181 IVKDIIQCLEAGTLKSPAWCHRIFPIQATCCLNENDLHAVVSKLVLHFMKDKGNILSQPV 240
           IVKDI+Q LEA TLKS AWCHRIFPIQATC LNENDL  VVSKLVLHFM DKGNILS PV
Sbjct: 181 IVKDIMQSLEARTLKSLAWCHRIFPIQATCSLNENDLQGVVSKLVLHFMNDKGNILSHPV 240

Query: 241 KFAVGYNRRGIEETEMKKTSKDSSGANVMLGRDKCFSIVAAAVKDVVSNAIVDLKSPELC 300
           KFA+GYNRRGIEETEMKKT +DSSG NV+LGRDKCFSIVAAAVK VVS+AIVDLKSPELC
Sbjct: 241 KFAIGYNRRGIEETEMKKTFEDSSGVNVILGRDKCFSIVAAAVKGVVSDAIVDLKSPELC 300

Query: 301 ILVELLPVSGLPLESLVVGVSVLPSNLVTTKPRLCIKALTSDTKAKS 347
           +LVELLPVSGLP  S VVGVSVL +NLVTTKPRLCIKALTSDTKAKS
Sbjct: 301 VLVELLPVSGLPSGSSVVGVSVLSNNLVTTKPRLCIKALTSDTKAKS 345

BLAST of Bhi07G000020 vs. NCBI nr
Match: XP_022158307.1 (uncharacterized protein LOC111024819 [Momordica charantia])

HSP 1 Score: 537.0 bits (1382), Expect = 4.9e-149
Identity = 288/347 (83.00%), Positives = 304/347 (87.61%), Query Frame = 0

Query: 1   MAEKEEQINGCHLKPVAEASKPTENVPETEWKTMTPWEQHSAVISIPRFDYNAPSALLQR 60
           MAEKE   NG   KPVAEASK TENV E   KTMTPWEQHS VISIPRFDYNAPSALL  
Sbjct: 1   MAEKEANNNGYQPKPVAEASKATENVAENGGKTMTPWEQHSGVISIPRFDYNAPSALLHH 60

Query: 61  CQSGFLITCSIKREKSATKEAISILEKYIQYFSSSMPETLTVSDENKTSKRRKVCTGDVD 120
           C SGFLITCSIKREKSATKEAISILEKYIQYFSSS PE L VSDEN+ SKRRK+CT DVD
Sbjct: 61  CHSGFLITCSIKREKSATKEAISILEKYIQYFSSSTPENLEVSDENEVSKRRKICTEDVD 120

Query: 121 PRSDEGVERSTDEHA-GTSLISTKSEAKVEKCSPISLVKLTRSGLLLFTFIKNISPDTVY 180
            +S +G ERS+DEH  GTS+ STK EA+VEK SPISLVKLTRSGL+LFT  K+ISPDTV+
Sbjct: 121 HKSVKGEERSSDEHVEGTSMNSTKCEARVEKSSPISLVKLTRSGLVLFTLPKDISPDTVF 180

Query: 181 IVKDIIQCLEAGTLKSPAWCHRIFPIQATCCLNENDLHAVVSKLVLHFMKDKGNILSQPV 240
           IV DIIQ LEAGTLKSPAWCHRIFPIQATCCLNE DL AVVSKLVL F+K+K NILS PV
Sbjct: 181 IVLDIIQSLEAGTLKSPAWCHRIFPIQATCCLNEKDLQAVVSKLVLRFLKNKENILSHPV 240

Query: 241 KFAVGYNRRGIEETEMKKTSKDSSGANVMLGRDKCFSIVAAAVKDVVSNAIVDLKSPELC 300
           KFAVGYNRRGIEETEM KT KDSSGANVM  RDKCFSIVAAAVKDVVSNA+VDLKSPE C
Sbjct: 241 KFAVGYNRRGIEETEM-KTCKDSSGANVMASRDKCFSIVAAAVKDVVSNAVVDLKSPEFC 300

Query: 301 ILVELLPVSGLPLESLVVGVSVLPSNLVTTKPRLCIKALTSDTKAKS 347
           +LVELLP+SGLPL SLVVGVSVLPS+LVTTKPRLCIKALTSDTKAKS
Sbjct: 301 VLVELLPLSGLPLPSLVVGVSVLPSDLVTTKPRLCIKALTSDTKAKS 346

BLAST of Bhi07G000020 vs. NCBI nr
Match: XP_023553260.1 (uncharacterized protein LOC111810722 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 531.9 bits (1369), Expect = 1.6e-147
Identity = 291/347 (83.86%), Positives = 304/347 (87.61%), Query Frame = 0

Query: 1   MAEKEEQINGCHLKPVAEASKPTENVPETEWKTMTPWEQHSAVISIPRFDYNAPSALLQR 60
           MAEK EQ NGC  K  AEASK  ENV ETE KTMTPWEQHSAVISIPRFDYNAPSALL  
Sbjct: 1   MAEK-EQFNGCRPKHAAEASK--ENVAETERKTMTPWEQHSAVISIPRFDYNAPSALLHH 60

Query: 61  CQSGFLITCSIKREKSATKEAISILEKYIQYFSSSMPETLTVSDENKTSKRRKVCTGDVD 120
            QSGFLITC+IKREKSATKEAISILEKY QYFS+S PET   SDEN+TSKRRKVCT D+D
Sbjct: 61  RQSGFLITCAIKREKSATKEAISILEKYSQYFSNSTPETSESSDENETSKRRKVCTEDID 120

Query: 121 PRSDEGVERSTDEHA-GTSLISTKSEAKVEKCSPISLVKLTRSGLLLFTFIKNISPDTVY 180
            +S E   RSTDEH  GTS IST S AKVEKCSPISLVKLTRSGLLL TF K+ISPDTVY
Sbjct: 121 YKSVEDEGRSTDEHVNGTSTISTSSGAKVEKCSPISLVKLTRSGLLLLTFAKDISPDTVY 180

Query: 181 IVKDIIQCLEAGTLKSPAWCHRIFPIQATCCLNENDLHAVVSKLVLHFMKDKGNILSQPV 240
           IV D+IQ LEAGTLKSP WCHRIFPIQ+TCCLNENDL  VVSKLVL FMKDKGN LS PV
Sbjct: 181 IVSDLIQSLEAGTLKSPTWCHRIFPIQSTCCLNENDLRGVVSKLVLQFMKDKGNSLSHPV 240

Query: 241 KFAVGYNRRGIEETEMKKTSKDSSGANVMLGRDKCFSIVAAAVKDVVSNAIVDLKSPELC 300
           KFAVGYNRRGIEETEM KTSKDSSGA +++GRDKCFS VAAAVKDVVSNAIVDLKSPELC
Sbjct: 241 KFAVGYNRRGIEETEM-KTSKDSSGA-IVMGRDKCFSAVAAAVKDVVSNAIVDLKSPELC 300

Query: 301 ILVELLPVSGLPLESLVVGVSVLPSNLVTTKPRLCIKALTSDTKAKS 347
           IL+ELLP+SGLPLESLVVGVSVLPSNLVTTKPRLCIKALTSD KAKS
Sbjct: 301 ILIELLPLSGLPLESLVVGVSVLPSNLVTTKPRLCIKALTSDPKAKS 342

BLAST of Bhi07G000020 vs. NCBI nr
Match: XP_022994263.1 (uncharacterized protein LOC111490047 isoform X1 [Cucurbita maxima])

HSP 1 Score: 529.6 bits (1363), Expect = 7.8e-147
Identity = 289/347 (83.29%), Positives = 304/347 (87.61%), Query Frame = 0

Query: 1   MAEKEEQINGCHLKPVAEASKPTENVPETEWKTMTPWEQHSAVISIPRFDYNAPSALLQR 60
           MAEK EQ  GC  K VAEASK  ENV ETE KTMTPWEQHSAVISIPRFDYNAPSALL  
Sbjct: 1   MAEK-EQFYGCRPKHVAEASK--ENVAETERKTMTPWEQHSAVISIPRFDYNAPSALLHH 60

Query: 61  CQSGFLITCSIKREKSATKEAISILEKYIQYFSSSMPETLTVSDENKTSKRRKVCTGDVD 120
            QSGFLITC+IKREKSATKEAISILEKY QYFS+S PET   SDEN+TSKRRKVCT D+D
Sbjct: 61  RQSGFLITCAIKREKSATKEAISILEKYSQYFSNSTPETSESSDENETSKRRKVCTEDID 120

Query: 121 PRSDEGVERSTDEHA-GTSLISTKSEAKVEKCSPISLVKLTRSGLLLFTFIKNISPDTVY 180
            +S E   RSTDEH  GTS IST+S AKVEKCSPISLVKLTRSGLLL TF K+ISPDTVY
Sbjct: 121 HKSVENEGRSTDEHVNGTSTISTRSGAKVEKCSPISLVKLTRSGLLLLTFAKDISPDTVY 180

Query: 181 IVKDIIQCLEAGTLKSPAWCHRIFPIQATCCLNENDLHAVVSKLVLHFMKDKGNILSQPV 240
           IV D+IQ LEAGTLKSPAWCHRIFPIQ+TCCLNENDL  VVS LVL FMKDKGN LS PV
Sbjct: 181 IVSDLIQSLEAGTLKSPAWCHRIFPIQSTCCLNENDLRGVVSNLVLQFMKDKGNSLSHPV 240

Query: 241 KFAVGYNRRGIEETEMKKTSKDSSGANVMLGRDKCFSIVAAAVKDVVSNAIVDLKSPELC 300
           KFAVGYNRRGIEETEM KT KDSSGA +++GRDKCFS+VAAAVKDVVSN IVDLKSPELC
Sbjct: 241 KFAVGYNRRGIEETEM-KTCKDSSGA-IVMGRDKCFSVVAAAVKDVVSNTIVDLKSPELC 300

Query: 301 ILVELLPVSGLPLESLVVGVSVLPSNLVTTKPRLCIKALTSDTKAKS 347
           IL+ELLP+SGLPLESLVVGVSVLPSNLVTTKPRLCIKALTSD KAKS
Sbjct: 301 ILIELLPLSGLPLESLVVGVSVLPSNLVTTKPRLCIKALTSDPKAKS 342

BLAST of Bhi07G000020 vs. NCBI nr
Match: XP_022930026.1 (uncharacterized protein LOC111436464 isoform X1 [Cucurbita moschata])

HSP 1 Score: 526.9 bits (1356), Expect = 5.0e-146
Identity = 289/347 (83.29%), Positives = 304/347 (87.61%), Query Frame = 0

Query: 1   MAEKEEQINGCHLKPVAEASKPTENVPETEWKTMTPWEQHSAVISIPRFDYNAPSALLQR 60
           MAEK EQ  GC  K VAEASK  ENV ETE KTMTPWEQHSAVISIPRFDYNAPSALL  
Sbjct: 1   MAEK-EQFYGCRPKHVAEASK--ENVAETERKTMTPWEQHSAVISIPRFDYNAPSALLHH 60

Query: 61  CQSGFLITCSIKREKSATKEAISILEKYIQYFSSSMPETLTVSDENKTSKRRKVCTGDVD 120
            QSGFLITC+IKREKSATKEAISILEKY QYFS+S PET   SDEN+TSKRRKVCT D+D
Sbjct: 61  RQSGFLITCAIKREKSATKEAISILEKYSQYFSNSTPETSESSDENETSKRRKVCTEDID 120

Query: 121 PRSDEGVERSTDEHA-GTSLISTKSEAKVEKCSPISLVKLTRSGLLLFTFIKNISPDTVY 180
            +S E   RSTDEH  GTS IST+S AKVEKCSPISLVKLTRSGLLL TF K+ISPDTVY
Sbjct: 121 YKSFEDEGRSTDEHVNGTSTISTRSGAKVEKCSPISLVKLTRSGLLLLTFAKDISPDTVY 180

Query: 181 IVKDIIQCLEAGTLKSPAWCHRIFPIQATCCLNENDLHAVVSKLVLHFMKDKGNILSQPV 240
           IV D+IQ LEAGTLKSPAWCHRIFPIQ+TCCLNENDL  VVSKLVL FMKDK N LS PV
Sbjct: 181 IVSDLIQSLEAGTLKSPAWCHRIFPIQSTCCLNENDLRGVVSKLVLQFMKDKVNSLSHPV 240

Query: 241 KFAVGYNRRGIEETEMKKTSKDSSGANVMLGRDKCFSIVAAAVKDVVSNAIVDLKSPELC 300
           KFAVGYNRRGIEETEM KTSKDSSGA +++GRDKCFS+VAAAVKDVVSNAIVDLKSPELC
Sbjct: 241 KFAVGYNRRGIEETEM-KTSKDSSGA-IVMGRDKCFSVVAAAVKDVVSNAIVDLKSPELC 300

Query: 301 ILVELLPVSGLPLESLVVGVSVLPSNLVTTKPRLCIKALTSDTKAKS 347
           IL+ELLP+SGLPLESLVVGVSVLPS LVTTKPRLCIKALTSD K KS
Sbjct: 301 ILIELLPLSGLPLESLVVGVSVLPSKLVTTKPRLCIKALTSDPKTKS 342

The following BLAST results are available for this feature:
Match Name     E-value    Identity (%)    Description
AT1G09290.1    7.4e-83    53.73           unknown protein

Match Name                        E-value     Identity (%)    Description
tr|A0A0A0KQX3|A0A0A0KQX3_CUCSA    2.6e-151    85.01           Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G273440 PE=4 SV=1
tr|A0A1S3CD85|A0A1S3CD85_CUCME    5.1e-139    82.69           uncharacterized protein LOC103499095 OS=Cucumis melo OX=3656 GN=LOC103499095 PE=...
tr|A0A2R6P3P4|A0A2R6P3P4_ACTCH    3.6e-100    61.93           Checkpoint protein like OS=Actinidia chinensis var. chinensis OX=1590841 GN=CEY0...
tr|A0A1Q3CS49|A0A1Q3CS49_CEPFO    6.4e-97     61.59           Uncharacterized protein OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_26456 PE=4...
tr|A0A2P5AJJ5|A0A2P5AJJ5_9ROSA    3.0e-94     58.93           THUMP domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_34886...

Match Name        E-value     Identity (%)    Description
XP_011655096.1    4.0e-151    85.01           PREDICTED: uncharacterized protein LOC101219243 isoform X1 [Cucumis sativus] >KG...
XP_022158307.1    4.9e-149    83.00           uncharacterized protein LOC111024819 [Momordica charantia]
XP_023553260.1    1.6e-147    83.86           uncharacterized protein LOC111810722 isoform X2 [Cucurbita pepo subsp. pepo]
XP_022994263.1    7.8e-147    83.29           uncharacterized protein LOC111490047 isoform X1 [Cucurbita maxima]
XP_022930026.1    5.0e-146    83.29           uncharacterized protein LOC111436464 isoform X1 [Cucurbita moschata]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term          Definition
GO:0003723    RNA binding
Vocabulary: INTERPRO
Term          Definition
IPR004114     THUMP_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006400 tRNA modification
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003723 RNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Bhi07M000020    Bhi07M000020    mRNA


Analysis Name: InterPro Annotations of wax gourd
Date Performed: 2019-11-17
IPR Term     IPR Description     Source         Source Term      Source Description                           Alignment
IPR004114    THUMP domain        PFAM           PF02926          THUMP                                        coord: 197..305; e-value: 1.1E-7; score: 31.9
None         No IPR available    MOBIDB_LITE    mobidb-lite      disorder_prediction                          coord: 114..138
None         No IPR available    MOBIDB_LITE    mobidb-lite      disorder_prediction                          coord: 114..132
None         No IPR available    PANTHER        PTHR13452        THUMP DOMAIN CONTAINING PROTEIN 1-RELATED    coord: 13..345
None         No IPR available    PANTHER        PTHR13452:SF9    SUBFAMILY NOT NAMED                          coord: 13..345
None         No IPR available    CDD            cd11717          THUMP_THUMPD1_like                           coord: 154..323; e-value: 2.77541E-14; score: 67.9938
None         No IPR available    SUPERFAMILY    SSF143437        THUMP domain-like                            coord: 222..304
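The InterPro hits above are given as residue coordinate ranges on the protein. The sketch below shows one way to compare them, computing each range's length and listing which hits fully enclose the Pfam THUMP range. The tuple layout and helper names are illustrative assumptions, and coordinates are treated as 1-based and inclusive.

```python
# (source, source term, start, end) copied from the InterPro annotation rows above.
HITS = [
    ("PFAM", "PF02926", 197, 305),
    ("MOBIDB_LITE", "mobidb-lite", 114, 138),
    ("MOBIDB_LITE", "mobidb-lite", 114, 132),
    ("PANTHER", "PTHR13452", 13, 345),
    ("PANTHER", "PTHR13452:SF9", 13, 345),
    ("CDD", "cd11717", 154, 323),
    ("SUPERFAMILY", "SSF143437", 222, 304),
]

def range_length(start, end):
    # Assumption: residue coordinates are 1-based and inclusive.
    return end - start + 1

def enclosing_hits(hits, start, end, exclude):
    """Source terms whose range fully contains start..end, skipping the term itself."""
    return [term for _, term, s, e in hits if term != exclude and s <= start and e >= end]

for source, term, s, e in HITS:
    print(f"{source:12} {term:14} {s:>3}..{e:<3} length {range_length(s, e)}")

print(enclosing_hits(HITS, 197, 305, exclude="PF02926"))
```

In this listing the PANTHER and CDD ranges enclose the Pfam THUMP range, while the SUPERFAMILY range lies inside it.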

The following gene(s) are paralogous to this gene:

None