Cla97C07G137470 (gene) Watermelon (97103) v2

Name: Cla97C07G137470
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: Hydroxyproline-rich glycoprotein family protein, putative isoform 1
Location: Cla97Chr07 : 25004633 .. 25013620 (+)
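
The span covered by the location above can be checked with a short calculation. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the usual GFF-style convention; the page itself does not state this), and `feature_length` is a hypothetical helper name, not part of any database tool.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature, ASSUMING 1-based, inclusive start/end coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field of this record.
gene_length = feature_length(25004633, 25013620)
print(gene_length)  # 8988 under the inclusive-coordinate assumption
```

If the coordinates were instead 0-based and half-open, the length would be one less, so the convention is worth confirming before comparing against the gene sequence length.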
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGTAGATCAATCGGAAGAAGGGTTCCAGGTTTTTCACTTCTATCAAATGCCAACAAACTAAGCTTTGTGCCATTTTCTTCTTCCTCTTCTTTCGGTGGTCATGGTCGTGGCCGAGGTCGAGGTGGCTTTCCCTCCCATGCCGGACCCTTTGATTTCACTTCTCCAGTCCCCGGTCAAGAAGATTCAAATACGTCTAAACAAGATTCCGTAGGTTCTCGTCCTACTCCTGGCCTTGGACATGGTAAACCCACTCCTTCCTCCCCAATTCTTCCATCTTTCTCTTCCTTTTCGCCCTCTGTTAACCCATCGTCTGCTGGTCGGGGTAGAGTTGATGCCCCGCCACCGATTCGTTCCCCTTCTGGTCCAGGTTCGTCGAAGGGGTCAGATTCGGAGCCTAAGAAACCTGTGTTTTTCTCGAAGGATAATGCGGGCGACTCGGCTTCAACTACTCGACCTGGCGCGTTACACAGGGGTGTGGGAGAAAGAAACTTACCTGATACTTTACTTTCTGGATTTCCCGGTGTTGGACGAGGAAAACCTATGAAGCAACCAGGGCAGGAAGCTCAACCAAAGCAGGAGAATCGTCATCTCAGACCTAGTGGAGTTGGTGAGCCTGGAAGGGGTCGTGGCGGCGGCCAAAGGATGAGCCGTGATGGACCTGGGAGGAACACTGGTGGGATGATGTCAAGGATCGGGCCTGATGGTGAAGATGGTGGTGGTCGAGGGAAAAGCGGGCTCCGGGGCAGAGGAACGTTTCGAGGCAGAGGAGGATTCAGAGGGAGAGGAAGGGGGGTCTTTAGAACTGGGGAGAGATGGGAGAGAGGAAGAGTTCAAGATATGGAGGATGGATATGCTGCAGGACTTTATTTAGGCGACAATGCAGACGGTGAAAAGCTGGCAAAGAGGATTGGGACTGAACATATGAACCAACTGGTTGAAGGGTTTGAAGAGATGAGTGTTAGAGTGCTGCCTTCGCCACTGGAGGAAGAGTATTTGGATGCGGTGCATACTAATTATATGGTGAGCACTTCCTACGTCTTTTTTGTCCCTGTTTTTGCTCTCTAAGGACATAAAATGGACTGCGTATTAGGATTAGCTCTAAACTATTTTGGCATGATTTTTTGAAATGGAAACAAATTTTCATTGATATGAGGCTAGTGCTCCGAGTATAAAGGGACATAGAAATCATACAACTCAAATAGAACTATGGGTCAAGAGAGGCACTAGGCTATCTCAACTAGGTTGACACAATCTTAGCATCCTTGTCATTTCCCATTACTTAATATATCATATAGAATTAAGACATACTAACAAAAACAACAATGCTAAACAGTAACAATGAAAGCATTGAGGCATATGTCTTGTATGAAATAATTTGCAAAGAACTTGGATTGTGAACTCCATGAAGAAGCTAGAACAAAAGTTCTGAAAGTATGGCTTTAACTGCATTTGACCATAAAACTTGTGTTTTGGATGAGAGTGAGGGGCCAGCCAACAGTTGAGATACATTATCCTGGAAAGAATTTCCAAAAGTCCAAGACAAATTAAAATATGAGAAGAGTTTTCACCAGCAATTAGTAGAAAATGGACACTCAAAAAATAGGAGCGACCTTATGCTGTAACATGGCAGCACAGTTAAGAGTGCCAAAAATCATAGTCCAAACCAGAATGTTGACTCTTTTTGGGCTCTTGGACTTCTATAATGCTTGAAAAATTATCTTATCTATAGGAGAGGAAGCAATTAAAAAAAAAAAAGACAAAACTGTTTTCGCATGATTGGAAAGTATACTGTTTTCGGTGTTTGTGATCGAGTGATGAATTGAATAGTGTGCTGGTTGAATTATTACAAATCTGATTTTGTTTAAGTTTATCCTGAAAATTGTAGAGATTGGGTTACTAACGCTCGTTTTGACTTTTTTCTGTATGGCAGATAGAGTGTGAGCCGGAGTACTTGATGGGAGATTTTGAAAGTAATCCTGATATTGATGAGAATCCAC
CTATTCCTCTTCGGGATGCACTTGAGAAGATGAAACCATTTTTAATGGCCTATGAAAACATCCAAAGCCACGAAGAGTGGGAGGTATCATTTTCTGTTACCACTCCATGTTTTCATTCCTTTATTCTTGGGTCTTTTCTACTTGTTAATTGTGCTCTGCATTTGAGTGAAGTGTATATGTGAGTTATTTTTGTGGTTTTTGAGCATTGATGAAAATATCCACTATACTCTACTTACGTTGGTATTTCCTCGATCTTGCTATTATTAGATATTATATTTATCTGATACAGGTAAAGATGCAGGAAGTGTGCAAAGTATATAGACGTCTTACATTAAATTATTTAATTGCTCGTCTCAGTTGCTATTTCTCTAGAGGAGTTTTTAAGCTGACATTTTTGAGAATTGCATCAACAAGCTTGCTTAGGATATTTATCAAGAAAATTGAAAGTTGTTCAAAAGAAAGGACTAGACACAAGAGCCTGAAGTAGAGGAGCCCACAAGAACCAAACAAGGAATTACTGGTAGTGGCGGCAGTTTTTAGAATGAAAAACAATGGAAAGGTACTGAAAAACTCTGTGAGAACACTATGAGGGATTATTGATTCTTCTATCAATGAAACCTACATATGTTATATCCCTGAAAAGTCAGTCGCAAGGAGAGTGAAAAACTTCATATCCATCAACCGTGTTACCAACATCTTATGAGATTGATTATTGCCAAGGTGTTGACATATTGATCGAGAAATGTCTCGAAGTTTCACAATGCTTTTATGATGAGAGAGGTTATGGATGAGTTCTGGTTGGTAATGATGCCATTTTTTTGGAATAACTAACATTGCAAGGTACAAGACAGGTGATCCCCCATTACTACCCTTTGTCGTGAATGAATGCACAAACAAAATAACAGGGAGATACTAATATATTTTCATGCTATAAAAAACCAGTTAACAGGAAATGGTCAGACCAATATTATGGAAAGATCAAGTACATTGACAGCCTAAATGCCGTTACTATCATCTACATCTTCATCGCTTGTCATTGTCCTCCTCTTTGCTGCCCTCATCTCCATCCTCATCATCGTTCACCTCTGACATTTACTTCTCAGATTCATCTTCTTTCGCAACATTGGCTCCACTGGCCTGTTCCTTGTTATAGGCCTTCATATTTTTTTCATATTCAACCTTCCTCTTGTCAGCCTTAGCATTGTAAGGTGCTTTTTCAGCATCCGACATTGATTTCCATTTCTGTCCAGCAGCTTTACCAACAGTAGACACTGCTTTGTTATTCTTTTTTTGAAACACAAACTGCATAGCATTTTTTATTATTAAACTGATGAACAAAAAATCAGACAAACTAATTGTTTACCTCATTTTTCCCTCTCAAAAAACATCATGGAATATAAAACCCCCACAAATGAAGCACAAATTCAAGAACACAAAGAACATGAGGCAAACCACAGTTTGAACCATGCACAATCAATTCTCTCAAGACTTCTAAAATCTCTTCTCCAATTCATCCATGGTCATTTTGTTTTTTGGCACTGGCTGTAAAGCAGTTCTCACCTTCCCCCCACCTCCAACAACTTTTGTGTTGATGTTATTATTGTCCTTCTTTTTGCTAAGCTGTTATTCGGATTCTCCTCATTGAACTTCTTCTTGAACTCTTCCATAAAAACGAAGAAGGCACTGGCAGGGCTCTTGGGTTTGTTAGGATCTTACCCTGCCTTCTTACTTCCTTTCTTGCCCACGTTGGCTGCACCTTTTTGCACTGCAAGCTTGGTGTCTGCTTTCTTTGACTCCGTCTTCAATTTACTGCCTTTCATTTCAAATCCGAAGGGAGGAGGTTAGGTTAGGATTAGGGTTTGAAGAGGGAGGAAGAAGCAGAGGTGCGTGTGAGAGAGAAAGAAGGAAGGGGTGGTTGGTAATGATGCTATTGAGAACCATATAGCTTGTTAGTGAGAAAGAATTGTTTTCAAGATTGATTTTGAGTAAGACGTTCTCCC
CCTCCTTTTGTATCATTCTTTTTGATAATGAAAGTTGTGTTTGTTATGAAAAAGAAGAAGGAGAAAAAAAAGAGATTGGTTTTGAGAGACTGATAATGTGGATTTGGGTTTCTTGGATAAGGTGGTTTGAATGAAAATTTTTGGGTTCAAGGGGAGATCTTGGTTATGGAATTGCATTAGATTTGTTAACTATCCTATTTCTGGTAAATAGTAAGCCTAGGGGTGGAATTTTTGCCTCTACTGGGACTGGGATTAGGCGAGAGACATTATTTCCTGTCCTTTTCCTAGTGGTTGGTGATGTCCTTGGTAGGTTAGAATGAATCAGGGTTTGGTTTGGGTTTTCACATGGGCGAAGAGTGGATTTGCCACTTTTTTAAATAATTTGAAAAATATTAGGCACTTGTATTAACCAAAAAAGAGCAGGCAACTCAAGCATAGGAGAAGGAGAAAACCCTCTCCGTATGGCCTTCCAATTGTGTCGTCCAAACCACGAAGGACTACGTGCAACCTTCCAATTGTGTGAAATCACAAGGAGGGTAAAATTTCTTGTGGTTGGAACACCACCAAGATGCTGTATACATCATTCCAAGAAAAAGACATGAGTAGTATTGCTATGCTAAAGGATCCTCTAATTCCTTTCCTTCCAAAGGAGCCAAAAAAGTACTAATGTTGCAGCTCCATGTGACTTTGGCTCTAGCTCTACTTCTCAGCCTTACTCCAGGTCTATCCAAACACTCCAACAACCAATCCTCAATTCAGTTTGAAAGGCACGTTGAGAAATCAAATTCCTTGAGCAGCCTCATCCACCCCCTCAAAGCAGAAGGTCAATGGGTGAAAAACTCCTACTATTTAACAATGACGGCAGATCGAATCGAGGGAGAAAACATCCACCTAGAGGATTTTCTTTGGACTCTATTAAACTATTCGCCGTGTCGAGACTTCTATACCAAAGAGTCCAAAGAAAAAACTTAACTTTATTTGGGATCTTAAATTTCCATATCATCCATACCATGAATTCACCTTTTTGTTTTTTAGTTTGTTGAGGACAATTTTTTCTTGCATGAGTAGGAATTCATTAATTGTTGAGAAAAATTTGCCGCTTTTTAGAGTCGTATGGTATTCAAGATCAATTGGAGAAAGGCTTCATTGGTCGGTATTAATTGTGATTTCGTTAAATTGGATAGATGAAGTTTTAAGTTGGTCTCATTTTCACTAATTGCCTTGGTCTTTCTCTTAGTCATTATTCCAATTGTCTCTACTTGTAAATCCTATTACAAAGAAGTTTATAAACGGCTTTGCACATGAAAAGTGCATTCTTCTTTGAAGGTGTTAGGTTGATACTCATCTAGTCTGTGCTTGAGTGGGAGCTCGTCGTACTTCTCCTCTCTCTTTTAGTCGATTTCAAAGATCTTTTTCAATTATTCTGTAGGTACCCCCCAGCTAAAGCCCCTTTCTCTAGTTAGATTGCCTTCTGTGGGCTTGGCTTTTCGTATGCCCTTGTATTTTTTTTCCTTCTTTCTATCTCAATAAAAGTTGTTATCCAAAGAAAAAAAAAAAAAATCACCTTTTAGTAGTTCTAAGGTGGAAAAAATGATAAGAGATTTTTTATTGTAGGGTGTGGAAGAGAGCTGGACGTTCCAGTTCAGTCAGGTGTAGGAGGTTTCTTTTAGAACAATTGGGGAAGGTGTGAGGAATTGGCCATGAAAAACTTGAGGTTTGTTATTTGTAGTACACCCTTGTTGTCTAAGTAGACGTGACATTTCCCTTTTGAGTTTGATGTTGTGTGGTGTAGGACGGTAGCGAGCAAACTGCAAAGATGTTTGTCACCAATCTAGTGGGTCTTGATGGTTGGATTTGAAGGAATCCTAGGAACGTTTGGAAGGTTGTGGTTGTCACATTTGTTTCCTTTTCTTTTACTAAGTTCTTGCAATTATCAATTGGATATGACTTGATTGTCCTATAGGAGGGATCTTAGGTAGGGGTAGTTCATTATGCCTTTG
ATTCCTTCACTTGTATCATCTTTCTTCTCATTATTCCCTGTATGGGGATTCCAATTAATGTATATTTGGCTTTAAACTCCATTGTCTGATAAAAATAACTTTTGATTTCACCTATTTACTCTCCTCCTTGGGTGGGTTAGTGTTAGGAATACTAGGGATATTAGGGCATGTTTGTCAGTAGCAAGGAGTTGGTTGGTGGAAGTCAATTATAAATAGAGGGGTTATGGGGTATTTCTACCAGTTTAATTGTTGTCTCGGGGCTAGTGGGGGTGGGGGGTTTTCTTGTAAGGACTTTTTCTCTATATCATACTACTCTGATTCTTTTATGCTCTCTTTGTTCACTCTGGTGTAAAAGGTAAGAATCTCCAAGAAGGGCATGTTCTTTCTTTGACCCGTATTACATAGAGGATTAACATCATTATCAATTTATCACTGTCACCATTGGCTACCATCTAGAGGTAGCCTTTTGTGCTGTTGGAGCACATTGGTGCCCTTTATGTAAGTTTGCTTTGGAGGACTTGTCACATTTTGTTGGGTTGTTAGTTCACACATTGGGTTTGCGGCAATTGTTTGGGAGCCTTTGGGGATGTGTCTGGTTCTTTTTAACAGCCTATCAAGTCGGTCTTCTGATAATTGAGGAGGCGTTGTATTGCCAGCCATTTTGGAATAAGGGAGAGATCCTTGATACGATCTTTAAATCAAGTCTAATTTTGGTTAAATTTGGTGCTTATATCTAGTTACTTTACTTCCTTAAGATGTTGGTCTACATTCGTTTCATTGTGATTTTATCTAGTGATTAGGATTTTTGGTGTTGGAGATGTTATATGTATATATATATACACGCCAACTGCTTTTTGTATTTTCATTATTTTCAAATATTGCTTTCCTTTGATTTAGAGTTAGAGAAGGTTACATCTGAAGATCTATACACTTGAAAATGTTTCCTTCTTAGCTAGTGCTTTAATGGGGTTGGAGAGGGTACTAAATATAAAGTTTCAGCCTGGAAGTAACTCTGTTCAGGTTTCAGGAATCCAAGAATTAGGTAAGTCCTCTGTTGTTGAGTGATGTTATGTATGATTAGAAAGGCCAGCCTACTGTCAACATTTAGTCATGTTTGGCCTTGAGCTCTTTGAAATTTCGAACCTTCACATCAGAATGAAAAATATTGAGTTCAAATTTTATCAGCATCCTCCTGGATTGTCATTCATCGATCTCTTTCTCTCTCTATCTCCCTTTTCTTTTTTTTTTTTTTAAAAATAAAAATTTCAACTGGAGAGGCTTTCTTAACTCAGCTTTGGTTGGGAGGATGACTTGAAAATATGATACGCTCCTTATAAAGGATGGCTCTTCTTCTTCTTCATCTAACTTTTTCATTTGGCTGTACAAGGAAATCATGGAAGAAACCATGCAAAGAGTTCCACTGCTGAAGGAGATAGTTGACTATTACAGTGGACCAGATAGAGTAACTGCAAAGCAACAACAAGGGGAGCTGGAAAGAGTTGCAAAAACACTTCCACAAAGTGCGCCTAATTCCATAAAGCAATTCACCAATCGTGCAGTTCTTTCTTTACAGGTTAGTGAAGTCACCTTAATGACTTATTATTCTGAATTATTTTATGAAAAGAAAATAGGGAAAAGAAAAAAAAAACCCAGGAAGAAATTCTAAGCTGGAAACATACGTTTCAAATCACAATGGATGTATACTATATTTGTATGTATTTTCTTTCTCGGGAATGCAAGAAGACTATGTTTAAGAATTATTCCAAATTTGTAATGATCAATATCCATGAATGCCATGAATTATTTTTCATCCAAAAAAATGTATGCCATGCTTTACATATTTTATGAGTAATAGTTGCAATGGAAACAGCATTTGTAGCACTATACACAATTTAATGTTACCATCAAATATAGTGCATATTAATTTAGTATTAATAGTATTGAATAAAGTTTTACAAAAAATCATTATTTTTATATCGAAGCTATTGCAGTTTTTTTTT
TCTTTTTTTTTGGGGGAGGGGGGGGGGGGGAAGGGGGGGGAAGGGGGGTAGAATTGTTTAATTGCATTCTTACTTGGTAGGAATATTGCATACCTACTTACAAGTTCCATTAATAGGTCTTACAAGTACACTGACAGCTTCTACCTAAGTTATTTAGAATACATGTAATAATAGGAATGAATTGTTGGGATGTGATGTAAAGGGAACCTGTAAATTTCATACACCAATGAAGTTGTTTCTTATAGAAAAAAGAAAAAGAAAAAGAGAAGAATTGTATAGTGAATTAAGTATTTGTGTAGAATGAAGAGATGGTGATGCTATTCTGTCCTCTTTGTTTTGGATGTTCAGGGGAACTAAGTACTGATTTGTGAAACCCTGAGATGTGCGGGAACACGAATGCTTGTTAACTAACATTGCTTTTCACTTTTCTAACCAAAGGAAAACCCTGAGAAAAGAGCATTGAGTAGAAATAAGAATGACGAACGGGCATAAAACTATAAAATGATATGGGCTTAAATGCACAGATACAATGCAACCAGAAAAGGAAAAAGAAAAAACAGAAATTTCATTTGGTTTTGCTGCAATAATCTTAAACTGATATAAATGAACTGCTCTGTATTGATTTTATCTGTTTATGGAGCCAATATCTCATTTTACACAACTATTGTGTATATTGTTACACTAGGCAGCCTTATTTAATGTTGTTATGATCTACCTAGATCCTGAAAGTCAAAATGTGTAAGAACAAGGCATTTTTAGTTCAGTCTCTGTACTCCAGAGGTTTAATTGACATTTCCTGTCTATGTTGCTTGTTAGTGAGTAATACAATAGGCTTTGACATTCAATATGAAGATATTGAAAGGGATAGAAATGTGGAAGCTGATTTGCACAATTCTTCTATTTGCAGAGCAACCCAGGGTGGGGATTTGACAAGAAATGCCAGTTCATGGACAAGCTTGTTAGGGAGTTCTCCCAGCGATACAAGTAG

mRNA sequence

ATGAGTAGATCAATCGGAAGAAGGGTTCCAGGTTTTTCACTTCTATCAAATGCCAACAAACTAAGCTTTGTGCCATTTTCTTCTTCCTCTTCTTTCGGTGGTCATGGTCGTGGCCGAGGTCGAGGTGGCTTTCCCTCCCATGCCGGACCCTTTGATTTCACTTCTCCAGTCCCCGGTCAAGAAGATTCAAATACGTCTAAACAAGATTCCGTAGGTTCTCGTCCTACTCCTGGCCTTGGACATGGTAAACCCACTCCTTCCTCCCCAATTCTTCCATCTTTCTCTTCCTTTTCGCCCTCTGTTAACCCATCGTCTGCTGGTCGGGGTAGAGTTGATGCCCCGCCACCGATTCGTTCCCCTTCTGGTCCAGGTTCGTCGAAGGGGTCAGATTCGGAGCCTAAGAAACCTGTGTTTTTCTCGAAGGATAATGCGGGCGACTCGGCTTCAACTACTCGACCTGGCGCGTTACACAGGGGTGTGGGAGAAAGAAACTTACCTGATACTTTACTTTCTGGATTTCCCGGTGTTGGACGAGGAAAACCTATGAAGCAACCAGGGCAGGAAGCTCAACCAAAGCAGGAGAATCGTCATCTCAGACCTAGTGGAGTTGGTGAGCCTGGAAGGGGTCGTGGCGGCGGCCAAAGGATGAGCCGTGATGGACCTGGGAGGAACACTGGTGGGATGATGTCAAGGATCGGGCCTGATGGTGAAGATGGTGGTGGTCGAGGGAAAAGCGGGCTCCGGGGCAGAGGAACGTTTCGAGGCAGAGGAGGATTCAGAGGGAGAGGAAGGGGGGTCTTTAGAACTGGGGAGAGATGGGAGAGAGGAAGAGTTCAAGATATGGAGGATGGATATGCTGCAGGACTTTATTTAGGCGACAATGCAGACGGTGAAAAGCTGGCAAAGAGGATTGGGACTGAACATATGAACCAACTGGTTGAAGGGTTTGAAGAGATGAGTGTTAGAGTGCTGCCTTCGCCACTGGAGGAAGAGTATTTGGATGCGGTGCATACTAATTATATGATAGAGTGTGAGCCGGAGTACTTGATGGGAGATTTTGAAAGTAATCCTGATATTGATGAGAATCCACCTATTCCTCTTCGGGATGCACTTGAGAAGATGAAACCATTTTTAATGGCCTATGAAAACATCCAAAGCCACGAAGAGTGGGAGGAAATCATGGAAGAAACCATGCAAAGAGTTCCACTGCTGAAGGAGATAGTTGACTATTACAGTGGACCAGATAGAGTAACTGCAAAGCAACAACAAGGGGAGCTGGAAAGAGTTGCAAAAACACTTCCACAAAGTGCGCCTAATTCCATAAAGCAATTCACCAATCGTGCAGTTCTTTCTTTACAGAGCAACCCAGGGTGGGGATTTGACAAGAAATGCCAGTTCATGGACAAGCTTGTTAGGGAGTTCTCCCAGCGATACAAGTAG

Coding sequence (CDS)

ATGAGTAGATCAATCGGAAGAAGGGTTCCAGGTTTTTCACTTCTATCAAATGCCAACAAACTAAGCTTTGTGCCATTTTCTTCTTCCTCTTCTTTCGGTGGTCATGGTCGTGGCCGAGGTCGAGGTGGCTTTCCCTCCCATGCCGGACCCTTTGATTTCACTTCTCCAGTCCCCGGTCAAGAAGATTCAAATACGTCTAAACAAGATTCCGTAGGTTCTCGTCCTACTCCTGGCCTTGGACATGGTAAACCCACTCCTTCCTCCCCAATTCTTCCATCTTTCTCTTCCTTTTCGCCCTCTGTTAACCCATCGTCTGCTGGTCGGGGTAGAGTTGATGCCCCGCCACCGATTCGTTCCCCTTCTGGTCCAGGTTCGTCGAAGGGGTCAGATTCGGAGCCTAAGAAACCTGTGTTTTTCTCGAAGGATAATGCGGGCGACTCGGCTTCAACTACTCGACCTGGCGCGTTACACAGGGGTGTGGGAGAAAGAAACTTACCTGATACTTTACTTTCTGGATTTCCCGGTGTTGGACGAGGAAAACCTATGAAGCAACCAGGGCAGGAAGCTCAACCAAAGCAGGAGAATCGTCATCTCAGACCTAGTGGAGTTGGTGAGCCTGGAAGGGGTCGTGGCGGCGGCCAAAGGATGAGCCGTGATGGACCTGGGAGGAACACTGGTGGGATGATGTCAAGGATCGGGCCTGATGGTGAAGATGGTGGTGGTCGAGGGAAAAGCGGGCTCCGGGGCAGAGGAACGTTTCGAGGCAGAGGAGGATTCAGAGGGAGAGGAAGGGGGGTCTTTAGAACTGGGGAGAGATGGGAGAGAGGAAGAGTTCAAGATATGGAGGATGGATATGCTGCAGGACTTTATTTAGGCGACAATGCAGACGGTGAAAAGCTGGCAAAGAGGATTGGGACTGAACATATGAACCAACTGGTTGAAGGGTTTGAAGAGATGAGTGTTAGAGTGCTGCCTTCGCCACTGGAGGAAGAGTATTTGGATGCGGTGCATACTAATTATATGATAGAGTGTGAGCCGGAGTACTTGATGGGAGATTTTGAAAGTAATCCTGATATTGATGAGAATCCACCTATTCCTCTTCGGGATGCACTTGAGAAGATGAAACCATTTTTAATGGCCTATGAAAACATCCAAAGCCACGAAGAGTGGGAGGAAATCATGGAAGAAACCATGCAAAGAGTTCCACTGCTGAAGGAGATAGTTGACTATTACAGTGGACCAGATAGAGTAACTGCAAAGCAACAACAAGGGGAGCTGGAAAGAGTTGCAAAAACACTTCCACAAAGTGCGCCTAATTCCATAAAGCAATTCACCAATCGTGCAGTTCTTTCTTTACAGAGCAACCCAGGGTGGGGATTTGACAAGAAATGCCAGTTCATGGACAAGCTTGTTAGGGAGTTCTCCCAGCGATACAAGTAG

Protein sequence

MSRSIGRRVPGFSLLSNANKLSFVPFSSSSSFGGHGRGRGRGGFPSHAGPFDFTSPVPGQEDSNTSKQDSVGSRPTPGLGHGKPTPSSPILPSFSSFSPSVNPSSAGRGRVDAPPPIRSPSGPGSSKGSDSEPKKPVFFSKDNAGDSASTTRPGALHRGVGERNLPDTLLSGFPGVGRGKPMKQPGQEAQPKQENRHLRPSGVGEPGRGRGGGQRMSRDGPGRNTGGMMSRIGPDGEDGGGRGKSGLRGRGTFRGRGGFRGRGRGVFRTGERWERGRVQDMEDGYAAGLYLGDNADGEKLAKRIGTEHMNQLVEGFEEMSVRVLPSPLEEEYLDAVHTNYMIECEPEYLMGDFESNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIMEETMQRVPLLKEIVDYYSGPDRVTAKQQQGELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVREFSQRYK
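
The relationship between the coding sequence and the protein sequence above can be sketched as a plain codon-by-codon translation. This is an illustrative sketch that assumes the standard genetic code; `translate` and `CODON_TABLE` are names introduced here, not part of the database. Only the first and last few codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for unrecognised codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS in this record.
print(translate("ATGAGTAGATCAATCGGAAGAAGG"))  # MSRSIGRR
# Last four codons of the CDS, ending in the TAG stop codon.
print(translate("CGATACAAGTAG"))  # RYK
```

Both results agree with the start (MSRSIGRR) and end (RYK) of the protein sequence listed above.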
BLAST of Cla97C07G137470 vs. NCBI nr
Match: XP_022136793.1 (uncharacterized protein LOC111008406 [Momordica charantia])

HSP 1 Score: 611.7 bits (1576), Expect = 2.1e-171
Identity = 386/494 (78.14%), Positives = 399/494 (80.77%), Query Frame = 0

Query: 1   MSRSIGRRVPGFSLLSNANKLSFVPFS-----SXXXXXXXXXXXXXXGFPSHAGPFDFTS 60
           MSRSIGR+VPG S L NA KLSFVPFS      XXXXXXXXXXXXXX          FTS
Sbjct: 1   MSRSIGRKVPGLSFLPNATKLSFVPFSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFTS 60

Query: 61  PVPGQEDSNTSKQDSVGSRPTPGLGH--GKPTPSSPILPSFSSFSPSVNPSSAGRGRVDA 120
            VPGQEDSN SKQ+SV S  T GLGH  GKP PSSPILPSFSSF+PSV  SSAGRGRV  
Sbjct: 61  RVPGQEDSNASKQESVDSPGTSGLGHGRGKPGPSSPILPSFSSFTPSVKSSSAGRGRVAG 120

Query: 121 PPPIRSPSGPGSSKGSDSEPKKPVFFSKDNAGDSASTTRPGALHRGVGERNLPDTLLSGF 180
            PPIRS    GSSK SD EPKKPVFFSKDNA +SA++TR GAL RGVGERNLPD+LLS  
Sbjct: 121 SPPIRSTPESGSSKQSDLEPKKPVFFSKDNAANSAASTRLGALDRGVGERNLPDSLLSVL 180

Query: 181 PGVGRGKPMKQPGQEAQPKQENRHLRP------SGVGEPGRGRGGGQRMS-XXXXXXXXX 240
            GVGRGKPMKQP  E QPKQENRHLRP       G+G P RG GGG RMS          
Sbjct: 181 SGVGRGKPMKQPVPEDQPKQENRHLRPRQESGGRGIGGPVRGVGGGPRMSRDEGVRNTGR 240

Query: 241 XXXXXXXXXXXXXXXRGKXXXXXXXXXXXXXXXXXXXXGVFRTGERWERGRVQDMEDGYA 300
                    XXXXXX   XXXXXXXXXXXXXXXXXXXXG FRTGER ERGR QDMEDGYA
Sbjct: 241 MVSRGGPDGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGPFRTGERGERGRAQDMEDGYA 300

Query: 301 AGLYLGDNADGEKLAKRIGTEHMNQLVEGFEEMSVRVLPSPLEEEYLDAVHTNYMIECEP 360
           AGLYLGDNADGEKLAKRIGTE+MNQLVEGFEEMS R LPSPLEEEYLD +HTNYMIECEP
Sbjct: 301 AGLYLGDNADGEKLAKRIGTENMNQLVEGFEEMSGRTLPSPLEEEYLDGMHTNYMIECEP 360

Query: 361 EYLMGDFESNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIMEETM-QRVPLLK 420
           EYLMGDFESNPDIDE PPIPLRD LE  KPFLMAYENIQSHEEWEEI+EE M QRVPLLK
Sbjct: 361 EYLMGDFESNPDIDEKPPIPLRDVLEMTKPFLMAYENIQSHEEWEEIVEEIMQQRVPLLK 420

Query: 421 EIVDYYSGPDRVTAKQQQGELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFDKKCQ 480
           EIVDYYSGPDRVTAKQQQ ELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFD+KCQ
Sbjct: 421 EIVDYYSGPDRVTAKQQQEELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFDRKCQ 480
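
The percentages in the HSP header for this hit follow directly from the match counts and the alignment length. A minimal sketch, with `percent` as a hypothetical helper name, reproduces them from the figures reported above:

```python
def percent(matches: int, aligned_length: int) -> float:
    """Share of aligned positions, as a percentage rounded to two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts copied from the HSP header of the Momordica charantia hit.
print(f"{percent(386, 494):.2f}")  # identity: 78.14
print(f"{percent(399, 494):.2f}")  # similar or identical residues: 80.77
```

The denominator is the alignment length including gaps (494 here), not the query length of 480 residues.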

BLAST of Cla97C07G137470 vs. NCBI nr
Match: XP_022942601.1 (uncharacterized protein LOC111447586 [Cucurbita moschata])

HSP 1 Score: 573.5 bits (1477), Expect = 6.5e-160
Identity = 359/490 (73.27%), Positives = 376/490 (76.73%), Query Frame = 0

Query: 1   MSRSIGRRVPGFSLLSNANKLSFVPFS--SXXXXXXXXXXXXXXGFPSHAGPFDFTSPVP 60
           MSRSIGR+VPG S LSNANKL FVPFS   XXXXXXXXXXXXXX  P+H GPFDF+S VP
Sbjct: 1   MSRSIGRKVPGLSFLSNANKLGFVPFSXXXXXXXXXXXXXXXXXXXPTHGGPFDFSSRVP 60

Query: 61  GQEDSNTSKQDSVGSRPTPGL--GHGKPTPSSPILPSFSSFSPSVNPSSAGRGRVDAPPP 120
           GQEDSN SK +SV SR T GL            ILPS SSF+PSV  S AGRGR D   P
Sbjct: 61  GQEDSNESKHESVDSRGTSGLXXXXXXXXXXXXILPSLSSFTPSVKSSFAGRGRGDGSQP 120

Query: 121 IRSPSGPGSSKGSDSEPKKPVFFSKDNAGDSASTTRPGALHRGVGERNLPDTLLSGFPGV 180
           IRSP    SS  SDSE  KPVFFSKDNAGDSA + RPGAL R VGER+LPD+ LS   G 
Sbjct: 121 IRSPPESRSSNESDSERTKPVFFSKDNAGDSAGSARPGALDRDVGERHLPDSFLSVLSGA 180

Query: 181 GRGKPMKQPGQEAQPKQENRHLRP-------SGVGEPGRGRGGGQRMSXXXXXXXXXXXX 240
           GRGKPMKQP  EAQPKQENRHLRP        G G PGRG GGG R+S            
Sbjct: 181 GRGKPMKQPVPEAQPKQENRHLRPRQEAGGRGGYG-PGRGSGGGPRIS-----RDESVRN 240

Query: 241 XXXXXXXXXXXXRGKXXXXXXXXXXXXXXXXXXXXGVFRTGERWERGRVQDMEDGYAAGL 300
                           XXXXXXXXXXXXXXXXXXX               DMEDGYA+GL
Sbjct: 241 TGRMMPRGGPDGEDGGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDMEDGYASGL 300

Query: 301 YLGDNADGEKLAKRIGTEHMNQLVEGFEEMSVRVLPSPLEEEYLDAVHTNYMIECEPEYL 360
           YLGDNADGEKLAKRIGTEHMNQLVEGFEEMS RVLPSPLEE Y++A+ TNYMIECEPEYL
Sbjct: 301 YLGDNADGEKLAKRIGTEHMNQLVEGFEEMSGRVLPSPLEEGYVEAMDTNYMIECEPEYL 360

Query: 361 MGDFESNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIMEETMQRVPLLKEIVD 420
           MGDFESNPDIDENPPIPLRDALEKMKPFLMAYE IQSHEEWEEI+EETMQRVPLLKEIVD
Sbjct: 361 MGDFESNPDIDENPPIPLRDALEKMKPFLMAYEGIQSHEEWEEIVEETMQRVPLLKEIVD 420

Query: 421 YYSGPDRVTAKQQQGELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFDKKCQFMDK 480
            YSGPDRVTAKQQQGELERVAKTLPQSAPNS+K+FTNRAVLSLQSNPGWGFDKKCQFMDK
Sbjct: 421 SYSGPDRVTAKQQQGELERVAKTLPQSAPNSVKKFTNRAVLSLQSNPGWGFDKKCQFMDK 480

BLAST of Cla97C07G137470 vs. NCBI nr
Match: XP_023544535.1 (uncharacterized protein LOC111804080 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 565.5 bits (1456), Expect = 1.8e-157
Identity = 355/490 (72.45%), Positives = 373/490 (76.12%), Query Frame = 0

Query: 1   MSRSIGRRVPGFSLLSNANKLSFVPFS--SXXXXXXXXXXXXXXGFPSHAGPFDFTSPVP 60
           MSRSIGR+VPG S LSNANKL FVPFS   XXXXXXXXXXXXXX  P+H GPFDF+S VP
Sbjct: 1   MSRSIGRKVPGLSFLSNANKLGFVPFSXXXXXXXXXXXXXXXXXXXPTHGGPFDFSSRVP 60

Query: 61  GQEDSNTSKQDSVGSRPTPGLG--HGKPTPSSPILPSFSSFSPSVNPSSAGRGRVDAPPP 120
           GQEDSN SK +SV SR T GLG           ILPS SSF+PSV  S       D   P
Sbjct: 61  GQEDSNESKHESVDSRGTSGLGXXXXXXXXXXXILPSLSSFTPSVKSSXXXXXXXDGSQP 120

Query: 121 IRSPSGPGSSKGSDSEPKKPVFFSKDNAGDSASTTRPGALHRGVGERNLPDTLLSGFPGV 180
           IRSP    SS GSDSE KKPVFFSKDNAGDSA + RPGAL    GERNLPD+ LS   G 
Sbjct: 121 IRSPPESRSSNGSDSERKKPVFFSKDNAGDSAGSARPGALGGDAGERNLPDSFLSVLSGA 180

Query: 181 GRGKPMKQPGQEAQPKQENRHLRP-------SGVGEPGRGRGGGQRMSXXXXXXXXXXXX 240
           GRGKPMKQP  E+QPKQENRHLRP        G G PGRG GGG R+S            
Sbjct: 181 GRGKPMKQPIPESQPKQENRHLRPRQEAGGRGGYG-PGRGSGGGPRIS-----RDESVRN 240

Query: 241 XXXXXXXXXXXXRGKXXXXXXXXXXXXXXXXXXXXGVFRTGERWERGRVQDMEDGYAAGL 300
                          XXXXXXXXXXXXXXXXXXXX               DMEDGYA+GL
Sbjct: 241 TGRMMSRGGPDGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDMEDGYASGL 300

Query: 301 YLGDNADGEKLAKRIGTEHMNQLVEGFEEMSVRVLPSPLEEEYLDAVHTNYMIECEPEYL 360
           YLGDNADGEKLAKRIGTEHMNQLVEGFEEMS RVLPSPLEE Y++A+ TNYMIECEPEYL
Sbjct: 301 YLGDNADGEKLAKRIGTEHMNQLVEGFEEMSGRVLPSPLEEGYVEAMDTNYMIECEPEYL 360

Query: 361 MGDFESNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIMEETMQRVPLLKEIVD 420
           MGDFESNPDIDENPPIPLRDALEKMKPFLMAYE I+SHEEWEEI+EETMQRVPLLKEIVD
Sbjct: 361 MGDFESNPDIDENPPIPLRDALEKMKPFLMAYEGIRSHEEWEEIVEETMQRVPLLKEIVD 420

Query: 421 YYSGPDRVTAKQQQGELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFDKKCQFMDK 480
            YSGPDRVTAKQQQGELERVAKTLPQSAPNS+K+FTNRAVLSLQSNPGWGFDKKCQFMDK
Sbjct: 421 SYSGPDRVTAKQQQGELERVAKTLPQSAPNSVKKFTNRAVLSLQSNPGWGFDKKCQFMDK 480

BLAST of Cla97C07G137470 vs. NCBI nr
Match: XP_008451827.1 (PREDICTED: translation initiation factor IF-2 [Cucumis melo])

HSP 1 Score: 562.0 bits (1447), Expect = 2.0e-156
Identity = 350/495 (70.71%), Positives = 371/495 (74.95%), Query Frame = 0

Query: 1   MSRSIGRRVPGFSLLSNANKLSFVPFSSXXXXXXXXXXXXXXGFPSHAGPFDFTSPVPGQ 60
           MSRSIGR+VPGFSLLSNANKL  VPFSS XXXXXXXXXXXXX FPS  GPFDFT PVP Q
Sbjct: 1   MSRSIGRKVPGFSLLSNANKLGVVPFSS-XXXXXXXXXXXXXXFPS--GPFDFTPPVPSQ 60

Query: 61  EDSNTSKQDSVGSRPTPGLGHGK--PTPSSPILPSFSSFSPSVNPSSAGRGRVDAPPPIR 120
           E  N SKQ+ + SRPTPGLGHG+  PTPSSPI PSFSSFSPSV PSS GRGR DA P IR
Sbjct: 61  EHPNASKQEPIDSRPTPGLGHGRGIPTPSSPIRPSFSSFSPSVRPSSVGRGRGDASPSIR 120

Query: 121 SPSGPGSSKGSDSEPKKPVFFSKDNAGDSASTTRPGALHRGVGERNLPDTLLSGFPGVGR 180
           SP  P      DSEPKKPVFFS++NAGDSA++T  G LHR  GERNLPD+L SGF GVGR
Sbjct: 121 SPPEP------DSEPKKPVFFSRNNAGDSAASTSLGGLHRVSGERNLPDSLHSGFSGVGR 180

Query: 181 GKPMKQPGQEAQPKQENRHLRPSGVG--------------EPGRGRGGGQRMSXXXXXXX 240
           GKPMKQP  E QPKQENRHLRP                  EP  GRG   R +       
Sbjct: 181 GKPMKQPVPEDQPKQENRHLRPXXXXXXXXXXXXXXXXXVEPRIGRGEPWRNTNRMASRG 240

Query: 241 XXXXXXXXXXXXXXXXXRGKXXXXXXXXXXXXXXXXXXXXGVFRTGERWERGRVQDMEDG 300
                               XXX   XXXXXXXXXXXXXX               D EDG
Sbjct: 241 GPDGEVG-----------XXXXXSGYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDKEDG 300

Query: 301 YAAGLYLGDNADGEKLAKRIGTEHMNQLVEGFEEMSVRVLPSPLEEEYLDAVHTNYMIEC 360
           YAAGLYLG+N DGE+LAK++G E MNQLVEGFEEMS RVLPSPLE+  LD +  N+MIEC
Sbjct: 301 YAAGLYLGNNEDGERLAKKVGPEIMNQLVEGFEEMSGRVLPSPLEDRLLDGMDINFMIEC 360

Query: 361 EPEYLMGDFESNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIMEETMQRVPLL 420
           EPEYLMGDFESNPDIDENPPI LRDA EKMKPFLMAYENIQSHEEWEEI+EETMQ VPL+
Sbjct: 361 EPEYLMGDFESNPDIDENPPISLRDAFEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLM 420

Query: 421 KEIVDYYSGPDRVTAKQQQGELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFDKKC 480
           KEIVD YSGPDRVTAK+QQGELERVAKTLPQSAPNS+KQFTNRAVLSLQSNPGWGFDKKC
Sbjct: 421 KEIVDAYSGPDRVTAKEQQGELERVAKTLPQSAPNSVKQFTNRAVLSLQSNPGWGFDKKC 475

BLAST of Cla97C07G137470 vs. NCBI nr
Match: XP_022980643.1 (uncharacterized protein LOC111479946 [Cucurbita maxima])

HSP 1 Score: 549.3 bits (1414), Expect = 1.3e-152
Identity = 355/482 (73.65%), Positives = 371/482 (76.97%), Query Frame = 0

Query: 1   MSRSIGRRVPGFSLLSNANKLSFVPFSSXXXXXXXXXXXXXXGF-PSHAGPFDFTSPVPG 60
           MSRSIGR+VPG S LSNANKL FVPFSSXXXXXXXXXXXXXX   P+H GPFDF+S VPG
Sbjct: 1   MSRSIGRKVPGLSFLSNANKLGFVPFSSXXXXXXXXXXXXXXXXSPTHGGPFDFSSRVPG 60

Query: 61  QEDSNTSKQDSVGSRPTPGLG--HGKPTPSSPILPSFSSFSPSVNPSSAGRGRVDAPPPI 120
           QEDSN SK +SV SR T GLG           ILPS SSF+PSV  S       D   PI
Sbjct: 61  QEDSNESKHESVDSRGTSGLGXXXXXXXXXXXILPSLSSFTPSVKSSXXXXXXXDGSQPI 120

Query: 121 RSPSGPGSSKGSDSEPKKPVFFSKDNAGDSASTTRPGALHRGVGERNLPDTLLSGFPGVG 180
           RSP    SS GSDSE KKPVFFSKDNA DSA + RPGAL R VGERNLPD+ LS   G G
Sbjct: 121 RSPPESRSSNGSDSERKKPVFFSKDNAADSAGSARPGALDRDVGERNLPDSFLSVLSGAG 180

Query: 181 RGKPMKQPGQEAQPKQENRHLRPSGVGEPGRGRGGGQRMSXXXXXXXXXXXXXXXXXXXX 240
           RGKPMKQP  E+                            XXXX              XX
Sbjct: 181 RGKPMKQPIPESXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXISRDESVRNTGRMMXX 240

Query: 241 XXXXRGKXXXXXXXXXXXXXXXXXXXXGVFRTGERWERGRVQDMEDGYAAGLYLGDNADG 300
           XXXX   XXXXXXXXXXXXXXXXXXXX  FRTGER ERGR QDMEDGYA+GLYLGDNADG
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXFRTGERGERGRGQDMEDGYASGLYLGDNADG 300

Query: 301 EKLAKRIGTEHMNQLVEGFEEMSVRVLPSPLEEEYLDAVHTNYMIECEPEYLMGDFESNP 360
           EKLAKRIGTEHMNQLVEG EEMS RVLPSPLEE Y++A+  NYMIECEPEYLMGDFESNP
Sbjct: 301 EKLAKRIGTEHMNQLVEGXEEMSGRVLPSPLEEGYVEAMDMNYMIECEPEYLMGDFESNP 360

Query: 361 DIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIMEETMQRVPLLKEIVDYYSGPDRV 420
           DIDENPPIPLRDALEKMKPFLMAYE IQSHEEWEEI+EETMQRVPLLKEIVD YSGPDRV
Sbjct: 361 DIDENPPIPLRDALEKMKPFLMAYEGIQSHEEWEEIVEETMQRVPLLKEIVDSYSGPDRV 420

Query: 421 TAKQQQGELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVREFSQR 480
           TAKQQQGELERVAKTLPQSAPNS+K+FTNRAVLSLQSNPGWGFDKKCQFMDKLVREFSQR
Sbjct: 421 TAKQQQGELERVAKTLPQSAPNSVKKFTNRAVLSLQSNPGWGFDKKCQFMDKLVREFSQR 480

BLAST of Cla97C07G137470 vs. TrEMBL
Match: tr|A0A1S3BT69|A0A1S3BT69_CUCME (translation initiation factor IF-2 OS=Cucumis melo OX=3656 GN=LOC103492997 PE=4 SV=1)

HSP 1 Score: 562.0 bits (1447), Expect = 1.3e-156
Identity = 350/495 (70.71%), Positives = 371/495 (74.95%), Query Frame = 0

Query: 1   MSRSIGRRVPGFSLLSNANKLSFVPFSSXXXXXXXXXXXXXXGFPSHAGPFDFTSPVPGQ 60
           MSRSIGR+VPGFSLLSNANKL  VPFSS XXXXXXXXXXXXX FPS  GPFDFT PVP Q
Sbjct: 1   MSRSIGRKVPGFSLLSNANKLGVVPFSS-XXXXXXXXXXXXXXFPS--GPFDFTPPVPSQ 60

Query: 61  EDSNTSKQDSVGSRPTPGLGHGK--PTPSSPILPSFSSFSPSVNPSSAGRGRVDAPPPIR 120
           E  N SKQ+ + SRPTPGLGHG+  PTPSSPI PSFSSFSPSV PSS GRGR DA P IR
Sbjct: 61  EHPNASKQEPIDSRPTPGLGHGRGIPTPSSPIRPSFSSFSPSVRPSSVGRGRGDASPSIR 120

Query: 121 SPSGPGSSKGSDSEPKKPVFFSKDNAGDSASTTRPGALHRGVGERNLPDTLLSGFPGVGR 180
           SP  P      DSEPKKPVFFS++NAGDSA++T  G LHR  GERNLPD+L SGF GVGR
Sbjct: 121 SPPEP------DSEPKKPVFFSRNNAGDSAASTSLGGLHRVSGERNLPDSLHSGFSGVGR 180

Query: 181 GKPMKQPGQEAQPKQENRHLRPSGVG--------------EPGRGRGGGQRMSXXXXXXX 240
           GKPMKQP  E QPKQENRHLRP                  EP  GRG   R +       
Sbjct: 181 GKPMKQPVPEDQPKQENRHLRPXXXXXXXXXXXXXXXXXVEPRIGRGEPWRNTNRMASRG 240

Query: 241 XXXXXXXXXXXXXXXXXRGKXXXXXXXXXXXXXXXXXXXXGVFRTGERWERGRVQDMEDG 300
                               XXX   XXXXXXXXXXXXXX               D EDG
Sbjct: 241 GPDGEVG-----------XXXXXSGYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDKEDG 300

Query: 301 YAAGLYLGDNADGEKLAKRIGTEHMNQLVEGFEEMSVRVLPSPLEEEYLDAVHTNYMIEC 360
           YAAGLYLG+N DGE+LAK++G E MNQLVEGFEEMS RVLPSPLE+  LD +  N+MIEC
Sbjct: 301 YAAGLYLGNNEDGERLAKKVGPEIMNQLVEGFEEMSGRVLPSPLEDRLLDGMDINFMIEC 360

Query: 361 EPEYLMGDFESNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIMEETMQRVPLL 420
           EPEYLMGDFESNPDIDENPPI LRDA EKMKPFLMAYENIQSHEEWEEI+EETMQ VPL+
Sbjct: 361 EPEYLMGDFESNPDIDENPPISLRDAFEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLM 420

Query: 421 KEIVDYYSGPDRVTAKQQQGELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFDKKC 480
           KEIVD YSGPDRVTAK+QQGELERVAKTLPQSAPNS+KQFTNRAVLSLQSNPGWGFDKKC
Sbjct: 421 KEIVDAYSGPDRVTAKEQQGELERVAKTLPQSAPNSVKQFTNRAVLSLQSNPGWGFDKKC 475

BLAST of Cla97C07G137470 vs. TrEMBL
Match: tr|A0A0A0KVG1|A0A0A0KVG1_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G064010 PE=4 SV=1)

HSP 1 Score: 529.6 bits (1363), Expect = 7.1e-147
Identity = 332/495 (67.07%), Positives = 356/495 (71.92%), Query Frame = 0

Query: 1   MSRSIGRRVPGFSLLSNANKLSFVPFSSXXXXXXXXXXXXXXGFPSHAGPFDFTSPVPGQ 60
           MSRSIGR+V GFSLLSNANKL  VPFSSXXXXXXXXXXXXXX     +GPFDFT PVP Q
Sbjct: 1   MSRSIGRKVTGFSLLSNANKLGVVPFSSXXXXXXXXXXXXXXXXXXPSGPFDFTPPVPNQ 60

Query: 61  EDSNTSKQDSVGSRPTPGLGH--GKPTPSSPILPSFSSFSPSVNPSSAGRGRVDAPPPIR 120
           E SN SKQ+ + SRPTPGLGH  GKPTPSSP+ PSFSSFSPSV PSS GRGR DA P IR
Sbjct: 61  EHSNASKQEPIDSRPTPGLGHGRGKPTPSSPLRPSFSSFSPSVRPSSVGRGRGDASPSIR 120

Query: 121 SPSGPGSSKGSDSEPKKPVFFSKDNAGDSASTTRPGALHRGVGERNLPDTLLSGFPGVGR 180
           SP  P      DSEPKKPVFFSK+NAGDSA++T  G LHR  GERNLP++L S F GVGR
Sbjct: 121 SPPEP------DSEPKKPVFFSKNNAGDSAASTSLGGLHRVSGERNLPESLHSEFSGVGR 180

Query: 181 GKPMKQPGQEAQPKQENRHLRPSGVG--------------EPGRGRGGGQRMSXXXXXXX 240
           GKPMKQP  E                              EP  GRG   R +       
Sbjct: 181 GKPMKQPVPEDXXXXXXXXXXXXXXXXXXXXXXXXXXXXFEPRIGRGEPWRNTNRMVSKD 240

Query: 241 XXXXXXXXXXXXXXXXXRGKXXXXXXXXXXXXXXXXXXXXGVFRTGERWERGRVQDMEDG 300
                             G       XXXXXXXXXXXXXX                  DG
Sbjct: 241 GPDGEVG-----------GGRGTSGYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDG 300

Query: 301 YAAGLYLGDNADGEKLAKRIGTEHMNQLVEGFEEMSVRVLPSPLEEEYLDAVHTNYMIEC 360
           YAAGLYLG+N DGE+LAKRIGTE+MN+LVEGFEEMS RVLPSPL ++YLD + TN+MIEC
Sbjct: 301 YAAGLYLGNNEDGERLAKRIGTENMNKLVEGFEEMSGRVLPSPLVDQYLDGMDTNFMIEC 360

Query: 361 EPEYLMGDFESNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIMEETMQRVPLL 420
           EPEYLMGDFE+NPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEI+EETMQ VPLL
Sbjct: 361 EPEYLMGDFENNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLL 420

Query: 421 KEIVDYYSGPDRVTAKQQQGELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFDKKC 480
           KEIVD Y GPDRVTAK+QQGELERVAKTLPQSAPNS+KQFTNR VLSLQSNPGWGFDKK 
Sbjct: 421 KEIVDAYGGPDRVTAKEQQGELERVAKTLPQSAPNSVKQFTNRVVLSLQSNPGWGFDKKW 478

BLAST of Cla97C07G137470 vs. TrEMBL
Match: tr|A0A200QQL5|A0A200QQL5_9MAGN (Uncharacterized protein OS=Macleaya cordata OX=56857 GN=BVC80_8541g14 PE=4 SV=1)

HSP 1 Score: 375.6 bits (963), Expect = 1.7e-100
Identity = 231/411 (56.20%), Positives = 275/411 (66.91%), Query Frame = 0

Query: 78  GLGH--GKPTPSSPILPSFSSFSPSVNPSSAGRGRVDAPPPIRSPSGPGSSKGSDS---E 137
           GLGH  GKP PS+PILPSFSS+   +NP SAGRGR                   +S   +
Sbjct: 85  GLGHGRGKPIPSNPILPSFSSWVSGMNP-SAGRGRTIQXXXXXXXXXXXXXXXHESQSFQ 144

Query: 138 PKKPVFFSKDNAGDSASTTRPGALHRGVGERNLPDTLLSGFPGVGRGKPMKQPGQEAQPK 197
           PKKP+FF ++++ DS   T+     R   E  LP +L SG  G GRGK  K PG E +P+
Sbjct: 145 PKKPIFFRREDSSDSTQRTQFNDSGRNPEESILPSSLSSGLTGAGRGKASKFPGHEERPE 204

Query: 198 QENRHLRPSGVGEPGRGRGGG----QRMSXXXXXXXXXXXXXXXXXXXXXXXXRGKXXXX 257
           +ENRH+RP           GG    QR S                            XXX
Sbjct: 205 EENRHIRPXXXXXXXXXXXGGGAELQRRSPPSSPRLSREDAVKKAVGILSRGGGXXTXXX 264

Query: 258 XXXXXXXXXXXXXXXXGVFRTGERWERGRVQDMEDGYAAGLYLGDNADGEKLAKRIGTEH 317
           XXXXX XXXX XXXXX              +D ED YA GLYLG+NADGE+LA +IG E+
Sbjct: 265 XXXXXVXXXXIXXXXXXXXXXXXXXXXXXXRDSEDDYATGLYLGNNADGERLATKIGAEN 324

Query: 318 MNQLVEGFEEMSVRVLPSPLEEEYLDAVHTNYMIECEPEYLMGDFESNPDIDENPPIPLR 377
           M++LV+GFEEMS RVLPSP+++ YLDA+HTN +IE EPEYLM +F +NPDID+ PP+PLR
Sbjct: 325 MDKLVQGFEEMSSRVLPSPMDDAYLDALHTNNLIEYEPEYLMEEFGTNPDIDDKPPLPLR 384

Query: 378 DALEKMKPFLMAYENIQSHEEWEEIMEETMQRVPLLKEIVDYYSGPDRVTAKQQQGELER 437
           DALEKMKPFLMAYE IQS EEWEEIM+ETM++VP +KE++D YSGPDRVTAKQQQ ELER
Sbjct: 385 DALEKMKPFLMAYEGIQSQEEWEEIMKETMEKVPYMKELIDIYSGPDRVTAKQQQQELER 444

Query: 438 VAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVREFSQRYK 480
           VA TLP  AP+S+K+FT+RAVLSLQSNPGWGFDKKCQFMDKLV E SQ+YK
Sbjct: 445 VAATLPDKAPSSVKRFTDRAVLSLQSNPGWGFDKKCQFMDKLVWEVSQQYK 494

BLAST of Cla97C07G137470 vs. TrEMBL
Match: tr|A0A1U8B2H5|A0A1U8B2H5_NELNU (uncharacterized protein LOC104610136 OS=Nelumbo nucifera OX=4432 GN=LOC104610136 PE=4 SV=1)

HSP 1 Score: 369.0 bits (946), Expect = 1.6e-98
Identity = 239/432 (55.32%), Positives = 281/432 (65.05%), Query Frame = 0

Query: 58  PGQEDSNTSKQDSVGSRPTP-GLGH--GKPTPSSPILPSFSSFSPSVNPSSAGRGRVDAP 117
           PG+ D+  +          P GLGH  GKP PS+PILPSFSS+   + PS+   GR    
Sbjct: 62  PGKPDTTGADDAEADDSFLPSGLGHGRGKPIPSTPILPSFSSWVSGMRPSAGRGGRSTQQ 121

Query: 118 PPIRSPSGPGSSKGSDSEPKKPVFFSKDNAGDSASTTRP-GALHRGVGERNLPDTLLSGF 177
                PS P      D +PKKP+FFS+++     +   P     R  G   LP +L SG 
Sbjct: 122 QSDSHPSEP-----QDFQPKKPIFFSREDPQGPLTQNPPISEPGRSPGGIVLPSSLSSGL 181

Query: 178 PGVGRGKPMKQP--GQEAQPKQENRHLRPS----GVGEPGRGRGGGQRMSXXXXXXXXXX 237
           PG GRGKP K      E    +ENRHLRP      VG   R      R+S          
Sbjct: 182 PGAGRGKPPKPSLGPSETSVSEENRHLRPRREGVAVGLQDRTSPSPPRLSREDAVKKAVG 241

Query: 238 XXXXXXXXXXXXXXRGKXXXXXXXXXXXXXXXXXXXXGVFRTGERWERGRVQDMEDGYAA 297
                           + XXXXXXXXXXXXXXXXXXX            R +D+ED Y  
Sbjct: 242 ILRRGGDGME------EGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRFRDLEDNYGT 301

Query: 298 GLYLGDNADGEKLAKRIGTEHMNQLVEGFEEMSVRVLPSPLEEEYLDAVHTNYMIECEPE 357
           GLYLGDNADGE+LA R+GTE+M++LVE FEEMS  VLPSP+++ YLDAVHTN +IE EPE
Sbjct: 302 GLYLGDNADGERLANRLGTENMDKLVEAFEEMSYSVLPSPMDDAYLDAVHTNNLIEYEPE 361

Query: 358 YLMGDFESNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIMEETMQRVPLLKEI 417
           YLMGDFE+NPDIDE PPIPLRDALEK+KPFLMAYE IQS EEWEEIM+ETM+++P +KE+
Sbjct: 362 YLMGDFETNPDIDEKPPIPLRDALEKVKPFLMAYEGIQSQEEWEEIMKETMEKLPYMKEL 421

Query: 418 VDYYSGPDRVTAKQQQGELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFDKKCQFM 477
           +D YSGPDRVT KQQQ ELERVAKTLP++ P+S+K FT+RAVLSLQSNPGWGFDKKCQFM
Sbjct: 422 IDIYSGPDRVTGKQQQQELERVAKTLPENVPSSVKCFTDRAVLSLQSNPGWGFDKKCQFM 481

Query: 478 DKLVREFSQRYK 480
           DKLV E SQ YK
Sbjct: 482 DKLVWEVSQHYK 482

BLAST of Cla97C07G137470 vs. TrEMBL
Match: tr|A0A2I4EYR0|A0A2I4EYR0_9ROSI (uncharacterized protein LOC108993926 OS=Juglans regia OX=51240 GN=LOC108993926 PE=4 SV=1)

HSP 1 Score: 361.7 bits (927), Expect = 2.6e-96
Identity = 242/401 (60.35%), Positives = 272/401 (67.83%), Query Frame = 0

Query: 87  SSPILPSFSSFSPSVNPSSAGRGRVDAPPPIRSPSGPGSSKGSDSEPKKPVFFSKDNAGD 146
           SSP LP+FSSF  S+ P  AGRG    PPP             DS PK+P+FF K+   D
Sbjct: 91  SSPTLPNFSSFISSIKPPPAGRGHSVPPPP-------------DSAPKEPIFFKKE---D 150

Query: 147 SASTTRPGALHRGVGERNLPDTLLSGFPGVGRGKPMKQPGQEAQPKQENRHLR------- 206
             +     A  +   +RNLP ++L    G GRGKP++QP  EAQ  +ENRH R       
Sbjct: 151 GPANLLDAA--KSFTDRNLPPSILPVSTGRGRGKPLEQPSLEAQFVEENRHTRXXXXXXX 210

Query: 207 PSGVGEPGRGRGGGQRMS-XXXXXXXXXXXXXXXXXXXXXXXXRGKXXXXXXXXXXXXXX 266
                          RM+               XXXXXXXXXX   XXXXXXXXXXXXXX
Sbjct: 211 XXXXXXXXXXXXXAPRMTREEAVKNAVKILSRGXXXXXXXXXXXXXXXXXXXXXXXXXXX 270

Query: 267 XXXXXXGVFRTGERWERGRVQDMEDGYAAGLYLGDNADGEKLAKRIGTEHMNQLVEGFEE 326
           XXXXXX                 ED Y AGLYLGDNADGEKLAKR+G E+MNQLVEGFEE
Sbjct: 271 XXXXXXXXXXXXXXXXXXXXXSKEDAYGAGLYLGDNADGEKLAKRLGVENMNQLVEGFEE 330

Query: 327 MSVRVLPSPLEEEYLDAVHTNYMIECEPEYLMGDFESNPDIDENPPIPLRDALEKMKPFL 386
           MS  VLPSPLE+  +DA+  NY IECEPEYLMG+F+ NPDID+ PPIPLRDALEKMKPFL
Sbjct: 331 MSGAVLPSPLEDALVDAMDVNYAIECEPEYLMGEFDQNPDIDDKPPIPLRDALEKMKPFL 390

Query: 387 MAYENIQSHEEWEEIMEETMQRVPLLKEIVDYYSGPDRVTAKQQQGELERVAKTLPQSAP 446
           MAYE IQS EEWEEIM+ETM+RVPLLKEIVDYYSGPDRVTAKQQQ ELERVAKT+P SAP
Sbjct: 391 MAYEGIQSQEEWEEIMKETMERVPLLKEIVDYYSGPDRVTAKQQQEELERVAKTVPISAP 450

Query: 447 NSIKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVREFSQRYK 480
           +S+K+F +RAV+SLQSNPGWGFDKKCQFMDKLV E S  YK
Sbjct: 451 DSVKRFADRAVISLQSNPGWGFDKKCQFMDKLVGEVSHHYK 473

BLAST of Cla97C07G137470 vs. TAIR10
Match: AT1G53645.1 (hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 255.4 bits (651), Expect = 7.1e-68
Identity = 121/194 (62.37%), Positives = 151/194 (77.84%), Query Frame = 0

Query: 286 AAGLYLGDNADGEKLAKRIGTEHMNQLVEGFEEMSVRVLPSPLEEEYLDAVHTNYMIECE 345
           A  ++ GD+ADGEK A+++G E M  L EGFEE+  + LPS   +  +DA  TN MIECE
Sbjct: 330 AMRIFAGDSADGEKFAEKMGPELMKTLAEGFEEICEKALPSTTHDAIIDAYDTNLMIECE 389

Query: 346 PEYLMGDFESNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIMEETMQRVPLLK 405
           PEY+M DF SNPDIDE PP+ LR+ LEK+KPF++AYE I+  EEWEE + E M + PL+K
Sbjct: 390 PEYIMPDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAINEAMTQAPLMK 449

Query: 406 EIVDYYSGPDRVTAKQQQGELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFDKKCQ 465
           EIVD+YSGPDRVTAK+Q  EL+R+A TLP SAP+S+K+F +RA L+L+SNPGWGFDKK Q
Sbjct: 450 EIVDHYSGPDRVTAKKQNEELDRIATTLPASAPDSVKRFADRAALTLKSNPGWGFDKKYQ 509

Query: 466 FMDKLVREFSQRYK 480
           FMDKLV E SQ YK
Sbjct: 510 FMDKLVLEVSQSYK 523
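
The percentages in the HSP summary lines above follow directly from the count/length pairs (for example 121/194 = 62.37%). The following is a minimal sketch of how such a line could be parsed and cross-checked. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, chosen only for illustration, and are not part of any BLAST tool or of this database.

```python
import re

# Matches "Identity = a/b (p%), Positives = c/d (q%)".
# "Posi?tives" also tolerates the misspelling "Postives" seen in some exports.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP summary line into counts and recomputed percentages.

    Hypothetical helper: the percentages are recomputed from the counts
    so they can be compared with the values printed in the report.
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
        "computed_identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "computed_positives_pct": round(100 * int(pos) / int(pos_len), 2),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 121/194 (62.37%), Positives = 151/194 (77.84%), Query Frame = 0"
    )
    print(stats["computed_identity_pct"], stats["computed_positives_pct"])
```

Comparing the recomputed and reported percentages is a cheap consistency check when these records are collected in bulk.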

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_022136793.1  2.1e-171  78.14         uncharacterized protein LOC111008406 [Momordica charantia]
XP_022942601.1  6.5e-160  73.27         uncharacterized protein LOC111447586 [Cucurbita moschata]
XP_023544535.1  1.8e-157  72.45         uncharacterized protein LOC111804080 [Cucurbita pepo subsp. pepo]
XP_008451827.1  2.0e-156  70.71         PREDICTED: translation initiation factor IF-2 [Cucumis melo]
XP_022980643.1  1.3e-152  73.65         uncharacterized protein LOC111479946 [Cucurbita maxima]
Match Name                      E-value   Identity (%)  Description
tr|A0A1S3BT69|A0A1S3BT69_CUCME  1.3e-156  70.71         translation initiation factor IF-2 OS=Cucumis melo OX=3656 GN=LOC103492997 PE=4 ...
tr|A0A0A0KVG1|A0A0A0KVG1_CUCSA  7.1e-147  67.07         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G064010 PE=4 SV=1
tr|A0A200QQL5|A0A200QQL5_9MAGN  1.7e-100  56.20         Uncharacterized protein OS=Macleaya cordata OX=56857 GN=BVC80_8541g14 PE=4 SV=1
tr|A0A1U8B2H5|A0A1U8B2H5_NELNU  1.6e-98   55.32         uncharacterized protein LOC104610136 OS=Nelumbo nucifera OX=4432 GN=LOC104610136...
tr|A0A2I4EYR0|A0A2I4EYR0_9ROSI  2.6e-96   60.35         uncharacterized protein LOC108993926 OS=Juglans regia OX=51240 GN=LOC108993926 P...
Match Name   E-value  Identity (%)  Description
AT1G53645.1  7.1e-68  62.37         hydroxyproline-rich glycoprotein family protein
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0009640      photomorphogenesis
biological_process  GO:0000338      protein deneddylation
biological_process  GO:0009220      pyrimidine ribonucleotide biosynthetic process
biological_process  GO:0042742      defense response to bacterium
biological_process  GO:0008150      biological_process
biological_process  GO:0006413      translational initiation
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0009505      plant-type cell wall
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0003743      translation initiation factor activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C07G137470.1  Cla97C07G137470.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term  IPR Description   Source       Source Term  Source Description   Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 57..73
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 26..263
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 88..105
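
The three mobidb-lite intervals listed above overlap, so summing their lengths would overcount the residues predicted to be disordered. The following is a minimal sketch of merging them into non-overlapping spans. It assumes the `coord` values are 1-based, inclusive protein positions, which this page does not state explicitly, and `merge_intervals` is a hypothetical helper name.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive (start, end) intervals.

    Assumption: coordinates are 1-based and inclusive, so (26, 263)
    covers 263 - 26 + 1 = 238 positions.
    """
    merged = []
    for start, end in sorted(intervals):
        # Extend the last span when the next interval overlaps or touches it.
        if merged and start <= merged[-1][1] + 1:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(span) for span in merged]


def covered_length(intervals):
    """Count the distinct positions covered by a set of inclusive intervals."""
    return sum(end - start + 1 for start, end in merge_intervals(intervals))


if __name__ == "__main__":
    # Coordinates copied from the InterPro annotation rows above.
    disorder = [(57, 73), (26, 263), (88, 105)]
    print(merge_intervals(disorder))
    print(covered_length(disorder))
```

Under that assumption, the two shorter intervals fall entirely inside 26..263, so the merged annotation is a single span.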

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene             Organism                   Block
Cla97C07G137470  Silver-seed gourd          carwmbB0043
Cla97C07G137470  Silver-seed gourd          carwmbB0682
Cla97C07G137470  Cucurbita maxima (Rimu)    cmawmbB350
Cla97C07G137470  Cucurbita moschata (Rifu)  cmowmbB335