CmaCh04G019700 (gene) Cucurbita maxima (Rimu)

Name: CmaCh04G019700
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Coffea canephora DH200=94 genomic scaffold, scaffold_16
Location: Cma_Chr04 : 11256096 .. 11267154 (-)
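
The location string above can be split into sequence ID, coordinates, and strand. The following is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into a dict.

    Assumption: START and END are 1-based, inclusive coordinates,
    so the span length is END - START + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "seqid": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Cma_Chr04 : 11256096 .. 11267154 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the gene spans 11,059 bases on the minus strand of Cma_Chr04.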
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCAGAGAAAGAACAGTTCTATGGCTGCCGCCCCAAACATGTCGCCGAAGCGTCGAAAGAAAATGTCGCTGAAACCGAACGGAAAACGATGACGCCATGGGAGCAGCACTCGGCTGTCATAAGCATCCCTCGGTTCGACTACAATGCACCGTCTGCGCTTCTTCACCATCGCCAGTCGGGATTCCTCATCACATGCGCTATCAGTGAGAATCTCTTCTACAGTTGCTTACAGCCTATTTTATGTTCCTTCAATTTTTCCCCTATTGGATGTTCATATTTGAATGTTTCCTTTCAAGTTAAACTGAGTTAGCATGGAGTTTCTTACTTCAAAATAATAATAAAGAGTGAGAGAGAAATTGGGGTTTAATTATTTCCTTTGTTTTTTCAGAGAGGGAGAAGAGTGCCACAAAAGAAGCTATCTCCATCCTTGAAAAGGTTCTGTTTTTGTACTTATTTGAATTTTCTTCACGTCATGCATTTTAACCTTCATGACTTTCTGAAGTTTAATGGCAAACTAGCTAGTTGAATATTGGGTGTTTGAAGTACTAATTTCGGTGATCTTTATCTGATTCACTATAACCGCCTTGGAAATTACCTAGGAACTTTGTACTTGAATATAGAAGCCATAGAACAATAAACAACTGCTGCAAAATTGATTAAAATTGAATTTAAATATTTTTCTATTGATTGCTTCAGAGCCTTTACAATAGCTATATCAAAACAAAGGTTGAAATACATCTAACTCAACCATGGAAGTAGACTTATAAGTTAAATTAGTTTTTAGAAATCAATAATTACAATTACAATTTAGGATATCTTACTCATATTATGTTATTGTAACTCTAATATACTGAAATAGTAATATTACATTAACTTCTATGAATGAGTTGCATTTGACATCTCACCAAAAGATATCAAAGAAACTGCCGCACTGTCATTTAAAAAAGAGGAGATGTTTTCAGCCACTTCTTGGGTCATTCCCACCCCACTTCTTCAAAAGCCTGAATCTTCTCTTTCTAACTAGACTCACTAAAAGAAATGTCGTCATAATCAACCTTGTCATAAAAGTCAATGGTTGAGCCAACATCTTTCACAGAATTACGGGTTGATGTGGAACAAATCCAACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTGAGTTAGCATTATGTAAGGACCCAATCATTATTCGGTACACTTGGAGGGGCATGTAGGAACATTTGCCTTCAAGCAGTTAGTTAGGCGAGAGAATGTTGGACTGAATGTTGTGTATGATTTGCGTAGAGAATGTTGGACTGAATGTTGTGTATGATTTGCGTAGAGAATGTTGGACTGAACGTTGTGTATGATTTGCGTAGAGAATGTTGGACTGAATGTTGTGTATGATTGCGTAGAGAATGTTGGACTGAATGTTGTGTATGA
TTTGCGTAGGCATATTACTATGATTGTCTTTATCTTTTTGTTAGCTAAGAAGTTGAAGGGAAAGGGGAGAAAGTTTACTAGCCCTCTCTAATCTGCTGAGTATGATTTTGCATTGCCTTAGGGCACCATATACTTCTACATGTCAATGCATAGTACGGAAATACCTTACGTACTGCAATTGGAATGTGTTCATCTTAGGGAAACTTGGAAAATGGCCCATAAATGCTTAGAGAGCGAATAGATTAGTTGGATTAGGAGATGATGAGCATTAAAGGCAAAGTCTAAAAATGTTTGGAGATCAATGGAAGTCTGTCCACATTGATGAAGATGGTGGATTGCTTGAGTATGCAAGTAGAGATTTAGAAATGGGAATTAGATGATACTCACAATGTTAGTGCTTCCTTCTAGGGGTTCTTTATCCAGATCTTTTGGTGCTTCTATTTTACAAAATATTTAGCTTTTCAGAGTAATGGTTCAAAAGATGATACTCACAATGTTAGTGGCTATTGCCAGAGAGTTGTTCTCATGAGAACATAAGATTTGGCACCAAAGGTTATAGAGGGTTCATCCCCAAAAAAGGAAAGAGTGGGACTACTCAGGAGAGAATTCTGACCAAACTCAAGAGTGAGGAAAAAGGGTGTCACAAGAGAAAATATAAAAGAGCAGGGGAGACTAGAGTAAATTCGATAAGGTGGCATGCCAGTGTTTGGGTGGAGATTTGGCTTTTCAAAAGAAACAAGTACTGCCAAATCCATAAATTAACTGATTATGAGAAATTAATTGTTGTTGTTGTGAGTTTTGAAAGAGTTGTGCTAAACCAATGTCGTGGGAAGGAAGAATGTGAGCCATTAGAAGATTGTAACCATTTGAAATTTTGACTCAGAACAATTTCAAGTTTCAAAGAAAGGAAATGTATGGCGAGGAAGGAAATGTATGCGAGTGATTCCTCGCCATGAAACAAAAAACCTTGGTGGTTGAGTATTGGAGTGCATTTGAGATGTTGGTGGTGTTGTTGCCGCACTTGACCAACGAAGTTTTGGAGGATACTTTCCTTAATGAATTGGACCCTATGGTCAAGGGTTAAAATTTCCTCAATTTTCTTGTGTGGGTCCACTAAGGTGATAGTTTTTAAGGAACCAAAAAGAATTGGGTGAGAACCGGTCTGAAAAGGGGCTAAGCATCACAAAATACTCAAGGTGGTCCAATTTGGGTGACGTGGCTAGTTCAATTGAACTGACCACATCATCATCTTCAATATCACATTTTGAGGGAAACGTTGAGACCGTGGCTTCCTACCTTTGTAAGTCGCCTTCCACAACTTTGCACTGTGGTTATTCCTCCTCGATTCCTTCTACAAACTTGTAGAAGACGTTGCAAGGAAGCGTTTGGGAATGGTTTCGTGGGCTCTCGTGTACTGCAGACGACGATATTCGCTAGAGAATGCCCCCTGGAAGCCAACTTCCTCGCTGGAGCTTTACCCATTTCTCCCTCCTATTTGTTGAGGCACTTCTCGAGCCTATTATCCCTTGAGTCCTTCAGCAACAATCCTCCTCTCCTTCTCCTTAAGCTGATGGAATAGGCCTCAAGCAATGAATCTGTGTGTAGCCTCCCAATTTTAAGGTCAAAACTCATGGATCTGTGGTCAAAAAGCACGTAGAAACATCCCTATCGGGTAGGGCGTTCTAGACCTCCATAGCTCTCAACTCCTTCATGAAAAGGTTCCCTCTCAACACCATGAAGCTAAAGAAATTTGTTTAAGCAACGACATGAAACGTAAAGACTAAATTTTGAATTTTCCACAAATTGACTAAAAGCTTACCTCATAACGTACTTCTCCAGTTACTCCATATCCAGAACTCCTTTGGCCTCGATGTCTTCTACACAAAAGCTCCTTCAACTCCCAAGAACTGGATGAAATGGGTCTTGTTGTACGAATTTGGGGAAAATCATTCCAGCCACTTTCAGCGTTCAAGTTCTAAAGGAAACTTATGAGTAATCTT
ATTAGGAGGGTGTAAAAAGTGATTTGAACTTAAATGGTGCAGTGTAGCTCCCAATTTTAGAAGAAAACCGAGTGCAAGAAACACATAATAGGAAAGCTTGATTCTCGAATTTGAGGGATTCTCAATGCTTGTTTCTTGTTCCAAATCCTTGCCCAAGGTCTCTTATTTATAGCCCATGCCTCCACCAACTTTGAAGGCTCTAGAATTGGCCTGAATTTGTAGTTGAGTTTCTGGATTTTTTTGGTTGAAAATCCACTCCGTTTAGCTGCTCATTGGCTTCAATTAAATTCTGTAGTTAGATGTTTAGGTGGCAAGGATTCATCAAAATTGTGGCTTATGTGGACAGTTGGATCTTCACTAATTTATGATTTACTCCGACTGTCCAAATCTACAAGCTAATATGTGTGGCAGGGGCGTTTTTGCAAATATCCAAAATTTAGGTACTTTTTTTGTAATTATCCATTTCTAAATATTTATTATTATTTTTTCAAAACTTTTCTAATAATTCTTTACCTTATTTTACACACTTTTTCCTTCTTGGGCCCCTTGAGACTCCTTTTTGACAATTAATTAAAAACTCATGAAGTTTCATGAAAAATTACCTTTTTGTCCTCTTTTTTCCAGTTTCCTTCCTACTTCTTTAGGGCTTCACTAATCACTTGGGCATTAAAATTCCTTCCTTGGGCATTTCAAACCTTCCTTGGGTGTTACAAAAGTGGTAGACGTGGCATAGCTAGTCTTTAAATTATGTCATGGGTTTCTCAGCACTGTGAACGATCAAGATGAAAGGGAAAATTGAACAACAGGAGGTAATTGTCTTGATCGATTGTGGAGCAATGCACAACTTCATCACCCAGAGGTTAATCAATACCTTAAACGTCTGTTAATGGACACAACCCATTATGGGGTGATCATGGGCACATGCATCGCTGTAAAAGGAAAAAAAAAATGTTTGTAAAGTTCTTGTTTTGAGCATCGGGGATTTGACCATGGTGGAAAACATGTTGTCGTCAGAACTAAGACGTGGATGATGTCTTGGGCATGCAGAGGTTACACACTGTCTGTAATTCAAGTTTTAAGTTGAAATGCATGCTCCATCTTTCGTTTGAAATACATAGAAAAAAAATTTAATGTTAAGACTGATGATAAACTCTTTATTAGCAAGAGCGTTGTTTTTTAATTTTTTATTTAGAAGGTTTTTTCTTTTTAATTTTTAAATTTGAATTAATCATCTACTTTTGCTTCTTTTTAGCCAATTTTTTAATTCAACCTTGGATCGCCACATTAGTATTCAGTTATAAGAAAAGTTTTGCTCTCTTTTTTTTTTTTTGCTGCCCAACAATCATACATACTTGACTTTTGTTGTGTTTCTTGCAGTATAGTCAGTACTTCAGTAATTCCACGCCAGAAACTTCGGAGAGTTCTGATGAAAATGAAACTTCTAAAAGGAGGAAAGTTTGTACAGAGGACATTGATCACAAAAGTGTTGAAAATGAGGGAAGGAGTACTGGTGAGAGTGAATATGGTGACCTAAATGTAGCTTCTTGTGGCCAATATACATAATTTACTTTGAGGTTGGAAATTTTGGTTGTTGTCCCTTCCACTGATTTCTTTTATGTTTTGTTGTATTTGACCTTGATGACATCAGATGAACATGTTAATGGAACTTCTACGATCTCTACGAGGAGTGGAGCAAAAGTAGAGAAATGTTCTCCTATTTCACTAGTGAAGCTGACACGGAGTGGTCTGCTTTTACTTACCTTTGCCAAGGATATCTCTCCGGATACTGTTTACATTGTCTCAGACTTGATTCAGTCTCTGGAAGCAGGGACATTGAAGTCACCCGCGTAAGTAGTTTTCTTACTTGTTTGGTTTGAATTTATCATTTTAATTAAGAATTGATTGAAATAAGCTTACTATTTTTTACATGTGTTAGACAAAAACTTTTTTGCTTGCATTTGCTTCTGAGAAATTTATATAGTCGTTAGGGCTTTTATTTG
GTAGTTCATTCTTTTCTAGGCAAATAACTTCGGAAACATCTTGTTCTAATAGGTATATCGTTATTATCACTATTATTTGTTGTTTGTTAATTTACTTTCTGGAAAGCAATTTAACACTATGTTTTAGAGTTGTTGAGAAACTTGGATAAAATTAGATAAATGAAAATTTTAGAAATTGTTATTTTCTTCCTTGAAAGATAACGCCTGGAAAAATTGAAATGAGCACCTAAACTTAGAAAAGCATTTTTCTTTTGAAGTAAATGAAAGAAAGTTCTAAAAAGATGTTAAAACAACCTAGGTAGGTTCAAATGTGAGCAAACCAAAAGAAAACTAGAATCCAATCACCTCCAGAAGTTTTCTCCAAAAGCTATACTTGCATTTTGCTCATAGCGGAGATCAGTGCTCCAATTCAATGTTTTGACATCTCTTTTTTATCTAAATAATTTTTATCTCTAAAACCTTGAACCACTTCCATCTAATTTTATTTCTTTCAAGCTTGTCCATCAAATACTAAAGGCCAACGTTATACATGTATATTTTTTGTAAGGCTAAAGTGCAACGTATTTAGACTGAAAAAAATTATTTTATAGGGTATTTATAGCATAAGTGGTTTTGGATAAGATTAAGTATTGGTTCACTTTTGGCACTTAAACTTGTTAATAGAAACATTGTGAATTGAACATTGATTAAATCTTGCATTCTGATAAGACATGCGATTGGTTATATAATCATTTTTCTAAGAGTAAGAAGCCATCCATGATCCATTGATTGAACGGGTATGAAATTGGGGAAATGGACAAGAAATGTCCCAACTTGATGTCAGATGTTTAGTTTATCAACAGAGGATTGTCTTGAAGCTTGATATTGGGAAAGTTGAACGGTTGATTGATATTTCCTTGATATGGTGCTTCGAGAAAGGTCTTGTTATTAAGTGCTGGAAGATGATTTTTAGGTCTATCATCTTAAATTGAAAGTGTTGCTAAAACATTGTTTACCGGGCATATAGTTTGAAGACAAGAGGACCCTCCTTTCCCCTCTTCACATTTTGACTGTGGCAAACCATTGTCAAAGGATATGGATTCTAGATTGTGGAAGATTCCTTAAAAGTTAGAGATTACCTCCAAATTGTTGATGTGTTGTTAGGTGCAGTTGCTCCATTCTGGCTATGTCAACCAAAAGAAATCCTTTTTCGTTTGTAAGCGTAGTGCATCATTTCCTTCTTAAGAGGATGGTGGATCAAAATCATCTCTTGGACTAGGAGGGAGATCATGTTAGTACGGAGAGACCAATTAAAGGCCTTAAGAATGCCGCATCAAGATTCATTTGCAGCTTTTAAAAAAAAGTTAATACAAGTGTTTACAAAAAAGTCCACACAAAAAGGGAGCCAAGCAACAAATAAGAAAAAATGGTTCAAATCTAGTAAAATAAGATGCGACAGAGTAATTACAAAAGGATTTAGTCATCAAAATCCAAAAAGAAACATATAATCTAAAGAGGGACCAAACATTATTGAAAGATATCGCCTGCCCCTAAAGATTCTATTATTTATTTTCAAATTTCCATGAAATAGAACGAACCCAATATGCCACAAAAACGCCCTTTCTCCCATAAAGGAACGTATAGAAGAAGCTATGTGAATGCCTCCCTACATCTCTGCAAGTTGGCAAAGAAAAAGCCTAATGTCTCAAGAAACACCTTCTCATATCTCTGGAAAAAACATAACTCCATAAGAAATGATCAAGATCCTCTGCTATCCTCCTACAAAGAATACAACAAAGCTGCCCAACTAAAGAGGACTCCCAAGTCGAAATTTGATCCAAGGTGTTTGCACTGTGGTGCATAAATTGCCACTCGAAAAATTGTAGAAGGGAAAAAATAGAGCCAGACTGAGAGGAACTAGCCAAACAACGAAAGAAAGAGCCCCCAGAAAAGCCTTTGGAGTGGCAAAAAGTTCAGAGGTTGAAGGCACAGTGAAGGAACCTGTTGGTTCTAAAGGCCGA
TGCTAGATGTCTTGTTCAGCAGAGGCAGAAAAGACTAGATAATTAGTTTGATAATGATGCCCATTCCTGAACTTTTATTTTATTAAACAACTTCTGGAAATCAAGTCCCAATATTCATATGGATACTATATCTATGCTACAACTCCTAGCTTTCAGTTTGAGAGGTAGTAACGTTTTTTGGAATTTTGAGGAAAAATATCACCCAACCATCTATCCTGCCAAACCAAGGCAAGCAATGCCACATCAACATTCATTGAGCTCTTGATCTTATGAGGAGGTTAGCTAGCTTTCTGATGATATAGAACCATGGATCCTGTTGAAGCATTTTTTTATAAGCAGCTGGCCAAGTATTTGTAAGATGACGGGAGGCAAAAGTTTTAACTGGGTATTGGAAGTGTCTCTTTTCTCTCTCTAGCTCTTTTGTATTTCAATTCTCCATTGAATTCATTGCATTAGATTGATTGATTGATTTTATTTTTGATGCAACACTTGACTTTCAGTGAGAAAAGATGAAACGATACATTACATACAAAAAGAAAAGCCCTCAAAATGGAGCCAAAATACAAGAAAGGGCTTCACTGGAGAGAAATAAGCCCTAACGAACAATTACTAGCTCAAAAAGAGAGAACATCGAATAAGGGACCAAATATTGCTATAAGAACTCTCACGCTCTCTGAAGATTCTATTGTTTCTCTTGCTCCCACAATCCCCAAAAATAACATACACCTCGTCTACCACAAAAATAGTCCTTCACCACAAAAGGGTGGATGGAAGATAAGTTCCTCAACTAACAGCCTACAAATCTGAGAGTTGACAAAGCTAAAATCAATCACCTCAAAGAAATTGAGCCAGACTGCACGAGCAAATTCAACCTCACATGATATGATCTGGTCATGTGTTGCCCTCCTGTAGGGAATACAACAATACGTCCCAACTCCAAAGACTCTCTAACCGAAATTCGATCCAAGATGTTAACTCTTCTACGCAAAATTGCAAAGAACTCCACCTTCTTGGAATTCTTTACCTTCCACATCGAGGAGAAAACAGAATCACTCACCGAGAACGAGTTAGACAAGCACCAAAAGAAAGACCTACATGAAAATCTGTTGGGCAGACAAGAAATCCAAAGGCGGACATCCCCTTTTGAGGCCTAAAGTGAAAATAAGAGAGCACGGATAAAAGAGCCAAGACATTCGATGTTTTCCTATCGGTCAATGGCCAATGGAGACATAGTGAAGCAGAGGGTATGAGGAACAGAGGGTCTATCGCCCAACAATTTTTCCTTTCAAAAATAAGTATCCCACCCATCCCCCACATTATGAATAAATTGTGAAAGAAGAGGAAAAACTATCTCAATCACTGTCCATGGGTTGTTGGAAGTGTCTTTTAACCCCTCACCGGTCCACACAAAGGGATGGGGCCCGTACTTACTGACATTAACCTTTGTGTCATAGGGTATCGGCTTCATGATATCTCCAAGTCCAAGGACCGCCTCCCAGCTCACCAGATGTGAACTCTTCCTCTTCAACCCTCTTGATAAGAATTATCTCATAAGTTTCTCGTTATTTTGATGTCCATGTTTTGTATCATCTAAAGTTTTTCTTCATTTGTAAGAGGAATACCCTAAATTGAATGATTTTTCCAACTTCTTTTTCATAGTTGAGTGTGTTATATCACAATTGTATATACAGCAATTAGGTAACCGGGTATTTTCTTTATGCAGTTGGTGTCATCGCATATTCCCCATTCAATCTACCTGTTGTTTGAATGAAAATGATCTCCGGGGAGTTGTGTCAAACCTCGTTCTTCAGTTCATGAAAGATAAAGGAAACAGTCTTTCACACCCTGTAAAGGTAAAGCTTGGGTTATTTTCATTAATGAGTGAAGGTTATTTTCCTGTTTTTTTTCCTATCCTTTTCAGTAGCGATTAAATCTTGGGTCATTCATGAGAGAGAGATGGCAAGCTTCAGGAGCCTTGTAGTTTTGACATTTATTTCATG
TAGTTTGCAGTAGGGTACAACAGAAGAGGAATTGAAGAGACTGAGATGAAGACTTGCAAGGATAGTTCTGGTGCTATTGTTATGGGTCGCGATAAATGCTTTAGCGTCGTGGCTGCTGCTGTTAAAGATGTTGTCTCGAACACCATTGTGGATCTGAAATCTCCAGAGGTGAATGGATTACACTATAATGGCGCCTTGGTACATACATAGATATTCCATGCTATTTCATCTTATAATTTTTGTTTTGTGCTCATTGAATATCCTACAGCTCTGCATCCTTATTGAGCTGCTTCCCCTTTCTGGGTTGCCTCTTGAATCATTGGTAGTTGGGGTGTCGGTTCTTCCAAGCAATCTTGTTACTACGAAGCCTCGACTTTGTATCAAAGCTTTGACTTCAGATCCTAAGGCAAAGAGTTGAAGGCATAAATCAAAGTTTTCAACGGGAAAAATACTAGACTGTTAAATCTCACGAATGGATTAGGTTGGAAAACTTCTATCAGTTATGGAACAAATGCTAGACTGCAGATTTAGAAAAGAGGAAGATTTGTCTCAATCAGCTCAATGCAGTCAGTAGTATTAACTTTGATTGAGATTATCCTTATTAGATTTGCCTCAATGTAAAGAGCACTTGAAGAACTGTGGTTTTCAAGTTCCACAATGCTGGTTGCTAGGATTAATGTTCTACATCTGACAAAATGTTGCTCCAATTCTGCACATATTTATCATCATGAACTTTGGATTACAGTAGTTTTCTAATTGATGAATAAATATTGTGTAACTTTATGTAGAATTTACTACTCGTCATTCCATTGAACACAATCAACCAGTAGAAAGTGGAAGCTCCACTGATTTTCAGCTACTATTGCCCCCCTGCACACCATATTTTTGTATTGGAGTATACGTCTCGATGAATTGTGTTTAAATCCCTTTAATTATAGAGGGTTAGAACACTTATAAATCATAATGTAATTTTTTGGGCAAGATTTCTTCTCCTTTTATTTTGTAGGTTACACTAACACTGTGTGAGAGAAACCTCCTGATGGCTAGGAAGAAGATGGC

mRNA sequence

ATGGCAGAGAAAGAACAGTTCTATGGCTGCCGCCCCAAACATGTCGCCGAAGCGTCGAAAGAAAATGTCGCTGAAACCGAACGGAAAACGATGACGCCATGGGAGCAGCACTCGGCTGTCATAAGCATCCCTCGGTTCGACTACAATGCACCGTCTGCGCTTCTTCACCATCGCCAGTCGGGATTCCTCATCACATGCGCTATCAAGAGGGAGAAGAGTGCCACAAAAGAAGCTATCTCCATCCTTGAAAAGTATAGTCAGTACTTCAGTAATTCCACGCCAGAAACTTCGGAGAGTTCTGATGAAAATGAAACTTCTAAAAGGAGGAAAGTTTGTACAGAGGACATTGATCACAAAAGTGTTGAAAATGAGGGAAGGAGTACTGATGAACATGTTAATGGAACTTCTACGATCTCTACGAGGAGTGGAGCAAAAGTAGAGAAATGTTCTCCTATTTCACTAGTGAAGCTGACACGGAGTGGTCTGCTTTTACTTACCTTTGCCAAGGATATCTCTCCGGATACTGTTTACATTGTCTCAGACTTGATTCAGTCTCTGGAAGCAGGGACATTGAAGTCACCCGCTTGGTGTCATCGCATATTCCCCATTCAATCTACCTGTTGTTTGAATGAAAATGATCTCCGGGGAGTTGTGTCAAACCTCGTTCTTCAGTTCATGAAAGATAAAGGAAACAGTCTTTCACACCCTGTAAAGTTTGCAGTAGGGTACAACAGAAGAGGAATTGAAGAGACTGAGATGAAGACTTGCAAGGATAGTTCTGGTGCTATTGTTATGGGTCGCGATAAATGCTTTAGCGTCGTGGCTGCTGCTGTTAAAGATGTTGTCTCGAACACCATTGTGGATCTGAAATCTCCAGAGCTCTGCATCCTTATTGAGCTGCTTCCCCTTTCTGGGTTGCCTCTTGAATCATTGGTAGTTGGGGTGTCGGTTCTTCCAAGCAATCTTGTTACTACGAAGCCTCGACTTTGTATCAAAGCTTTGACTTCAGATCCTAAGGCAAAGAGTTGAAGGCATAAATCAAAGTTTTCAACGGGAAAAATACTAGACTGTTAAATCTCACGAATGGATTAGGTTGGAAAACTTCTATCAGTTATGGAACAAATGCTAGACTGCAGATTTAGAAAAGAGGAAGATTTGTCTCAATCAGCTCAATGCAGTCAGTAGTATTAACTTTGATTGAGATTATCCTTATTAGATTTGCCTCAATGTAAAGAGCACTTGAAGAACTGTGGTTTTCAAGTTCCACAATGCTGGTTGCTAGGATTAATGTTCTACATCTGACAAAATGTTGCTCCAATTCTGCACATATTTATCATCATGAACTTTGGATTACAGTAGTTTTCTAATTGATGAATAAATATTGTGTAACTTTATGTAGAATTTACTACTCGTCATTCCATTGAACACAATCAACCAGTAGAAAGTGGAAGCTCCACTGATTTTCAGCTACTATTGCCCCCCTGCACACCATATTTTTGTATTGGAGTATACGTCTCGATGAATTGTGTTTAAATCCCTTTAATTATAGAGGGTTAGAACACTTATAAATCATAATGTAATTTTTTGGGCAAGATTTCTTCTCCTTTTATTTTGTAGGTTACACTAACACTGTGTGAGAGAAACCTCCTGATGGCTAGGAAGAAGATGGC

Coding sequence (CDS)

ATGGCAGAGAAAGAACAGTTCTATGGCTGCCGCCCCAAACATGTCGCCGAAGCGTCGAAAGAAAATGTCGCTGAAACCGAACGGAAAACGATGACGCCATGGGAGCAGCACTCGGCTGTCATAAGCATCCCTCGGTTCGACTACAATGCACCGTCTGCGCTTCTTCACCATCGCCAGTCGGGATTCCTCATCACATGCGCTATCAAGAGGGAGAAGAGTGCCACAAAAGAAGCTATCTCCATCCTTGAAAAGTATAGTCAGTACTTCAGTAATTCCACGCCAGAAACTTCGGAGAGTTCTGATGAAAATGAAACTTCTAAAAGGAGGAAAGTTTGTACAGAGGACATTGATCACAAAAGTGTTGAAAATGAGGGAAGGAGTACTGATGAACATGTTAATGGAACTTCTACGATCTCTACGAGGAGTGGAGCAAAAGTAGAGAAATGTTCTCCTATTTCACTAGTGAAGCTGACACGGAGTGGTCTGCTTTTACTTACCTTTGCCAAGGATATCTCTCCGGATACTGTTTACATTGTCTCAGACTTGATTCAGTCTCTGGAAGCAGGGACATTGAAGTCACCCGCTTGGTGTCATCGCATATTCCCCATTCAATCTACCTGTTGTTTGAATGAAAATGATCTCCGGGGAGTTGTGTCAAACCTCGTTCTTCAGTTCATGAAAGATAAAGGAAACAGTCTTTCACACCCTGTAAAGTTTGCAGTAGGGTACAACAGAAGAGGAATTGAAGAGACTGAGATGAAGACTTGCAAGGATAGTTCTGGTGCTATTGTTATGGGTCGCGATAAATGCTTTAGCGTCGTGGCTGCTGCTGTTAAAGATGTTGTCTCGAACACCATTGTGGATCTGAAATCTCCAGAGCTCTGCATCCTTATTGAGCTGCTTCCCCTTTCTGGGTTGCCTCTTGAATCATTGGTAGTTGGGGTGTCGGTTCTTCCAAGCAATCTTGTTACTACGAAGCCTCGACTTTGTATCAAAGCTTTGACTTCAGATCCTAAGGCAAAGAGTTGA

Protein sequence

MAEKEQFYGCRPKHVAEASKENVAETERKTMTPWEQHSAVISIPRFDYNAPSALLHHRQSGFLITCAIKREKSATKEAISILEKYSQYFSNSTPETSESSDENETSKRRKVCTEDIDHKSVENEGRSTDEHVNGTSTISTRSGAKVEKCSPISLVKLTRSGLLLLTFAKDISPDTVYIVSDLIQSLEAGTLKSPAWCHRIFPIQSTCCLNENDLRGVVSNLVLQFMKDKGNSLSHPVKFAVGYNRRGIEETEMKTCKDSSGAIVMGRDKCFSVVAAAVKDVVSNTIVDLKSPELCILIELLPLSGLPLESLVVGVSVLPSNLVTTKPRLCIKALTSDPKAKS
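
The protein sequence above corresponds to the CDS read codon by codon. The sketch below illustrates that relationship using the standard genetic code; it is an illustration only, the helper name `translate` is hypothetical, and it assumes the CDS is in frame, starts at the first base, and contains only A, C, G, and T.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position
# (first base varies slowest). '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS shown above.
print(translate("ATGGCAGAGAAAGAACAGTTCTATGGCTGC"))
```

The first ten codons of the CDS give MAEKEQFYGC, matching the start of the listed protein; the final codons GCA AAG AGT TGA give AKS followed by the stop.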
BLAST of CmaCh04G019700 vs. TrEMBL
Match: A0A0A0KQX3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G273440 PE=4 SV=1)

HSP 1 Score: 490.0 bits (1260), Expect = 2.4e-135
Identity = 272/347 (78.39%), Positives = 296/347 (85.30%), Query Frame = 1

Query: 1   MAE-KEQFYGCRPKHVAEASK--ENVAETERKTMTPWEQHSAVISIPRFDYNAPSALLHH 60
           MAE +EQ  GC  K  AEA K  ENVAETERK MTPWEQHSAVISIPRFDYNAPSALLH 
Sbjct: 1   MAETEEQNNGCHLKPEAEAFKRAENVAETERKMMTPWEQHSAVISIPRFDYNAPSALLHR 60

Query: 61  RQSGFLITCAIKREKSATKEAISILEKYSQYFSNSTPETSESSDENETSKRRKVCTEDID 120
            Q+GFLITC IKREKSATKEAISIL+KY QYF++S  ET   SDENETSKRRKV +ED+D
Sbjct: 61  CQTGFLITCTIKREKSATKEAISILQKYVQYFNSSMSETLVVSDENETSKRRKV-SEDVD 120

Query: 121 HKSVENEGRSTDEHVNGTSTISTRSGAKVEKCSPISLVKLTRSGLLLLTFAKDISPDTVY 180
           H+SV  E  STDEH   TS IST+S AKVEKCSPISLVKLTRSGLLL TF KDISPDTVY
Sbjct: 121 HRSVGGES-STDEHAKETSLISTKSEAKVEKCSPISLVKLTRSGLLLFTFTKDISPDTVY 180

Query: 181 IVSDLIQSLEAGTLKSPAWCHRIFPIQSTCCLNENDLRGVVSNLVLQFMKDKGNSLSHPV 240
           IV D++QSLEA TLKS AWCHRIFPIQ+TC LNENDL+GVVS LVL FM DKGN LSHPV
Sbjct: 181 IVKDIMQSLEARTLKSLAWCHRIFPIQATCSLNENDLQGVVSKLVLHFMNDKGNILSHPV 240

Query: 241 KFAVGYNRRGIEETEM-KTCKDSSGA-IVMGRDKCFSVVAAAVKDVVSNTIVDLKSPELC 300
           KFA+GYNRRGIEETEM KT +DSSG  +++GRDKCFS+VAAAVK VVS+ IVDLKSPELC
Sbjct: 241 KFAIGYNRRGIEETEMKKTFEDSSGVNVILGRDKCFSIVAAAVKGVVSDAIVDLKSPELC 300

Query: 301 ILIELLPLSGLPLESLVVGVSVLPSNLVTTKPRLCIKALTSDPKAKS 343
           +L+ELLP+SGLP  S VVGVSVL +NLVTTKPRLCIKALTSD KAKS
Sbjct: 301 VLVELLPVSGLPSGSSVVGVSVLSNNLVTTKPRLCIKALTSDTKAKS 345
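
The percentages in the HSP summary line are the match counts divided by the alignment length. A minimal sketch of that arithmetic, using the counts reported for this hit (272 identical and 296 positive positions over 347 aligned columns); the function name `percent` is hypothetical:

```python
def percent(count, alignment_length):
    """Return count / alignment_length as a percentage string with two decimals."""
    return f"{100.0 * count / alignment_length:.2f}"

print(percent(272, 347))  # identity
print(percent(296, 347))  # positives
```

These reproduce the 78.39% identity and 85.30% positives reported above.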

BLAST of CmaCh04G019700 vs. TrEMBL
Match: D7SMG7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0137g00060 PE=4 SV=1)

HSP 1 Score: 350.1 bits (897), Expect = 3.0e-93
Identity = 196/331 (59.21%), Positives = 242/331 (73.11%), Query Frame = 1

Query: 23  VAETERKT-MTPWEQHSAVISIPRFDYNAPSALLHHRQSGFLITCAIKREKSATKEAISI 82
           ++E ER+  M PWEQHSAVISIPRFDYNAPS+LL H  SGFL+TC IKREKSATKEA+ I
Sbjct: 1   MSEEEREEGMKPWEQHSAVISIPRFDYNAPSSLLDHSHSGFLVTCTIKREKSATKEAMPI 60

Query: 83  LEKYSQYFSNSTPETSESSDENETSKRRKVCTEDIDHK---SVENE------GRSTDEHV 142
           LEKY   FS+ + E+ ESSD N T+KRRK+CTE+ID +   SVEN+      G    E  
Sbjct: 61  LEKYVGSFSSCSSESLESSDANATTKRRKICTEEIDEECVNSVENKTASNNCGEDGGELS 120

Query: 143 NGTSTISTRSGAKVEKCSPISLVKLTRSGLLLLTFAKDISPDTVYIVSDLIQSLEAGTLK 202
                 S    A VE    +SLVKLTRSGLLL  F ++ S DTV +VS +I+SL++G++K
Sbjct: 121 KDAGVSSANRDAIVENGHVLSLVKLTRSGLLLFVFPRNNSVDTVDVVSQIIRSLQSGSVK 180

Query: 203 SPAWCHRIFPIQSTCCLNENDLRGVVSNLVLQFMKDKGNSLSHPVKFAVGYNRRGIEETE 262
            P WCHRIFPIQ+TC L+E +L  VV+ LV+QF+ ++ N  + P+KFAVGYNRRGIEETE
Sbjct: 181 PPLWCHRIFPIQATCRLDEKELHEVVTKLVVQFVNNEQNKFARPIKFAVGYNRRGIEETE 240

Query: 263 MK----TCKDSSGAIVMGRDKCFSVVAAAVKDVVSNTIVDLKSPELCILIELLPLSGLPL 322
           MK    T +D +   ++ R KCFSVVA AVK  VS+++VDLKSPEL +L+ELLPLS +P 
Sbjct: 241 MKIPKSTPRDCNSHALLDRKKCFSVVATAVKGAVSDSVVDLKSPELSVLVELLPLSRVPN 300

Query: 323 ESLVVGVSVLPSNLVTTKPRLCIKALTSDPK 340
            S+VV VSVLP NL+TTKPRLCIKAL SD K
Sbjct: 301 GSMVVAVSVLPQNLITTKPRLCIKALLSDTK 331

BLAST of CmaCh04G019700 vs. TrEMBL
Match: A0A061E9D8_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_011165 PE=4 SV=1)

HSP 1 Score: 341.7 bits (875), Expect = 1.1e-90
Identity = 195/340 (57.35%), Positives = 239/340 (70.29%), Query Frame = 1

Query: 13  KHVAEASKENVAETERKTMTPWEQHSAVISIPRFDYNAPSALLHHRQSGFLITCAIKREK 72
           K V   SKE   E  + TMTPWEQH+++ISIPRFDY APS+LL    S FLITC IKREK
Sbjct: 8   KPVKMESKEEEGEESKTTMTPWEQHASIISIPRFDYKAPSSLLQRSHSAFLITCTIKREK 67

Query: 73  SATKEAISILEKYSQYFSNSTPETSESSDENETSKRRKVCTEDIDHKSVENEGRSTD--E 132
           SATKEA+SI  KY   F +        SD N  +KRRK+CT++ID +++ N   S++  E
Sbjct: 68  SATKEAMSIFSKYVGSFKSEI----SCSDANADAKRRKICTDEID-QNIANSVDSSEITE 127

Query: 133 HVNGTSTISTRSGAKVEKCSP----ISLVKLTRSGLLLLTFAKDISPDTVYIVSDLIQSL 192
              G       S AK +K       +SLVKLTRSGLLLLTF  +   DT+ IVSD+ Q L
Sbjct: 128 AAGGIQNDDHFSSAKTDKSGAPDFVLSLVKLTRSGLLLLTFPGENPLDTIDIVSDIFQDL 187

Query: 193 EAGTLKSPAWCHRIFPIQSTCCLNENDLRGVVSNLVLQFMKDKGNSLSHPVKFAVGYNRR 252
           E+G+LKSP WCHRIFPIQ+TC LNE  L+ VVS LVL F+ DK N L+ P+KFAVG+NRR
Sbjct: 188 ESGSLKSPLWCHRIFPIQATCSLNEKGLQAVVSKLVLHFVNDKRNKLARPIKFAVGFNRR 247

Query: 253 GIEETEMKTCKD----SSGAIVMGRDKCFSVVAAAVKDVVSNTIVDLKSPELCILIELLP 312
           G EE+++K  KD    S  ++++ R KCFSVVAAAVK +VS+++VDLKSPEL ILIELLP
Sbjct: 248 GTEESQVKIPKDVSKNSDLSVLLDRGKCFSVVAAAVKGIVSDSVVDLKSPELSILIELLP 307

Query: 313 LSGLPLESLVVGVSVLPSNLVTTKPRLCIKALTSDPKAKS 343
           LSG+P  SLVVGVSVLP NLV+TKPRLCIK L  D   ++
Sbjct: 308 LSGVPNGSLVVGVSVLPQNLVSTKPRLCIKPLVCDKSGRN 342

BLAST of CmaCh04G019700 vs. TrEMBL
Match: A0A151TQ39_CAJCA (Uncharacterized protein OS=Cajanus cajan GN=KK1_008359 PE=4 SV=1)

HSP 1 Score: 340.5 bits (872), Expect = 2.4e-90
Identity = 191/337 (56.68%), Positives = 250/337 (74.18%), Query Frame = 1

Query: 15  VAEASKENVAETERK-----TMTPWEQHSAVISIPRFDYNAPSALLHHRQSGFLITCAIK 74
           +AEA+ E   E E+      TM+PWEQHSAVI++PRFDYNAPS+LL +  S FLITC IK
Sbjct: 1   MAEAADEARKEEEKSDGVGMTMSPWEQHSAVINLPRFDYNAPSSLLRNSHSAFLITCTIK 60

Query: 75  REKSATKEAISILEKYSQYFSNSTPETSESSDENETSKRRKVCTEDIDHKSVEN-EGRST 134
           REKSATKEAI+IL K+ ++ S++ P+      E+ +SKRR++CT+D D +  E  E  S 
Sbjct: 61  REKSATKEAITILHKFLRHDSSNNPK------EDTSSKRRRICTQDTDRECQETKETDSA 120

Query: 135 DEHVNGTSTISTRSGAKVEKCSPISLVKLTRSGLLLLTFAKDISPDTVYIVSDLIQSLEA 194
            E    +S +  + G  V   + +SLVKLTR+GLLLLTF  ++SPDTV IVS++IQ++E+
Sbjct: 121 SEDGKLSSPVKDKDGVAV---AALSLVKLTRNGLLLLTFPSNMSPDTVNIVSNIIQAVES 180

Query: 195 GTLKSPAWCHRIFPIQSTCCLNENDLRGVVSNLVLQFMKDKGNSLSHPVKFAVGYNRRGI 254
           GT+  PAWCHRIFPIQ+TC LNE +L+ VVS LV +F+ DK N L  P+KFAVG+NRRGI
Sbjct: 181 GTVSLPAWCHRIFPIQATCSLNEKELQEVVSMLVKKFLADKQNKLERPLKFAVGFNRRGI 240

Query: 255 EETEM--KTCKDSSGA-IVMGRDKCFSVVAAAVKDVVSNTIVDLKSPELCILIELLPLSG 314
           EET    +   DSS A +++ R+KCF VVA+AV  VV +++VDL+SPEL +L+ELLPLSG
Sbjct: 241 EETTFAKEKSNDSSNAFLLLDRNKCFGVVASAVNHVVEDSVVDLRSPELSVLVELLPLSG 300

Query: 315 LPLESLVVGVSVLPSNLVTTKPRLCIKALTSDPKAKS 343
           +P  S++VGVSVLP NLV+TKPRLCIKALTS+ K  S
Sbjct: 301 VPNRSIIVGVSVLPRNLVSTKPRLCIKALTSNTKEGS 328

BLAST of CmaCh04G019700 vs. TrEMBL
Match: B9S7B4_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0775790 PE=4 SV=1)

HSP 1 Score: 339.7 bits (870), Expect = 4.0e-90
Identity = 190/331 (57.40%), Positives = 235/331 (71.00%), Query Frame = 1

Query: 20  KENVAETERKTMTPWEQHSAVISIPRFDYNAPSALLHHRQSGFLITCAIKREKSATKEAI 79
           +E V E E+++M PWEQH+A+ISIPRFDYNAPSALLH+  SGFLITC+IKREKSATKE +
Sbjct: 18  EEAVKEKEKESMKPWEQHAAIISIPRFDYNAPSALLHNSHSGFLITCSIKREKSATKEVM 77

Query: 80  SILEKYSQYFSNSTPETSESSDENETSKRRKV-----CTEDIDHKSVENEGRSTDEHVNG 139
           SILEKY   +      T +SS+ ++  KRRK      C + ++ K V  +     E    
Sbjct: 78  SILEKYIGSY------TKDSSNGSQGIKRRKTLMGGTCAQGMESKDVSEDPDQVSEE--- 137

Query: 140 TSTISTRSGAKVEKCSPISLVKLTRSGLLLLTFAKDISPDTVYIVSDLIQSLEAGTLKSP 199
           T T+        E    +SLVKLTRSGLLLL F  + SPD   IVS++ Q +E+G+LKSP
Sbjct: 138 THTVE-------ETGFTLSLVKLTRSGLLLLNFVGENSPDATEIVSNIFQRIESGSLKSP 197

Query: 200 AWCHRIFPIQSTCCLNENDLRGVVSNLVLQFMKDKGNSLSHPVKFAVGYNRRGIEETEMK 259
            WCHRIFPIQ+TCCL+E +LR VVS LVL+F+ DK N    P+K+AVGYNRRGIEET+ K
Sbjct: 198 LWCHRIFPIQATCCLDEKELRTVVSKLVLRFINDKANKFERPIKYAVGYNRRGIEETQAK 257

Query: 260 ----TCKDSSGAIVMGRDKCFSVVAAAVKDVVSNTIVDLKSPELCILIELLPLSGLPLES 319
               T KDS+   ++ R+KCF VVA+AVKDV+S++ VDLKSPEL IL+ELLPLSG+P  S
Sbjct: 258 NVKDTSKDSALCSLLDRNKCFDVVASAVKDVISDSAVDLKSPELSILVELLPLSGVPNGS 317

Query: 320 LVVGVSVLPSNLVTTKPRLCIKALTSDPKAK 342
           LV  VSVLP NLV+ KPRLCIK L SD  AK
Sbjct: 318 LVAAVSVLPQNLVSVKPRLCIKPLVSDANAK 332

BLAST of CmaCh04G019700 vs. TAIR10
Match: AT1G09290.1 (AT1G09290.1 unknown protein)

HSP 1 Score: 310.5 bits (794), Expect = 1.3e-84
Identity = 176/322 (54.66%), Positives = 218/322 (67.70%), Query Frame = 1

Query: 27  ERKTMTPWEQHSAVISIPRFDYNAPSALLHHRQSGFLITCAIKREKSATKEAISILEKYS 86
           E +T+TPWEQHS++ISIPRFDY APS+LLHH  SGFL+TC IKREKSATKE +SIL KY 
Sbjct: 28  EAETLTPWEQHSSIISIPRFDYKAPSSLLHHSHSGFLVTCNIKREKSATKEVMSILGKYI 87

Query: 87  QYFSNSTPETSESSDENETSKRRKVC---TEDIDHKSVENEGRSTDEHVNGTSTISTRSG 146
                  PE   S+     SK++KVC   TE+   K+V  E  +  E       +     
Sbjct: 88  GSMHEEKPEVLNST----ASKKQKVCAQETEEGGEKTVPLENDALQE-TGENPNVEDLKL 147

Query: 147 AKVEKCSPISLVKLTRSGLLLLTFAKDISPDTVYIVSDLIQSLEAGTLKSPAWCHRIFPI 206
           A  E  S +SLVKLT+SGLLL TF  + SP+T  IVS + QS+E+G LK+P WCHRIFP+
Sbjct: 148 ANEEHNSLMSLVKLTKSGLLLFTFPVENSPNTTNIVSRVFQSMESGALKAPIWCHRIFPV 207

Query: 207 QSTCCLNENDLRGVVSNLVLQFMKDKGNSLSHPVKFAVGYNRRGIEETEMKTCKDSSGAI 266
           Q+TC L E +LR  VS LV +F+ DK N+LS PVKFA GY RRG EET+ K  KD+S  +
Sbjct: 208 QATCGLTEKELRETVSKLVQRFVNDKDNTLSKPVKFAAGYQRRGAEETKGKIRKDASDVL 267

Query: 267 V----MGRDKCFSVVAAAVKDVVSNTIVDLKSPELCILIELLPLSGLPLESLVVGVSVLP 326
           V    + R KCF  VAA VKD+V +++VDLKSPELC+L+ELLPLS +   S V  VSVLP
Sbjct: 268 VQCPLLDRIKCFETVAAGVKDIVPDSVVDLKSPELCVLVELLPLSRISSGSFVAAVSVLP 327

Query: 327 SNLVTTKPRLCIKALTSDPKAK 342
             LV+TKP+LCIK L  + K K
Sbjct: 328 HRLVSTKPKLCIKPLVPESKHK 344

BLAST of CmaCh04G019700 vs. NCBI nr
Match: gi|778701843|ref|XP_011655096.1| (PREDICTED: uncharacterized protein LOC101219243 isoform X1 [Cucumis sativus])

HSP 1 Score: 490.0 bits (1260), Expect = 3.5e-135
Identity = 272/347 (78.39%), Positives = 296/347 (85.30%), Query Frame = 1

Query: 1   MAE-KEQFYGCRPKHVAEASK--ENVAETERKTMTPWEQHSAVISIPRFDYNAPSALLHH 60
           MAE +EQ  GC  K  AEA K  ENVAETERK MTPWEQHSAVISIPRFDYNAPSALLH 
Sbjct: 1   MAETEEQNNGCHLKPEAEAFKRAENVAETERKMMTPWEQHSAVISIPRFDYNAPSALLHR 60

Query: 61  RQSGFLITCAIKREKSATKEAISILEKYSQYFSNSTPETSESSDENETSKRRKVCTEDID 120
            Q+GFLITC IKREKSATKEAISIL+KY QYF++S  ET   SDENETSKRRKV +ED+D
Sbjct: 61  CQTGFLITCTIKREKSATKEAISILQKYVQYFNSSMSETLVVSDENETSKRRKV-SEDVD 120

Query: 121 HKSVENEGRSTDEHVNGTSTISTRSGAKVEKCSPISLVKLTRSGLLLLTFAKDISPDTVY 180
           H+SV  E  STDEH   TS IST+S AKVEKCSPISLVKLTRSGLLL TF KDISPDTVY
Sbjct: 121 HRSVGGES-STDEHAKETSLISTKSEAKVEKCSPISLVKLTRSGLLLFTFTKDISPDTVY 180

Query: 181 IVSDLIQSLEAGTLKSPAWCHRIFPIQSTCCLNENDLRGVVSNLVLQFMKDKGNSLSHPV 240
           IV D++QSLEA TLKS AWCHRIFPIQ+TC LNENDL+GVVS LVL FM DKGN LSHPV
Sbjct: 181 IVKDIMQSLEARTLKSLAWCHRIFPIQATCSLNENDLQGVVSKLVLHFMNDKGNILSHPV 240

Query: 241 KFAVGYNRRGIEETEM-KTCKDSSGA-IVMGRDKCFSVVAAAVKDVVSNTIVDLKSPELC 300
           KFA+GYNRRGIEETEM KT +DSSG  +++GRDKCFS+VAAAVK VVS+ IVDLKSPELC
Sbjct: 241 KFAIGYNRRGIEETEMKKTFEDSSGVNVILGRDKCFSIVAAAVKGVVSDAIVDLKSPELC 300

Query: 301 ILIELLPLSGLPLESLVVGVSVLPSNLVTTKPRLCIKALTSDPKAKS 343
           +L+ELLP+SGLP  S VVGVSVL +NLVTTKPRLCIKALTSD KAKS
Sbjct: 301 VLVELLPVSGLPSGSSVVGVSVLSNNLVTTKPRLCIKALTSDTKAKS 345

BLAST of CmaCh04G019700 vs. NCBI nr
Match: gi|659120473|ref|XP_008460211.1| (PREDICTED: uncharacterized protein LOC103499095 [Cucumis melo])

HSP 1 Score: 474.9 bits (1221), Expect = 1.2e-130
Identity = 265/333 (79.58%), Positives = 284/333 (85.29%), Query Frame = 1

Query: 13  KHVAEASK--ENVAETERKTMTPWEQHSAVISIPRFDYNAPSALLHHRQSGFLITCAIKR 72
           K  AEASK  ENVAETERKTMTPWEQHSAVIS+PRFDYNAPSALLH  QSGFLITC IKR
Sbjct: 5   KPEAEASKRAENVAETERKTMTPWEQHSAVISLPRFDYNAPSALLHRCQSGFLITCTIKR 64

Query: 73  EKSATKEAISILEKYSQYFSNSTPETSESSDENETSKRRKVCTEDIDHKSVENEGRSTDE 132
           EKSATKEAI ILEKY QYFS+S  ET   SDENETSKRRKV +EDIDH SV  E R+TDE
Sbjct: 65  EKSATKEAIFILEKYVQYFSSSMTETLVISDENETSKRRKV-SEDIDHISVGGE-RNTDE 124

Query: 133 HVNGTSTISTRSGAKVEKCSPISLVKLTRSGLLLLTFAKDISPDTVYIVSDLIQSLEAGT 192
           H   TS IST+S AKVEKCSPISLVKLTRSGLLL TF KDISPDTVYIV D++QSLEA T
Sbjct: 125 HAKETSLISTKSEAKVEKCSPISLVKLTRSGLLLFTFTKDISPDTVYIVKDIMQSLEART 184

Query: 193 LKSPAWCHRIFPIQSTCCLNENDLRGVVSNLVLQFMKDKGNSLSHPVKFAVGYNRRGIEE 252
           LKS AWCHRIFPIQ+TC LNENDL+GVVS LVL FMKDKGN LSHPVKFAVGYNRRG+E 
Sbjct: 185 LKSLAWCHRIFPIQATCSLNENDLQGVVSKLVLHFMKDKGNILSHPVKFAVGYNRRGME- 244

Query: 253 TEMKTCKDSSGA-IVMGRDKCFSVVAAAVKDVVSNTIVDLKSPELCILIELLPLSGLPLE 312
                  DSSGA +++GRDKCFS+VAAAVK VVS+ IVDLKSPELC+L+ELLP+SGLP  
Sbjct: 245 -------DSSGANVILGRDKCFSIVAAAVKGVVSDVIVDLKSPELCVLVELLPVSGLPPG 304

Query: 313 SLVVGVSVLPSNLVTTKPRLCIKALTSDPKAKS 343
           S VVGVSVL +NLVTTKPRLCIKALTSD KAKS
Sbjct: 305 SSVVGVSVLSNNLVTTKPRLCIKALTSDAKAKS 327

BLAST of CmaCh04G019700 vs. NCBI nr
Match: gi|778701847|ref|XP_011655097.1| (PREDICTED: uncharacterized protein LOC101219243 isoform X2 [Cucumis sativus])

HSP 1 Score: 369.0 bits (946), Expect = 8.9e-99
Identity = 203/269 (75.46%), Positives = 228/269 (84.76%), Query Frame = 1

Query: 76  KEAISILEKYSQYFSNSTPETSESSDENETSKRRKVCTEDIDHKSVENEGRSTDEHVNGT 135
           +++ S+   Y QYF++S  ET   SDENETSKRRKV +ED+DH+SV  E  STDEH   T
Sbjct: 12  QDSSSLALSYVQYFNSSMSETLVVSDENETSKRRKV-SEDVDHRSVGGES-STDEHAKET 71

Query: 136 STISTRSGAKVEKCSPISLVKLTRSGLLLLTFAKDISPDTVYIVSDLIQSLEAGTLKSPA 195
           S IST+S AKVEKCSPISLVKLTRSGLLL TF KDISPDTVYIV D++QSLEA TLKS A
Sbjct: 72  SLISTKSEAKVEKCSPISLVKLTRSGLLLFTFTKDISPDTVYIVKDIMQSLEARTLKSLA 131

Query: 196 WCHRIFPIQSTCCLNENDLRGVVSNLVLQFMKDKGNSLSHPVKFAVGYNRRGIEETEM-K 255
           WCHRIFPIQ+TC LNENDL+GVVS LVL FM DKGN LSHPVKFA+GYNRRGIEETEM K
Sbjct: 132 WCHRIFPIQATCSLNENDLQGVVSKLVLHFMNDKGNILSHPVKFAIGYNRRGIEETEMKK 191

Query: 256 TCKDSSGA-IVMGRDKCFSVVAAAVKDVVSNTIVDLKSPELCILIELLPLSGLPLESLVV 315
           T +DSSG  +++GRDKCFS+VAAAVK VVS+ IVDLKSPELC+L+ELLP+SGLP  S VV
Sbjct: 192 TFEDSSGVNVILGRDKCFSIVAAAVKGVVSDAIVDLKSPELCVLVELLPVSGLPSGSSVV 251

Query: 316 GVSVLPSNLVTTKPRLCIKALTSDPKAKS 343
           GVSVL +NLVTTKPRLCIKALTSD KAKS
Sbjct: 252 GVSVLSNNLVTTKPRLCIKALTSDTKAKS 278

BLAST of CmaCh04G019700 vs. NCBI nr
Match: gi|225424542|ref|XP_002285300.1| (PREDICTED: uncharacterized protein LOC100267955 [Vitis vinifera])

HSP 1 Score: 350.1 bits (897), Expect = 4.3e-93
Identity = 196/331 (59.21%), Positives = 242/331 (73.11%), Query Frame = 1

Query: 23  VAETERKT-MTPWEQHSAVISIPRFDYNAPSALLHHRQSGFLITCAIKREKSATKEAISI 82
           ++E ER+  M PWEQHSAVISIPRFDYNAPS+LL H  SGFL+TC IKREKSATKEA+ I
Sbjct: 1   MSEEEREEGMKPWEQHSAVISIPRFDYNAPSSLLDHSHSGFLVTCTIKREKSATKEAMPI 60

Query: 83  LEKYSQYFSNSTPETSESSDENETSKRRKVCTEDIDHK---SVENE------GRSTDEHV 142
           LEKY   FS+ + E+ ESSD N T+KRRK+CTE+ID +   SVEN+      G    E  
Sbjct: 61  LEKYVGSFSSCSSESLESSDANATTKRRKICTEEIDEECVNSVENKTASNNCGEDGGELS 120

Query: 143 NGTSTISTRSGAKVEKCSPISLVKLTRSGLLLLTFAKDISPDTVYIVSDLIQSLEAGTLK 202
                 S    A VE    +SLVKLTRSGLLL  F ++ S DTV +VS +I+SL++G++K
Sbjct: 121 KDAGVSSANRDAIVENGHVLSLVKLTRSGLLLFVFPRNNSVDTVDVVSQIIRSLQSGSVK 180

Query: 203 SPAWCHRIFPIQSTCCLNENDLRGVVSNLVLQFMKDKGNSLSHPVKFAVGYNRRGIEETE 262
            P WCHRIFPIQ+TC L+E +L  VV+ LV+QF+ ++ N  + P+KFAVGYNRRGIEETE
Sbjct: 181 PPLWCHRIFPIQATCRLDEKELHEVVTKLVVQFVNNEQNKFARPIKFAVGYNRRGIEETE 240

Query: 263 MK----TCKDSSGAIVMGRDKCFSVVAAAVKDVVSNTIVDLKSPELCILIELLPLSGLPL 322
           MK    T +D +   ++ R KCFSVVA AVK  VS+++VDLKSPEL +L+ELLPLS +P 
Sbjct: 241 MKIPKSTPRDCNSHALLDRKKCFSVVATAVKGAVSDSVVDLKSPELSVLVELLPLSRVPN 300

Query: 323 ESLVVGVSVLPSNLVTTKPRLCIKALTSDPK 340
            S+VV VSVLP NL+TTKPRLCIKAL SD K
Sbjct: 301 GSMVVAVSVLPQNLITTKPRLCIKALLSDTK 331

BLAST of CmaCh04G019700 vs. NCBI nr
Match: gi|1021566451|ref|XP_016174481.1| (PREDICTED: uncharacterized protein LOC107617250 [Arachis ipaensis])

HSP 1 Score: 342.4 bits (877), Expect = 8.9e-91
Identity = 185/334 (55.39%), Positives = 245/334 (73.35%), Query Frame = 1

Query: 17  EASKENVAETERKTMTPWEQHSAVISIPRFDYNAPSALLHHRQSGFLITCAIKREKSATK 76
           E   E     E+K M+PWEQHSAVI +PRFDYNAPS+LLH   SGFLITC IKREKSATK
Sbjct: 18  EREDERETPKEKKEMSPWEQHSAVIKLPRFDYNAPSSLLHGSHSGFLITCTIKREKSATK 77

Query: 77  EAISILEKYSQYFSNSTPETSESSDENETSKRRKVCTEDIDHKSVENEGRSTDEHV--NG 136
           EAISIL K+++ FS  +  +  + +++  SK+R+VCTED   + ++N+ + T      +G
Sbjct: 78  EAISILHKFARPFSKGSYNSLNNLEDDNASKKRRVCTEDDAEECLDNKEKETASATANSG 137

Query: 137 TSTISTRSGAKVEKCSP----ISLVKLTRSGLLLLTFAKDISPDTVYIVSDLIQSLEAGT 196
              + + SG + E  +     +SLVKLTRSGLLL TF +D  PDTV IVS++IQ+ E+G+
Sbjct: 138 DGKLLSGSGNRAETDAEGVPGLSLVKLTRSGLLLFTFPEDALPDTVDIVSNIIQAYESGS 197

Query: 197 LKSPAWCHRIFPIQSTCCLNENDLRGVVSNLVLQFMKDKGNSLSHPVKFAVGYNRRGIEE 256
           +KSPAWCHRIFPIQ+TC LNE +L+ VVS LV +F+ DK N L  PVKFAVGYNRRGIEE
Sbjct: 198 VKSPAWCHRIFPIQATCGLNEKELQEVVSMLVKKFVDDKQNILEQPVKFAVGYNRRGIEE 257

Query: 257 TE--MKTCKDSSGAIVMGRDKCFSVVAAAVKDVVSNTIVDLKSPELCILIELLPLSGLPL 316
           T+   +  KDS    ++ R+KCF +VA+AV  VV +++VDL++PELC+L+E+LP+SG+P 
Sbjct: 258 TKSVKEKSKDSDAFSLLDRNKCFGIVASAVNGVVGDSVVDLRTPELCVLVEVLPISGVPN 317

Query: 317 ESLVVGVSVLPSNLVTTKPRLCIKALTSDPKAKS 343
            S+VV VSVLP NLV+TKPRLC++AL S+ K  S
Sbjct: 318 GSIVVAVSVLPRNLVSTKPRLCVRALNSNTKEGS 351

The following BLAST results are available for this feature:
vs. TrEMBL
Match Name | E-value | Identity | Description
A0A0A0KQX3_CUCSA | 2.4e-135 | 78.39 | Uncharacterized protein OS=Cucumis sativus GN=Csa_5G273440 PE=4 SV=1
D7SMG7_VITVI | 3.0e-93 | 59.21 | Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0137g00060 PE=4 SV=...
A0A061E9D8_THECC | 1.1e-90 | 57.35 | Uncharacterized protein OS=Theobroma cacao GN=TCM_011165 PE=4 SV=1
A0A151TQ39_CAJCA | 2.4e-90 | 56.68 | Uncharacterized protein OS=Cajanus cajan GN=KK1_008359 PE=4 SV=1
B9S7B4_RICCO | 4.0e-90 | 57.40 | Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0775790 PE=4 SV=1

vs. TAIR10
Match Name | E-value | Identity | Description
AT1G09290.1 | 1.3e-84 | 54.66 | unknown protein

vs. NCBI nr
Match Name | E-value | Identity | Description
gi|778701843|ref|XP_011655096.1| | 3.5e-135 | 78.39 | PREDICTED: uncharacterized protein LOC101219243 isoform X1 [Cucumis sativus]
gi|659120473|ref|XP_008460211.1| | 1.2e-130 | 79.58 | PREDICTED: uncharacterized protein LOC103499095 [Cucumis melo]
gi|778701847|ref|XP_011655097.1| | 8.9e-99 | 75.46 | PREDICTED: uncharacterized protein LOC101219243 isoform X2 [Cucumis sativus]
gi|225424542|ref|XP_002285300.1| | 4.3e-93 | 59.21 | PREDICTED: uncharacterized protein LOC100267955 [Vitis vinifera]
gi|1021566451|ref|XP_016174481.1| | 8.9e-91 | 55.39 | PREDICTED: uncharacterized protein LOC107617250 [Arachis ipaensis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term | Definition
IPR004114 | THUMP_dom
Vocabulary: Molecular Function
Term | Definition
GO:0003723 | RNA binding
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0008150 | biological_process
cellular_component | GO:0005575 | cellular_component
molecular_function | GO:0003723 | RNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmaCh04G019700.1 | CmaCh04G019700.1 | mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR004114 | THUMP domain | PFAM | PF02926 | THUMP | coord: 197..301, score: 7.
None | No IPR available | PANTHER | PTHR13452 | THUMP DOMAIN CONTAINING PROTEIN 1-RELATED | coord: 14..90, score: 4.2E-134; coord: 107..342, score: 4.2E
None | No IPR available | PANTHER | PTHR13452:SF9 | SUBFAMILY NOT NAMED | coord: 107..342, score: 4.2E-134; coord: 14..90, score: 4.2E
None | No IPR available | unknown | SSF143437 | THUMP domain-like | coord: 221..300, score: 1.1

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene | Organism | Block
CmaCh04G019700 | Cucurbita moschata (Rifu) | cmacmoB679
CmaCh04G019700 | Cucurbita maxima (Rimu) | cmacmaB018