CsaV3_4G017570.1 (mRNA) Cucumber (Chinese Long) v3

NameCsaV3_4G017570.1
TypemRNA
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
DescriptionN utilization substance protein like
Locationchr4 : 10979476 .. 10991405 (-)
Sequence length1137
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
GAGACGGCTTACGCACCACCAAACAAGTCGGAGGAGCTTCCTTCTTCTTCCACTAGGAAATGGCTCTGCCTCCAAGTCCTCTGTGAGTAACGAGAGCTGATTTTGGGGCATTTCACAATGTCTTTAGCTCCACCCACCTCCCTTTATCCCTATTCTTCCCACTCTCATCCCCTCCCCAAATCCCATTTATCTTCCTTTACTCAACGTTCCCATCCCAACTTCTTTTCCTTATGCTTTCCCTTTCATCTTAGTTTCTCCACCTCCTTTTCAACCCTTCACTCCCTTAAATACTCAGCTTTCAAAACCGACACTGGACTCGGAGATTCCCATGATGCTGACCAACCTCATAGTTTGAAATTTGCACCTGGAGCTTCTGCCCACAAGACTCGACTCTTCACCGTTGGGGATAAAGTTATCACTACCAGGTTTCTTTTTCCTTTGCCCTTCCACTTCTTCTGTTCTCTATTCTTTACCTACTCCCCTCTTTCCTTTTTCCATTTAAACTCTGCTTTCTAAATCTTGCTCACCATGTAAGGGGAGTTAAGGAATTAGTGGTAAATCTTGGGTAAATTCATGCTAATCATTTTTTAGTTTCTTATTTGAATTTGAATTCCTTTTTAGTTTCATTTTGGGATTTCATTTTCCATATTCTGGAGTCCTCTCTCGTGTACCTTTACAACTTCGATTCATATTAAGTTTTGAAAGTTGATAATTGTATAATTTTCTCCTTTGGGTCCTTTAGGCTACATCAGTTCACCAACAACAATTGACCTCTCTGTTAATTTCTCACGCTTAGCAATTTGAATTGTGAAATAGAGGCGGTAGGATGTGGCATGCTTCATCACTTCATACTAGTTACTCCTTCTAAATACTTGCATCATCTTTACAACTTTCCAGAATGGTGGATGCAGGGTCAGTGAACATTTGCTTTAAATATCAGTTTTGGATATTTCCTTAGTAGTCTAAAATTGGACTATACCAGATGTGGCACCTGGATGAGCACCGAGATAACACGACCACCATCTTATACTGGTCAACAGTAACGAACTTGGTTTTATGTAATTCAACCTATGGTTACTATGTAGAAAAACACTACTGAAACTAAAACCTTGTTCTTCCTGATACTCTTTCACTTGCGAGCTTCGAGTTGCTCCTGAACTCAACAAGCTTAAGAGGTTAAGGGCTGATTGGGGAGAGGAGTTGGGTGGGGTGGAGTTGTTTGGCCGCAGGAGTTATTAAACACATTGTAGATCACCCGATTCATTAGCATCAATTTCATGATTTATTAACTCCTTACATTGTGGGTCCATAACTAAACAACTTTATTTTTCATTCTCTCGTGGGTTATGGCAAATTTAGTGAGTAATTTATATTATTGACACTCTAGACAATTGGATATTAATTGAGACAACTGCTAGAGGCAAGGAAATTGTCATATTGTTGTTTTGCCTAGTTTTTCTTAAAGAAAGGAAAAAGGGAAAAGAAAAAGAAAAGATATAGGCTCCATGATTACTACATCTTTCCAGAAAACATGACTTAACTTAGCATGCTTAGTTTTCATTCTTGCTCATCCTCTCTTTTCTTCAGGCCTAATTTTCATTTCTCCTACCACATTTCTGGAATATGTCAGTTCCCATTTCATGCTAGTTCAATTGTTCCCCACATAAAAGATTCCATGCCACGTTTCTGCTGTCAAGCCTCGCTTCGGGCTTCTACTTCTTTTTCTGAAAATCGTGTGGCTGAAGAGAGGAGTTCTATTTCAATATCTTCCATTGAGATGATTCCAAAGGTTGACAAGAGCGGCAAATTTTGTAGTCCGAGAGCTGCTAGAGAGCTTGCTTTGTAAGATCCACTACTTCCTTTTCACCTTTTTTTTTCTCCAACATATGAATATCTCCTCTCTACCTTTTACGTATTCAAAAAGTTGGCTTACTTGCCAAGATATGGGCTAAGAATTTGTTTAATTTCTAGACTTGTGATAAATTCTAATTCTACTAGATTATTCAATGGCAGGTCAATTGTTTATGCAGCTTGTTTAGAAGGCTCTGATCCTGTTCGGCTTTTTGAGAAAAGGTTAAATGCCCGACGAGGTAATCTATGGTAGAGACATGATACATGGTTTTCTGTTACCCCATATACTATTTGGTGTGTGTGTTTTAATATAAAAAAGATCGTGCTGGGGAAATAGATGTCACTCAAATATAAAATCCTGCTTGCTTTTTTTTCTTTGTTTGTACAATTCAGAATCAGGATATGAATTTGACAAGACATCATTGATGGAATATAATCATATGAGCTTTGGAGGTCCGCCGGTTACCGTTGAAACAATTGAAGAGGCAGATGAGCTCTTACGTAAGGATGAAAGGGATTCTACAATTGGTAAACAATTATTTGACCTTCATATAACACACACACACACACATTATTTCTTCTTCTTAATTCACCCATTACTTTATATTAAAAACATGATTGGCTAGAAGAGACAACTATTCATTGAAAAGATGAAAAGTGTAAAAGAGAGACTAATGCTCAAAAGATACAAAATCTCTCAAAGGAGTAAAAAGGAAAAAGAAAAGCAGAAAAATGTTAAGTTTCCACAAACCATAACAAATGATTGATTTACTTAAAAGCTTAACAAAGAACTAATAGATTAACATCAAGCATCGGTATGAACGTTGAGTTGCCCATAGAAGTTGGAGTAGATCTCGAAATACTGCAAGTGTTGCTGACAAATTACGATCTTTAGTTTGGTTAACTTTATTTGACAAAGAAGTTTCTTCTTCTGAGTGTGAACCAATTCTTTGTCAAGATGAGATATCCACGTTATATCAATACTGTATTAGGTTCCATTTTTTTTTTTTTTTTTTTTTGATGAGAAACATATGTATTCATATAACAAAATAGAACAACCTAAGGGCAAGGGACAAAAGGGCCCTCCCCAGAAGAAACTAAAAAATGATGGCGTTCCAATTGCTGAATATCATTGAAAGACTATAATTACAAAAATGTTTGGTGTAATTCATACTCCACCAAGAAGTTGTGTGTTGCAACACCGTCCAAAAAGAATCAAAAGAATTATAGTCATTCTAAAAAATTCTTCTATTCCTTTCTTTCCAAATGTACCACAAAAGGGACCGAGTAGCACATCTCCATAACATGTTTCTTTTCAGGCTATAACCTCTAATGTTGAGCCCTTCAATCAACCAGCTATCAACCTTACTAGGAAGACAAAGCTCCAAATAAAAAATTCTAAACAAAGTGTTCCAAGCTTTCCTAGTAAAAAGACAGTGCAAAAATATGTGGTCCAAGGTTTCCTCTCCTTTGAAGCAAAGGCAACACATCGAGGGGCTAAGCACAGTGTACTGTATTAGGTTCCATAAGAAAGCTCTATATCATGTGAGAAAAGAGCTAAAGATTTGATTTGAGAAGGGCCATCATCAACTTGATTGTGCAAATTGACTCCCACCTTGATTTTTTGAAACGGGGTTTAGAGACCTGTTTTTTGAGCCATCATTCATCAGTAGCGATCTTCATCAAACTTGGCATTTTTCCTTTTAGACGTGCTCCATCATTTTGTCTCCTTGCTGCCTTTGTACTAGGGCTGATTGATGTAGCCTAAGCAACCTCAAAGAGTAATTATCTAAGCACCAAGATTGTTAATCTTTTTATTAGAATGGATAATGTGCTTCATACAAGAGGATAATACTCCTATTTAAGAGTTTTATTACAAGTGATGGGTAAAAATGGAATTAACCTAGAATATTATCTAATTTCCCAACTAACCCCTATTCTTCTTACATCATTCTACCCTTCCAAAAAGAAAACCTGTTCTCGAGTTTTAAGAAGGAAAAAATTATGAAGCAAATAATTAATGAAATAAACTTGTTCAATGGAATATGACCAAACCAACCTCCGATGTTTTATACAAAAATATTATGATAAATTAGATTAAGTTGAGACAAAAAATCCAATATTTCCACCAAAAGAGGAACCTCTCCATTGACCGTAAAGCCAAAAAAGATCTTTTCCTTCATGCTCATTCAAAACCACATTCAATGGACCAATCACTCAAAGTCTAGAACACCACATTCCAAAGAGAATCGTTCAAGAATCATGTCTTCTCTATCATGCGCTTCAAGTTCTTCAACGATTCGATGATCCATTTCGACTACAATCAATGTGTCATTTTCTTATTTGATTTTTGCTTGGTGTTCGTCCTTTCTTCCAATCTTCTTAGTTGGATTTTTCTTCACTTGAGACTATTTCCAGATCAACATTTTCAAATTCTTGTTTTCCTTCAATTTTATCGTCCAGATTTTCTATTTGAAAATTTTCCAACTATGATTCCTTTGTTTTTTCTTCCGTATGTGAAACCAATTTGTTCTTGAAATCTTGCAAATGAATTGATAGTTGATCGATATGTGAGTCATTTTCCCCAATATGTTGGATTTCTTTTGTTATTTTCTTTATTTCCTCCAGTGCTACGGCCTAAGGGATCAAAGTAGTCATTTTGTTTTTGTTTTTTTTTCTCTCTAATCGTTCTTTGGATAAAAAAAAGAAGAAAAAAGTAGTCATTTTGTTAGGGAATTGAATATGTTTAAATCTCATAGTGGGTATAAGACTCATCCATTGACTTTCTTGAATCAACCTTTCTTGATCGTGTTTCATAACCAAAAGAGTAATTTTGAGTCTTGAGCGTTCTTCTAGGGTTGTAAAAATCCATTCTTGACGATTCTTTAGTTCATTGAACTCCCAAGTTCTACGTTGATTTCTTAAAAAATGGGTTTGATCGAATCATTGGGCCGTTTGTCGATATTTCCGTCTTCTTATTCCATCCCAAGCTTTCTTAGAAATATCATCGGAGTCACTAGAATCACTAGAATCAAATTTCCTTTGTGGATAATGATAGGTTCTTTGAGAAGATTGTTTAGTGATATATAATTAAATTTTCCTTCAACCACTAGCTTAAGCTTTTGGGTGAATTGGTGGTTTAATATCGTATCAGAGCAGGTGGTCCAGGAGGTCCTATGTTTAAGCCTCTACATTGTTGTTTCCTCCCCAATTAAAATTAATTTCCACTTGTTAGGCCTTTCAGATATTTCGAGCCCACAAGTAAGGGGAGTGTTAGTAATTGTAATTAAATTTGCCTTCAACCACTAGCTTAATATTTTGGGTGAATTGGTGGTTTAATAGATTGAACATAGGTAAAAATAGCAAGTTGAAATTCTTCTTCGGAATCACTTTAGTCAAAATTCCAGCACTGTTTTTGGAAGTAAAATTGATTTTCTTGCCTATACAAACTGCTGTGTTTTTCTTGGACATCTTCTTTGATTGGTAATAGTCCTTTCCTCCCCAATTTCTTCTTTCTTTCTTGAAATTTTAAAGGTTCCCCCAAATTAGATGTGTGTTCGACAATAAGCACAACTCATGAAGCTCCTTCTAGTGATTGGCGTCACCAAAAGTTTCTCGATCTTCCATCGTTTGTAGTCAACGTTGGCCCAGGTGGCCGCTCTGAAACTAAAATGATGTAGTCTAAGTAACCTTAAGGAGTAATTATCCAAGAACCAAGACTGTAAATCTTTTTATTAGAATGAATAATGTGTTTCAGACAAGAGGGGAAGGCTCTTATTTATAGAGTTTATTACAAGTGGTGGGTAGAGAGGGAATTAACCTAGAATAGTAACTAATTACCCAATTAACTTCTAGTCTTCTTACATTAGGGGCTCTCTGTTTTTTTTTTTTGTTTTAATCCAACACATTTGGTTGTACTTCTTTTCATTTTGTCTACTTCTTTTTCAGTTGTAATGTTTCCCTCTCTTTTGGGCTTCTTTTTTATTCTTTTCATTTCGTCTACTTCTTTGGATTATTTCATTCATCAATAAATTAATTTTTTATTAAAAAAATCCAACCGAATCCATAATTAGGGACTAAGGGTATTTCAATTTTTCCCATGTTTTTACTATAATTGCTAATTCAGTATTGTCTAGCCTCCAAGTCCTCTTTTATTGCGGTTAGTATTAAAATTCTGTTTTCTGCATGACTTACCTCAGAGGCAGAAATCCTCGCAGCCCCACCAAAGATTGTCTACAGCAAACTGATCTTACGGTAAACTTGGTTTATACTCCCCTACCTTTGGCAGAGGATGATAAGATGAAAATTTTGTATCACTTTCATCTATTTTATCAATATTATTCACTTTAAGAATGAAATAAGGCAAGCGATCTTCTTCATGGTTTTCAGGTTTACACGAAAACTTTTGGTTGCAGTTGGGGACGGATGGGACAGTCGTGCGCTTAAAATTGAAAAAGTCATTCCTCCAACTTGGAAGGTCGTGAAAAAAGCTTTTTTTTGTTTACTATTTTATTTTGTGATTCTCTTCTTTATGTTGTTATCATGTGGTTAGAATACTCCTCCTGAATTGATAGGTGGAGGATTATGTCCCTCTCCAAAGGGGGTAGGCTAACTCTTATGCAATTGGTTCTCAACAGCCTTCCTTGTTACTTATTTTCCCTTGCACAAGCCCCGGTTGGTGTTATCAATAGACTGGAAAAGATTATTTGAAATTTTGTTTGGATAGGGTTTATTAATTCGGTTTTCCACATTGTCAAATGGGATTGTTCTTCTCGCCTAATCTCTTATGGTGGCCTTGAGATTGGATCATTTAGGCAAAAGAACACTGCTCTTGTCAATAAATGGCTTTGAAGGTTTAGTAAGGAAGAAACCACTTAATGGAGGCGCTTAATTGTGGCCATTTACCGCTTAGAGGAGAGGGTGGTCTACTAAAGCTCCGAACAGAGGAAGGTCTTATAGATTGTGGGCCGTTATTTTGAAGCATAAGGAGACTTATAAATTCACAGCTTTTTTGTTGGGAGAGGGAACAAAAATCAGATTCTGGAAAGATAGTTGGTGTGGCGTGGAACCGCTTGCATAAAAATTCCTCTACCTTGTTCTCCTTGGCACTGAACAAAGATGCTTTTGTGTCTGAATGTTGGTGTAATGTTACTCATTCGTGGAACTTGGGCCTTAGAAGAAATGTGCTCGATAATGAGCTTGACAACGTGGCCTCAATTCTAAAAAATTTCCATCCTTGGCCCCCTCAGATGGTGGTGATAGTCTTAAATGGACTTCTAACGTTAATGGCAGCTTTACTACAAAGTCTACTTTCCGTAATTTTACCAAGAGATCTCCCTCCATGCTGCTCCTCCGATTCACCACATGTGGAAAACTAAAATCTCGAAAAAGGTGAAATTCTTCTTTTGGTCGCTCGCCTATAGAAGTCTAACACTCGTGAGAAAATTTAAAAAAAAAAAAAAAAATCCAACACATATGCTCAGCCCCTCAATGTTCTGTTCTTGTTTCAAAGATGAGGAGACCCTAAACCACTTATTTTTGCACTCTCTTTACTAGGAAAGCTTGGAACAATTTGTTTAGAATTTTGATTTGGAGCTTTGTCTTCCTAGTAAGGTTGATAGTTGGATGCTGGAAGGGCTTAACATTAGAGGTTATAGTTCGAAAGGAAACATCTTATGGAGATGTGCTACCCAGTTTCTTTTGTGGTGCATTTGGAAATAAAGGAATAGTAGAATTTTTTAAGATAGCTATAACTCTTTTGATTCTTTTTGGACTGTGTTGCAACACACAACTTCTTAGTGGAGTACGAATTACACCAAAGACTTCTGTAATTACAGCCTTTCTATGATTTTCAACAATTGGAAGGTGATTATTCTTTAGCTTCTGTTGGGGAGGGCCCTCTTTGTTTCCAAATTTATTTAAAGATTTTGATTGTTCAATCTTTTAAAGTGAAAGTTGCATTTCTGCTAAACATCATCGACCCAAACCTTGTAAGGGTTCCTCGCCTTTTTACTTCATTCACACTAATGTGTGGGGTCCGTCTAGAGTGTTGGCTTATAGTGGTAAGCGTTGAGTTGGGTTGGTTTGTTACCTTGATCACACTTGTTTAACTTGATTTTATCTTTTAACAAAAAAAATGAAGGTAGAAGAGGTTTTTGTTTGGTTTTACAATATGATTGAGACCTAGTCTCAAACTAAAATCCGCATTCTTCACTCTGATAATAGTACTGAATATTTTAACAAACAATTAACTTATTTTTTGCAAGATAAGGGTATTTTTCCATCAAGCTACATGTCAAGATACTCTTTAGCAAAATGACGTCGTTGAGCGAAAAAATAGATATTTACTTGAAGTTGCTTGTGCCCTTATGCTTTCTATGCATGTACCAAAATATTTGTGTGGTGATGCAGTCCTTACTGCTGCATACCTCATAAATCAAATGCCAACCAAGATTTTGAATTTTAAAACTCCTTTGAGTCGCTTCAAAACGGGTTTTTTCCTACTGTTAGATTGTTTTCTGACTTAACAATAAAAGTATTTGGGTGTATTGTTTATGTTCATATTCTTTACCCTATACTAACTAAACTTGATCCTTGAGCCGTTAAATACATTTCTGTATGCTATGTCTCCCATAAGAAGGCTTTCAAATGTTTTGACCCTTCAACCAAAAAGTTTTTGGAGAGTATGGACGTGCCTTTCTGTCTTCAAAGGGAGACATCTAACCTCGAAGATAATTTCTAGGACACTTCACTTACCCCAGACATCATTGGTCCTGAAATTATGAGTTATAATCCCTTGATGCCAAGTGTGGAAAGTTCTTCTTTAGGGAAGAAACACTACAAAAACCTGAACTTCAGATTTATACTAGAAAAACCTTGCCTCAAAGTAATTGAGAATAAACAGTTGATCTATCATAGGACCAATCTAATTCTTCGATGAATGATTCTGCAGATCTAGGTAACATACATCTCCTTCTCATAATTCTCCCAGTACTTCTCATAATTTTCCTAGTTCTCCTTACCCGATGTCTCTAATCTTGACATTCCAATTGCCCATAGGAAAATCACCCGTAAAGGTGCCAAATATCCCATTGCAAACTATCTTTCTTTATCACAGATTATCTAACAGTCATAAAGCCTTCACATCCAAAATAAACCACCTATTTGTTAGAAGGAATACACAGGAGGCCCTAAATGATTTGAATTGGAAAGTGGCAGTAATGGAAGAGATGAATGCACTGAAACAAAACTGCACATGGGACATAGTTGAACTACCTAAAAACAAGAAAACGGTTGGATGTAAGCGAGTGTTCACTGTATAATGTAAAGCTGATGGTAGTATTGAAAGGTACGAGGTCAGATTGGTTGCTAAGAAATTTACTCTGACCTATGGAGTTGATTATCAAGAAATATTTGCTCCAGTTGCTGAAATTAATTCCATAAGAATTTTGCTACATGTTGCAGTTAATTTTGATTGCCACTTTATCAACTGGATGTTAAGAATGACTTTCTCAATTGGATCATGAAGAAGAGGTATTTATACGTTTGCTACCTGGTTTTGAGGTGATCTCAGGATTAACAAAGTGTGAGTTAAAGTAATCATTATATGAGTCTCCTAGAGTCTGGTTTGAACGTTTTAAAAAAGCAGTCACAAGCTATGGATCTAGTCAAAGTCAAGTCAATCGTACTATGCTCTATAAACATACAGGAAATGACAAGGTTATGTTTTTGATAGTTTATGTTCATGATATCATTTTTGATGAGACATGACTTTCTTGAAGAAAAAGTTAGTTGATGATTTTCAAATCAGACACCTAGGAACTATGAAATATTAACCAGGCATGGAGTTTACTAGGTCCAGAAGTGGCATTCTTGTCAACCGAAGGAAGTATATTTTTTATCTACGGAAAGAAATAGGTTTACTTGGCTGTAGGGTGGTTGAAACTCCTATTAAGCAGAACTTGAAATTAGAAGTTGTTGAAAAAGAAATAAAAGAAAAAGAGAAGTATCAGAGATTTGTGGGGAGACTTACATATCTCGATCACACACGTCCTGACATCGCTTTCGCAGTGAATACTTAGTATGGGAAGTCAGTTCATGCATGCTCCTGGACCAGCTCACTTTGATGTTCACAGAATTCTAATATATTTGAAAAGTACTCTAGGAAAAGGCATATTGTTTTAAAAACATGACCACTAAATGTCGAAGTGTACACTGATACAGATTGGTTAGGTAGTACAACTGATAAAAGAGCCACTTCTGGTTATTGCTTTTGTTGGAGGAAATTTAGTTACTTGATGAAGCAAAAAATAGAGCATGGTTGCGAGTAGCGCAGAAGCAGAATTTAAAGCATCAACCCATGGTATTTGTAAGGGCATATGGATAAGAAGACTATTGGAAGAATTGAGATTCTCCCAGAAAATGCCTATGCTCATTTATTGTGATAACAAGGCAACAATTTCCATTGCCCACAGTCCAGTCCTTCATGATAGGACGGAACATAGTGAAGGTGATAAACACTTTATGAAGGAAAAGATTGTTCAAGGGATAATATGCATCCCTTATCTTTTGAGAACAAAACAAATCGCAAATGTGTTAACTAAAGGTCTTCCAAAATAGCTATTCAACAACTTAATTGAAAAGTTGGCTATGGATGATATCTTAACTTGAGGGGGAGTGTTGATTATTTCCTTTATTGTTAATATTTTCATTGTATTAAAGAAACATTTGTGTATTTGTTTTTTCCTATTTTGTATTGGGTATTTCTTCTATTTAATAAAACCCTTTCTTTCGAAGAGAAATAACAGAAAATACATTCTGCACAACATGTTTTATTATTCTATTCCATAAATCATTCCATCTTCAGACATATCAGTTGCACGGATACAAAGCAAGGATCATGCACTTGGACTTGAACTTCTTGAATTTGCAAATTTTTTAGAATGTTTAAGATTTGTAATAGTTAAGGAGATTAAATATCTCATGCATTCGTATATATTTCAGAACAAGCCAGCAGGACGGATTCTAGAGCTTTGTATTCTCCACCTGGCCATGTCTGAAATAACGGTTATTGGAACAAGGCATCAGATTGTCATTAATGAGGTAAATATGTCTTAAGATGTTTTACATGAAGGTCAAAACCAACATCTCTTATGTCTGTTTGATATCTGTCTAAGATTACTCCCTTTTCTAACATAAATTATTTTTTTGTTAAAAAAAAATATATCGTTCTTGGTTATTGATGAAGGCCGTTGATCTTGCAAAACGATTCTGTGATGGAGCAGCACCTCGTATTATTAATGGGTGCCTTAGGACCTTTGTAAAGGACATCAAAGAGATTGATTCAATGCCTGCTCGAGAGAAGTAAGAAGTCCGTGCATGACCCTGAACTTTTGGGGTGCCTCAGAACCTTTGTAAGGATATCGAAGAATTTGACTCAACTCATGCTGTGGTGCAAGTAAGCATTTCTGAAATCTGAACTTTTGGATAGTTTGTTAACGTGCATGGTTGAGAGTGATTTTGAAATGATTAAAATTTCTTTTACCATGTTTAAAATAATTTCGAAACATGGATGGATGTGATCATTCAATCAATTTAATATTTAATTTTATACTTTTAAATACAATTTTATATTATTAGAATTGATTTTGAATGACAAAATATATTTCAAAGTAATTTTCATCATTTCAGAATCACTTTCAAACATGCTCGGAACTTGTTACAGTCAACAGCAGCCAAAGTGAAGTGATTTGAAGAATCCCTTGTGCTTGCCGCAATCCTCAGTTAGAATTTCTCTCCCGTTCTCCATGTTGCGGTGGATTTCTGTTATATTTGATTATCCATAGAGTTGAGTAATGGAGGTGCATGAGTGTGAGAGAATGAGTAGTCAGATTTACCCAGTACTGGGCGACATGTTATTATATGCACAATATTATCAGTGTATGAAGTTTTGATAAAGGTGTGATGTCTCGTTTGTACCAACCTCTGTTTATCATGTTTTCATTATGTGTATTTTACTTTGCATTTATTTTTGTGTGATTCCTCACTACTTTGCATGTGGCTCAAAACGTATTATGGGG

mRNA sequence

ATGTCTTTAGCTCCACCCACCTCCCTTTATCCCTATTCTTCCCACTCTCATCCCCTCCCCAAATCCCATTTATCTTCCTTTACTCAACGTTCCCATCCCAACTTCTTTTCCTTATGCTTTCCCTTTCATCTTAGTTTCTCCACCTCCTTTTCAACCCTTCACTCCCTTAAATACTCAGCTTTCAAAACCGACACTGGACTCGGAGATTCCCATGATGCTGACCAACCTCATAGTTTGAAATTTGCACCTGGAGCTTCTGCCCACAAGACTCGACTCTTCACCGTTGGGGATAAAGTTATCACTACCAGGCCTAATTTTCATTTCTCCTACCACATTTCTGGAATATGTCAGTTCCCATTTCATGCTAGTTCAATTGTTCCCCACATAAAAGATTCCATGCCACGTTTCTGCTGTCAAGCCTCGCTTCGGGCTTCTACTTCTTTTTCTGAAAATCGTGTGGCTGAAGAGAGGAGTTCTATTTCAATATCTTCCATTGAGATGATTCCAAAGGTTGACAAGAGCGGCAAATTTTGTAGTCCGAGAGCTGCTAGAGAGCTTGCTTTGTCAATTGTTTATGCAGCTTGTTTAGAAGGCTCTGATCCTGTTCGGCTTTTTGAGAAAAGGTTAAATGCCCGACGAGAATCAGGATATGAATTTGACAAGACATCATTGATGGAATATAATCATATGAGCTTTGGAGGTCCGCCGGTTACCGTTGAAACAATTGAAGAGGCAGATGAGCTCTTACGTAAGGATGAAAGGGATTCTACAATTGAGGCAGAAATCCTCGCAGCCCCACCAAAGATTGTCTACAGCAAACTGATCTTACGGTTTACACGAAAACTTTTGGTTGCAGTTGGGGACGGATGGGACAGTCGTGCGCTTAAAATTGAAAAAGTCATTCCTCCAACTTGGAAGAACAAGCCAGCAGGACGGATTCTAGAGCTTTGTATTCTCCACCTGGCCATGTCTGAAATAACGGTTATTGGAACAAGGCATCAGATTGTCATTAATGAGGCCGTTGATCTTGCAAAACGATTCTGTGATGGAGCAGCACCTCGTATTATTAATGGGTGCCTTAGGACCTTTGTAAAGGACATCAAAGAGATTGATTCAATGCCTGCTCGAGAGAAGTAA

Coding sequence (CDS)

ATGTCTTTAGCTCCACCCACCTCCCTTTATCCCTATTCTTCCCACTCTCATCCCCTCCCCAAATCCCATTTATCTTCCTTTACTCAACGTTCCCATCCCAACTTCTTTTCCTTATGCTTTCCCTTTCATCTTAGTTTCTCCACCTCCTTTTCAACCCTTCACTCCCTTAAATACTCAGCTTTCAAAACCGACACTGGACTCGGAGATTCCCATGATGCTGACCAACCTCATAGTTTGAAATTTGCACCTGGAGCTTCTGCCCACAAGACTCGACTCTTCACCGTTGGGGATAAAGTTATCACTACCAGGCCTAATTTTCATTTCTCCTACCACATTTCTGGAATATGTCAGTTCCCATTTCATGCTAGTTCAATTGTTCCCCACATAAAAGATTCCATGCCACGTTTCTGCTGTCAAGCCTCGCTTCGGGCTTCTACTTCTTTTTCTGAAAATCGTGTGGCTGAAGAGAGGAGTTCTATTTCAATATCTTCCATTGAGATGATTCCAAAGGTTGACAAGAGCGGCAAATTTTGTAGTCCGAGAGCTGCTAGAGAGCTTGCTTTGTCAATTGTTTATGCAGCTTGTTTAGAAGGCTCTGATCCTGTTCGGCTTTTTGAGAAAAGGTTAAATGCCCGACGAGAATCAGGATATGAATTTGACAAGACATCATTGATGGAATATAATCATATGAGCTTTGGAGGTCCGCCGGTTACCGTTGAAACAATTGAAGAGGCAGATGAGCTCTTACGTAAGGATGAAAGGGATTCTACAATTGAGGCAGAAATCCTCGCAGCCCCACCAAAGATTGTCTACAGCAAACTGATCTTACGGTTTACACGAAAACTTTTGGTTGCAGTTGGGGACGGATGGGACAGTCGTGCGCTTAAAATTGAAAAAGTCATTCCTCCAACTTGGAAGAACAAGCCAGCAGGACGGATTCTAGAGCTTTGTATTCTCCACCTGGCCATGTCTGAAATAACGGTTATTGGAACAAGGCATCAGATTGTCATTAATGAGGCCGTTGATCTTGCAAAACGATTCTGTGATGGAGCAGCACCTCGTATTATTAATGGGTGCCTTAGGACCTTTGTAAAGGACATCAAAGAGATTGATTCAATGCCTGCTCGAGAGAAGTAA

Protein sequence

MSLAPPTSLYPYSSHSHPLPKSHLSSFTQRSHPNFFSLCFPFHLSFSTSFSTLHSLKYSAFKTDTGLGDSHDADQPHSLKFAPGASAHKTRLFTVGDKVITTRPNFHFSYHISGICQFPFHASSIVPHIKDSMPRFCCQASLRASTSFSENRVAEERSSISISSIEMIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETIEEADELLRKDERDSTIEAEILAAPPKIVYSKLILRFTRKLLVAVGDGWDSRALKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSMPAREK
BLAST of CsaV3_4G017570.1 vs. NCBI nr
Match: KGN54018.1 (hypothetical protein Csa_4G268020 [Cucumis sativus])

HSP 1 Score: 756.5 bits (1952), Expect = 4.3e-215
Identity = 378/378 (100.00%), Postives = 378/378 (100.00%), Query Frame = 0

Query: 1   MSLAPPTSLYPYSSHSHPLPKSHLSSFTQRSHPNFFSLCFPFHLSFSTSFSTLHSLKYSA 60
           MSLAPPTSLYPYSSHSHPLPKSHLSSFTQRSHPNFFSLCFPFHLSFSTSFSTLHSLKYSA
Sbjct: 1   MSLAPPTSLYPYSSHSHPLPKSHLSSFTQRSHPNFFSLCFPFHLSFSTSFSTLHSLKYSA 60

Query: 61  FKTDTGLGDSHDADQPHSLKFAPGASAHKTRLFTVGDKVITTRPNFHFSYHISGICQFPF 120
           FKTDTGLGDSHDADQPHSLKFAPGASAHKTRLFTVGDKVITTRPNFHFSYHISGICQFPF
Sbjct: 61  FKTDTGLGDSHDADQPHSLKFAPGASAHKTRLFTVGDKVITTRPNFHFSYHISGICQFPF 120

Query: 121 HASSIVPHIKDSMPRFCCQASLRASTSFSENRVAEERSSISISSIEMIPKVDKSGKFCSP 180
           HASSIVPHIKDSMPRFCCQASLRASTSFSENRVAEERSSISISSIEMIPKVDKSGKFCSP
Sbjct: 121 HASSIVPHIKDSMPRFCCQASLRASTSFSENRVAEERSSISISSIEMIPKVDKSGKFCSP 180

Query: 181 RAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVE 240
           RAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVE
Sbjct: 181 RAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVE 240

Query: 241 TIEEADELLRKDERDSTIEAEILAAPPKIVYSKLILRFTRKLLVAVGDGWDSRALKIEKV 300
           TIEEADELLRKDERDSTIEAEILAAPPKIVYSKLILRFTRKLLVAVGDGWDSRALKIEKV
Sbjct: 241 TIEEADELLRKDERDSTIEAEILAAPPKIVYSKLILRFTRKLLVAVGDGWDSRALKIEKV 300

Query: 301 IPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCL 360
           IPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCL
Sbjct: 301 IPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCL 360

Query: 361 RTFVKDIKEIDSMPAREK 379
           RTFVKDIKEIDSMPAREK
Sbjct: 361 RTFVKDIKEIDSMPAREK 378

BLAST of CsaV3_4G017570.1 vs. NCBI nr
Match: XP_008449897.1 (PREDICTED: uncharacterized protein LOC103491638 isoform X1 [Cucumis melo])

HSP 1 Score: 691.4 bits (1783), Expect = 1.7e-195
Identity = 351/389 (90.23%), Postives = 360/389 (92.54%), Query Frame = 0

Query: 1   MSLAPPTSLYPYSSHSHPLPKSHLSSFTQRSHPNFFSLCFPFHLSFSTSFSTLHSLKYSA 60
           MSLAPPTSLY YSSHSHPLPKSHLSSFTQRSHPN F + F FHLSFSTSFSTLHS K SA
Sbjct: 1   MSLAPPTSLYLYSSHSHPLPKSHLSSFTQRSHPNLFPVRFLFHLSFSTSFSTLHSFKSSA 60

Query: 61  FKTDTGLGDSHDA-----------DQPHSLKFAPGASAHKTRLFTVGDKVITTRPNFHFS 120
           FK D GLGDSHDA           +QPH LKFAPGASAHKTRLF VGDKVITTRPNFHFS
Sbjct: 61  FKIDIGLGDSHDAGQPREQYQVEQEQPHGLKFAPGASAHKTRLFNVGDKVITTRPNFHFS 120

Query: 121 YHISGICQFPFHASSIVPHIKDSMPRFCCQASLRASTSFSENRVAEERSSISISSIEMIP 180
            HISGICQFPFHASSIVPH+K+SMPR CCQASLRASTSF ENRVAEERSSIS+SSIE IP
Sbjct: 121 NHISGICQFPFHASSIVPHVKNSMPRLCCQASLRASTSFPENRVAEERSSISVSSIETIP 180

Query: 181 KVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNH 240
           K+DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFDKTSLMEYNH
Sbjct: 181 KIDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNSRRESGYEFDKTSLMEYNH 240

Query: 241 MSFGGPPVTVETIEEADELLRKDERDSTIEAEILAAPPKIVYSKLILRFTRKLLVAVGDG 300
           MSFGGPPVTVETIEEADELLRKDERDSTIEAEILAAPPK+VYSKLILRFTRKLLVAV DG
Sbjct: 241 MSFGGPPVTVETIEEADELLRKDERDSTIEAEILAAPPKMVYSKLILRFTRKLLVAVVDG 300

Query: 301 WDSRALKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCD 360
           WD+RALKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCD
Sbjct: 301 WDNRALKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCD 360

Query: 361 GAAPRIINGCLRTFVKDIKEIDSMPAREK 379
           GAAPRIINGCLRTFVKDIKE DS PAREK
Sbjct: 361 GAAPRIINGCLRTFVKDIKETDSTPAREK 389

BLAST of CsaV3_4G017570.1 vs. NCBI nr
Match: XP_004149639.2 (PREDICTED: uncharacterized protein LOC101216754 [Cucumis sativus])

HSP 1 Score: 609.8 bits (1571), Expect = 6.5e-171
Identity = 304/304 (100.00%), Postives = 304/304 (100.00%), Query Frame = 0

Query: 75  QPHSLKFAPGASAHKTRLFTVGDKVITTRPNFHFSYHISGICQFPFHASSIVPHIKDSMP 134
           QPHSLKFAPGASAHKTRLFTVGDKVITTRPNFHFSYHISGICQFPFHASSIVPHIKDSMP
Sbjct: 11  QPHSLKFAPGASAHKTRLFTVGDKVITTRPNFHFSYHISGICQFPFHASSIVPHIKDSMP 70

Query: 135 RFCCQASLRASTSFSENRVAEERSSISISSIEMIPKVDKSGKFCSPRAARELALSIVYAA 194
           RFCCQASLRASTSFSENRVAEERSSISISSIEMIPKVDKSGKFCSPRAARELALSIVYAA
Sbjct: 71  RFCCQASLRASTSFSENRVAEERSSISISSIEMIPKVDKSGKFCSPRAARELALSIVYAA 130

Query: 195 CLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETIEEADELLRKDER 254
           CLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETIEEADELLRKDER
Sbjct: 131 CLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETIEEADELLRKDER 190

Query: 255 DSTIEAEILAAPPKIVYSKLILRFTRKLLVAVGDGWDSRALKIEKVIPPTWKNKPAGRIL 314
           DSTIEAEILAAPPKIVYSKLILRFTRKLLVAVGDGWDSRALKIEKVIPPTWKNKPAGRIL
Sbjct: 191 DSTIEAEILAAPPKIVYSKLILRFTRKLLVAVGDGWDSRALKIEKVIPPTWKNKPAGRIL 250

Query: 315 ELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSMP 374
           ELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSMP
Sbjct: 251 ELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSMP 310

Query: 375 AREK 379
           AREK
Sbjct: 311 AREK 314

BLAST of CsaV3_4G017570.1 vs. NCBI nr
Match: XP_016900769.1 (PREDICTED: uncharacterized protein LOC103491638 isoform X2 [Cucumis melo])

HSP 1 Score: 526.9 bits (1356), Expect = 5.5e-146
Identity = 261/275 (94.91%), Postives = 268/275 (97.45%), Query Frame = 0

Query: 104 PNFHFSYHISGICQFPFHASSIVPHIKDSMPRFCCQASLRASTSFSENRVAEERSSISIS 163
           PNFHFS HISGICQFPFHASSIVPH+K+SMPR CCQASLRASTSF ENRVAEERSSIS+S
Sbjct: 15  PNFHFSNHISGICQFPFHASSIVPHVKNSMPRLCCQASLRASTSFPENRVAEERSSISVS 74

Query: 164 SIEMIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTS 223
           SIE IPK+DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFDKTS
Sbjct: 75  SIETIPKIDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNSRRESGYEFDKTS 134

Query: 224 LMEYNHMSFGGPPVTVETIEEADELLRKDERDSTIEAEILAAPPKIVYSKLILRFTRKLL 283
           LMEYNHMSFGGPPVTVETIEEADELLRKDERDSTIEAEILAAPPK+VYSKLILRFTRKLL
Sbjct: 135 LMEYNHMSFGGPPVTVETIEEADELLRKDERDSTIEAEILAAPPKMVYSKLILRFTRKLL 194

Query: 284 VAVGDGWDSRALKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDL 343
           VAV DGWD+RALKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDL
Sbjct: 195 VAVVDGWDNRALKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDL 254

Query: 344 AKRFCDGAAPRIINGCLRTFVKDIKEIDSMPAREK 379
           AKRFCDGAAPRIINGCLRTFVKDIKE DS PAREK
Sbjct: 255 AKRFCDGAAPRIINGCLRTFVKDIKETDSTPAREK 289

BLAST of CsaV3_4G017570.1 vs. NCBI nr
Match: XP_023007062.1 (uncharacterized protein LOC111499665 [Cucurbita maxima])

HSP 1 Score: 475.3 bits (1222), Expect = 1.9e-130
Identity = 235/282 (83.33%), Postives = 255/282 (90.43%), Query Frame = 0

Query: 97  DKVITTRPNFHFSYHISGICQFPFHASSIVPHIKDSMPRFCCQASLRASTSFSENRVAEE 156
           D VI TRPNFHFS H+SGIC FPF  S+IVP++K+S+P  C QASLRASTSFS+N V +E
Sbjct: 15  DNVIATRPNFHFSSHVSGICCFPFPTSAIVPYVKESVPLLCPQASLRASTSFSKNSVGKE 74

Query: 157 RSSISISSIEMIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESG 216
            SS+ +SSIE IPK+DKSG+FCSPRAARELALSI+YA+CLEGSDPVRLFEKRLNAR E G
Sbjct: 75  SSSVLVSSIEPIPKIDKSGRFCSPRAARELALSIIYASCLEGSDPVRLFEKRLNARLEPG 134

Query: 217 YEFDKTSLMEYNHMSFGGPPVTVETIEEADELLRKDERDSTIEAEILAAPPKIVYSKLIL 276
           YEFDKTSLMEYNHMSFGGPPVTVET EEADELLRKDE+DS IEAEILAAPPK+VYSKLIL
Sbjct: 135 YEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPKVVYSKLIL 194

Query: 277 RFTRKLLVAVGDGWDSRALKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIV 336
           RFTRKLLVAV D WDS  LKI+KVIP TWK+KPAGRILELCILHLAMSEITV+GTRHQIV
Sbjct: 195 RFTRKLLVAVVDRWDSHVLKIDKVIPSTWKDKPAGRILELCILHLAMSEITVVGTRHQIV 254

Query: 337 INEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSMPAREK 379
           INEAVDLAKRFCDGAAPRIINGCLRTFVKDI+E DS  AR K
Sbjct: 255 INEAVDLAKRFCDGAAPRIINGCLRTFVKDIQETDSTHARVK 296

BLAST of CsaV3_4G017570.1 vs. TAIR10
Match: AT4G26370.1 (antitermination NusB domain-containing protein)

HSP 1 Score: 318.9 bits (816), Expect = 4.1e-87
Identity = 157/220 (71.36%), Postives = 182/220 (82.73%), Query Frame = 0

Query: 149 SENRVAEERSSISISSIEMI--PKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFE 208
           S  R A    +IS   ++ +  PK+DKSG+  SPRAARELAL I+YAACLEGSDP+RLFE
Sbjct: 64  SPTRSALRTPTISAEEVKDVPMPKIDKSGRLSSPRAARELALVILYAACLEGSDPIRLFE 123

Query: 209 KRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETIEEADELLRKDERDSTIEAEILAAP 268
           KR+NARRE GYEFDK+SL+EYNHMSFGGPPV  ET EE DEL+R DE++S IEAE+L+AP
Sbjct: 124 KRINARREPGYEFDKSSLLEYNHMSFGGPPVKTETKEEEDELVRHDEKESKIEAEVLSAP 183

Query: 269 PKIVYSKLILRFTRKLLVAVGDGWDSRALKIEKVIPPTWKNKPAGRILELCILHLAMSEI 328
           PK+VYSKL+LRF +KLL AV D WDS  + IEK+ PP WK+ PAGRILE  ILHLAMSE+
Sbjct: 184 PKLVYSKLVLRFAKKLLAAVVDKWDSHVVIIEKISPPDWKSAPAGRILEFSILHLAMSEV 243

Query: 329 TVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKD 367
            V+ TRH IVINEAVDLAKRFCDG+APRIINGCLRTFVKD
Sbjct: 244 AVLETRHPIVINEAVDLAKRFCDGSAPRIINGCLRTFVKD 283

BLAST of CsaV3_4G017570.1 vs. Swiss-Prot
Match: sp|Q18B61|NUSB_PEPD6 (Transcription antitermination protein NusB OS=Peptoclostridium difficile (strain 630) OX=272563 GN=nusB PE=3 SV=1)

HSP 1 Score: 54.7 bits (130), Expect = 2.6e-06
Identity = 29/74 (39.19%), Postives = 48/74 (64.86%), Query Frame = 0

Query: 296 KIEKVIPPTWKNKPAGRI--LELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAP 355
           KI+++I    KN    R+  +++ IL L++ EI  + T +++ INEAV+LAK +CD  +P
Sbjct: 94  KIDELINKHAKNWTVDRMPKVDVSILRLSVCEILYLDTPNKVSINEAVELAKIYCDDKSP 153

Query: 356 RIINGCLRTFVKDI 368
           + ING L + V +I
Sbjct: 154 KFINGILGSVVDEI 167

BLAST of CsaV3_4G017570.1 vs. Swiss-Prot
Match: sp|A7GWZ7|NUSB_CAMC5 (Transcription antitermination protein NusB OS=Campylobacter curvus (strain 525.92) OX=360105 GN=nusB PE=3 SV=1)

HSP 1 Score: 51.2 bits (121), Expect = 2.9e-05
Identity = 26/67 (38.81%), Postives = 41/67 (61.19%), Query Frame = 0

Query: 296 KIEKVIPPTWKNKPAGR--ILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAP 355
           ++++++ P  K K   R  I+EL IL L + E+   GT   ++INEA++LAK     +AP
Sbjct: 58  ELDEILKPYLKEKDIERIGIVELAILRLGVYEMKFTGTDKAVIINEAIELAKELGGDSAP 117

Query: 356 RIINGCL 361
           + ING L
Sbjct: 118 KFINGVL 124

BLAST of CsaV3_4G017570.1 vs. Swiss-Prot
Match: sp|B1WXY6|NUSB_CYAA5 (Transcription antitermination protein NusB OS=Cyanothece sp. (strain ATCC 51142) OX=43989 GN=nusB PE=3 SV=1)

HSP 1 Score: 50.4 bits (119), Expect = 4.9e-05
Identity = 28/64 (43.75%), Postives = 38/64 (59.38%), Query Frame = 0

Query: 305 WKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFV 364
           W+ K   +I +  IL LA++EI  +    ++ INEAV+LAKR+ D    R ING LR F 
Sbjct: 145 WQLKRLAKI-DQDILRLAVAEILFLDVPEKVSINEAVELAKRYSDDDGYRFINGVLRRFT 204

Query: 365 KDIK 369
             IK
Sbjct: 205 DHIK 207

BLAST of CsaV3_4G017570.1 vs. Swiss-Prot
Match: sp|B1XIZ3|NUSB_SYNP2 (Transcription antitermination protein NusB OS=Synechococcus sp. (strain ATCC 27264 / PCC 7002 / PR-6) OX=32049 GN=nusB PE=3 SV=1)

HSP 1 Score: 49.7 bits (117), Expect = 8.4e-05
Identity = 25/60 (41.67%), Postives = 38/60 (63.33%), Query Frame = 0

Query: 314 LELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSM 373
           L+  IL +A++EI  + T +++ INEAV+LAKR+ D    R ING LR     ++  DS+
Sbjct: 153 LDRDILRIAVAEILFLETPYKVAINEAVELAKRYSDEDGHRFINGVLRRVSDRLRAEDSL 212

BLAST of CsaV3_4G017570.1 vs. Swiss-Prot
Match: sp|Q8GIR7|NUSB_SYNE7 (Transcription antitermination protein NusB OS=Synechococcus elongatus (strain PCC 7942) OX=1140 GN=nusB PE=3 SV=1)

HSP 1 Score: 48.5 bits (114), Expect = 1.9e-04
Identity = 25/48 (52.08%), Postives = 31/48 (64.58%), Query Frame = 0

Query: 314 LELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLR 362
           L+  IL LA +EI  +GT  Q+ INEAV+LA R+ D    R ING LR
Sbjct: 149 LDQDILRLAAAEILFLGTPEQVAINEAVELANRYSDEEGRRFINGVLR 196

BLAST of CsaV3_4G017570.1 vs. TrEMBL
Match: tr|A0A0A0KWZ5|A0A0A0KWZ5_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G268020 PE=4 SV=1)

HSP 1 Score: 756.5 bits (1952), Expect = 2.8e-215
Identity = 378/378 (100.00%), Postives = 378/378 (100.00%), Query Frame = 0

Query: 1   MSLAPPTSLYPYSSHSHPLPKSHLSSFTQRSHPNFFSLCFPFHLSFSTSFSTLHSLKYSA 60
           MSLAPPTSLYPYSSHSHPLPKSHLSSFTQRSHPNFFSLCFPFHLSFSTSFSTLHSLKYSA
Sbjct: 1   MSLAPPTSLYPYSSHSHPLPKSHLSSFTQRSHPNFFSLCFPFHLSFSTSFSTLHSLKYSA 60

Query: 61  FKTDTGLGDSHDADQPHSLKFAPGASAHKTRLFTVGDKVITTRPNFHFSYHISGICQFPF 120
           FKTDTGLGDSHDADQPHSLKFAPGASAHKTRLFTVGDKVITTRPNFHFSYHISGICQFPF
Sbjct: 61  FKTDTGLGDSHDADQPHSLKFAPGASAHKTRLFTVGDKVITTRPNFHFSYHISGICQFPF 120

Query: 121 HASSIVPHIKDSMPRFCCQASLRASTSFSENRVAEERSSISISSIEMIPKVDKSGKFCSP 180
           HASSIVPHIKDSMPRFCCQASLRASTSFSENRVAEERSSISISSIEMIPKVDKSGKFCSP
Sbjct: 121 HASSIVPHIKDSMPRFCCQASLRASTSFSENRVAEERSSISISSIEMIPKVDKSGKFCSP 180

Query: 181 RAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVE 240
           RAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVE
Sbjct: 181 RAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVE 240

Query: 241 TIEEADELLRKDERDSTIEAEILAAPPKIVYSKLILRFTRKLLVAVGDGWDSRALKIEKV 300
           TIEEADELLRKDERDSTIEAEILAAPPKIVYSKLILRFTRKLLVAVGDGWDSRALKIEKV
Sbjct: 241 TIEEADELLRKDERDSTIEAEILAAPPKIVYSKLILRFTRKLLVAVGDGWDSRALKIEKV 300

Query: 301 IPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCL 360
           IPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCL
Sbjct: 301 IPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCL 360

Query: 361 RTFVKDIKEIDSMPAREK 379
           RTFVKDIKEIDSMPAREK
Sbjct: 361 RTFVKDIKEIDSMPAREK 378

BLAST of CsaV3_4G017570.1 vs. TrEMBL
Match: tr|A0A1S3BNR2|A0A1S3BNR2_CUCME (uncharacterized protein LOC103491638 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103491638 PE=4 SV=1)

HSP 1 Score: 691.4 bits (1783), Expect = 1.1e-195
Identity = 351/389 (90.23%), Postives = 360/389 (92.54%), Query Frame = 0

Query: 1   MSLAPPTSLYPYSSHSHPLPKSHLSSFTQRSHPNFFSLCFPFHLSFSTSFSTLHSLKYSA 60
           MSLAPPTSLY YSSHSHPLPKSHLSSFTQRSHPN F + F FHLSFSTSFSTLHS K SA
Sbjct: 1   MSLAPPTSLYLYSSHSHPLPKSHLSSFTQRSHPNLFPVRFLFHLSFSTSFSTLHSFKSSA 60

Query: 61  FKTDTGLGDSHDA-----------DQPHSLKFAPGASAHKTRLFTVGDKVITTRPNFHFS 120
           FK D GLGDSHDA           +QPH LKFAPGASAHKTRLF VGDKVITTRPNFHFS
Sbjct: 61  FKIDIGLGDSHDAGQPREQYQVEQEQPHGLKFAPGASAHKTRLFNVGDKVITTRPNFHFS 120

Query: 121 YHISGICQFPFHASSIVPHIKDSMPRFCCQASLRASTSFSENRVAEERSSISISSIEMIP 180
            HISGICQFPFHASSIVPH+K+SMPR CCQASLRASTSF ENRVAEERSSIS+SSIE IP
Sbjct: 121 NHISGICQFPFHASSIVPHVKNSMPRLCCQASLRASTSFPENRVAEERSSISVSSIETIP 180

Query: 181 KVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNH 240
           K+DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFDKTSLMEYNH
Sbjct: 181 KIDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNSRRESGYEFDKTSLMEYNH 240

Query: 241 MSFGGPPVTVETIEEADELLRKDERDSTIEAEILAAPPKIVYSKLILRFTRKLLVAVGDG 300
           MSFGGPPVTVETIEEADELLRKDERDSTIEAEILAAPPK+VYSKLILRFTRKLLVAV DG
Sbjct: 241 MSFGGPPVTVETIEEADELLRKDERDSTIEAEILAAPPKMVYSKLILRFTRKLLVAVVDG 300

Query: 301 WDSRALKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCD 360
           WD+RALKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCD
Sbjct: 301 WDNRALKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCD 360

Query: 361 GAAPRIINGCLRTFVKDIKEIDSMPAREK 379
           GAAPRIINGCLRTFVKDIKE DS PAREK
Sbjct: 361 GAAPRIINGCLRTFVKDIKETDSTPAREK 389

BLAST of CsaV3_4G017570.1 vs. TrEMBL
Match: tr|A0A1S4DXQ5|A0A1S4DXQ5_CUCME (uncharacterized protein LOC103491638 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103491638 PE=4 SV=1)

HSP 1 Score: 526.9 bits (1356), Expect = 3.6e-146
Identity = 261/275 (94.91%), Postives = 268/275 (97.45%), Query Frame = 0

Query: 104 PNFHFSYHISGICQFPFHASSIVPHIKDSMPRFCCQASLRASTSFSENRVAEERSSISIS 163
           PNFHFS HISGICQFPFHASSIVPH+K+SMPR CCQASLRASTSF ENRVAEERSSIS+S
Sbjct: 15  PNFHFSNHISGICQFPFHASSIVPHVKNSMPRLCCQASLRASTSFPENRVAEERSSISVS 74

Query: 164 SIEMIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTS 223
           SIE IPK+DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFDKTS
Sbjct: 75  SIETIPKIDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNSRRESGYEFDKTS 134

Query: 224 LMEYNHMSFGGPPVTVETIEEADELLRKDERDSTIEAEILAAPPKIVYSKLILRFTRKLL 283
           LMEYNHMSFGGPPVTVETIEEADELLRKDERDSTIEAEILAAPPK+VYSKLILRFTRKLL
Sbjct: 135 LMEYNHMSFGGPPVTVETIEEADELLRKDERDSTIEAEILAAPPKMVYSKLILRFTRKLL 194

Query: 284 VAVGDGWDSRALKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDL 343
           VAV DGWD+RALKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDL
Sbjct: 195 VAVVDGWDNRALKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDL 254

Query: 344 AKRFCDGAAPRIINGCLRTFVKDIKEIDSMPAREK 379
           AKRFCDGAAPRIINGCLRTFVKDIKE DS PAREK
Sbjct: 255 AKRFCDGAAPRIINGCLRTFVKDIKETDSTPAREK 289

BLAST of CsaV3_4G017570.1 vs. TrEMBL
Match: tr|A0A2N9HGW1|A0A2N9HGW1_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS39050 PE=4 SV=1)

HSP 1 Score: 360.9 bits (925), Expect = 3.5e-96
Identity = 174/229 (75.98%), Postives = 201/229 (87.77%), Query Frame = 0

Query: 150 ENRVAEERSSISISSI-EMIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKR 209
           E  V + + ++S S++  ++PK+DKSG+FCSPRAARELALSIVYAACLEGSDPVRLFEKR
Sbjct: 77  EQVVEKSKDALSTSTLSSLLPKIDKSGRFCSPRAARELALSIVYAACLEGSDPVRLFEKR 136

Query: 210 LNARRESGYEFDKTSLMEYNHMSFGGPPVTVETIEEADELLRKDERDSTIEAEILAAPPK 269
           +NARRE  YEFDK SL+EYNHM+FGGPPVTVET+EEADELLR DE++S IEAE+LAAPPK
Sbjct: 137 MNARREPSYEFDKASLLEYNHMNFGGPPVTVETVEEADELLRNDEKESAIEAEVLAAPPK 196

Query: 270 IVYSKLILRFTRKLLVAVGDGWDSRALKIEKVIPPTWKNKPAGRILELCILHLAMSEITV 329
           + Y+KLILRFTRKLLVAVGD WDS    I+ V+P  WKN+PAGRILELCILHLAMSEITV
Sbjct: 197 LAYNKLILRFTRKLLVAVGDRWDSDVHVIDNVVPSNWKNEPAGRILELCILHLAMSEITV 256

Query: 330 IGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSMPARE 378
           +GTRH IV+NEAVDLAKRFCDGAAPR+INGCLRTFVK ++EI S  A E
Sbjct: 257 LGTRHPIVVNEAVDLAKRFCDGAAPRVINGCLRTFVKGLEEIGSAQASE 305

BLAST of CsaV3_4G017570.1 vs. TrEMBL
Match: tr|M5WJ39|M5WJ39_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_6G001800 PE=4 SV=1)

HSP 1 Score: 359.4 bits (921), Expect = 1.0e-95
Identity = 187/259 (72.20%), Postives = 213/259 (82.24%), Query Frame = 0

Query: 111 HISGICQFPFHASSIVPHIKDSMPRFCCQASLRASTSFSENRVAEERSSISISSIEMIPK 170
           H+ G+ Q P  A+     +  S PR     SLR ST F+ +   E+ +S    S EM+PK
Sbjct: 43  HLLGLIQRPLLAT-----VSLSSPR----TSLRTST-FTLDEALEKPNS---DSREMLPK 102

Query: 171 VDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHM 230
           +DKSG+FCSPRAARELALSIVYAACLEGSDPVRLFEKR+N RRE GYEFD+ SL+EYN M
Sbjct: 103 IDKSGRFCSPRAARELALSIVYAACLEGSDPVRLFEKRMNVRREPGYEFDRASLLEYNPM 162

Query: 231 SFGGPPVTVETIEEADELLRKDERDSTIEAEILAAPPKIVYSKLILRFTRKLLVAVGDGW 290
           SFGGPPVTVET+EEADELLR DE++S IEAE+LAAPPK+VYSKLILRFTRKLLVAV D W
Sbjct: 163 SFGGPPVTVETVEEADELLRNDEKESAIEAEVLAAPPKLVYSKLILRFTRKLLVAVMDKW 222

Query: 291 DSRALKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDG 350
           DS  L I+KV PP WK++PAGRILELCILHLAMSEITV+ TRH IVINEAVDLAKRFCDG
Sbjct: 223 DSHVLVIDKVAPPNWKDEPAGRILELCILHLAMSEITVLETRHPIVINEAVDLAKRFCDG 282

Query: 351 AAPRIINGCLRTFVKDIKE 370
           +APR+INGCLRTFVK I+E
Sbjct: 283 SAPRVINGCLRTFVKGIEE 288

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KGN54018.14.3e-215100.00hypothetical protein Csa_4G268020 [Cucumis sativus][more]
XP_008449897.11.7e-19590.23PREDICTED: uncharacterized protein LOC103491638 isoform X1 [Cucumis melo][more]
XP_004149639.26.5e-171100.00PREDICTED: uncharacterized protein LOC101216754 [Cucumis sativus][more]
XP_016900769.15.5e-14694.91PREDICTED: uncharacterized protein LOC103491638 isoform X2 [Cucumis melo][more]
XP_023007062.11.9e-13083.33uncharacterized protein LOC111499665 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
AT4G26370.14.1e-8771.36antitermination NusB domain-containing protein[more]
Match NameE-valueIdentityDescription
sp|Q18B61|NUSB_PEPD62.6e-0639.19Transcription antitermination protein NusB OS=Peptoclostridium difficile (strain... [more]
sp|A7GWZ7|NUSB_CAMC52.9e-0538.81Transcription antitermination protein NusB OS=Campylobacter curvus (strain 525.9... [more]
sp|B1WXY6|NUSB_CYAA54.9e-0543.75Transcription antitermination protein NusB OS=Cyanothece sp. (strain ATCC 51142)... [more]
sp|B1XIZ3|NUSB_SYNP28.4e-0541.67Transcription antitermination protein NusB OS=Synechococcus sp. (strain ATCC 272... [more]
sp|Q8GIR7|NUSB_SYNE71.9e-0452.08Transcription antitermination protein NusB OS=Synechococcus elongatus (strain PC... [more]
Match NameE-valueIdentityDescription
tr|A0A0A0KWZ5|A0A0A0KWZ5_CUCSA2.8e-215100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G268020 PE=4 SV=1[more]
tr|A0A1S3BNR2|A0A1S3BNR2_CUCME1.1e-19590.23uncharacterized protein LOC103491638 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
tr|A0A1S4DXQ5|A0A1S4DXQ5_CUCME3.6e-14694.91uncharacterized protein LOC103491638 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... [more]
tr|A0A2N9HGW1|A0A2N9HGW1_FAGSY3.5e-9675.98Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS39050 PE=4 SV=1[more]
tr|M5WJ39|M5WJ39_PRUPE1.0e-9572.20Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_6G001800 PE=4 SV=1[more]
The following terms have been associated with this mRNA:
Vocabulary: Biological Process
TermDefinition
GO:0006353DNA-templated transcription, termination
GO:0006355regulation of transcription, DNA-templated
Vocabulary: Molecular Function
TermDefinition
GO:0003723RNA binding
Vocabulary: INTERPRO
TermDefinition
IPR011605NusB_fam
IPR006027NusB_RsmB_TIM44
IPR035926NusB-like_sf
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006353 DNA-templated transcription, termination
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005575 cellular_component
molecular_function GO:0003723 RNA binding

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CsaV3_4G017570CsaV3_4G017570gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CsaV3_4G017570.1CsaV3_4G017570.1-proteinpolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CsaV3_4G017570.1.exon9CsaV3_4G017570.1.exon9exon
CsaV3_4G017570.1.exon8CsaV3_4G017570.1.exon8exon
CsaV3_4G017570.1.exon7CsaV3_4G017570.1.exon7exon
CsaV3_4G017570.1.exon6CsaV3_4G017570.1.exon6exon
CsaV3_4G017570.1.exon5CsaV3_4G017570.1.exon5exon
CsaV3_4G017570.1.exon4CsaV3_4G017570.1.exon4exon
CsaV3_4G017570.1.exon3CsaV3_4G017570.1.exon3exon
CsaV3_4G017570.1.exon2CsaV3_4G017570.1.exon2exon
CsaV3_4G017570.1.exon1CsaV3_4G017570.1.exon1exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CsaV3_4G017570.1.cds8CsaV3_4G017570.1.cds8CDS
CsaV3_4G017570.1.cds7CsaV3_4G017570.1.cds7CDS
CsaV3_4G017570.1.cds6CsaV3_4G017570.1.cds6CDS
CsaV3_4G017570.1.cds5CsaV3_4G017570.1.cds5CDS
CsaV3_4G017570.1.cds4CsaV3_4G017570.1.cds4CDS
CsaV3_4G017570.1.cds3CsaV3_4G017570.1.cds3CDS
CsaV3_4G017570.1.cds2CsaV3_4G017570.1.cds2CDS
CsaV3_4G017570.1.cds1CsaV3_4G017570.1.cds1CDS


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR035926NusB-like superfamilyGENE3DG3DSA:1.10.940.10coord: 249..372
e-value: 2.8E-11
score: 45.5
IPR035926NusB-like superfamilySUPERFAMILYSSF48013NusB-likecoord: 178..213
coord: 271..368
IPR006027NusB/RsmB/TIM44PFAMPF01029NusBcoord: 281..365
e-value: 3.4E-10
score: 40.2
IPR011605NusB antitermination factorPANTHERPTHR11078N UTILIZATION SUBSTANCE PROTEIN B-RELATEDcoord: 167..370