Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AATGGTAAAAATTATATAAATTACGCGGTGAAAATTGATTTAAGTCAATAATTTAGAGGTAAAAATTGAATTTTTTACTATATATTTTTAAAACTCCTTCTGCTTCTCAATTACGCGGGCAAAATCTTGCATCCAAAGCCCATTATTCAAGCGCTCGCCACGGGGATACTAAAAATTCTCTCCACTTCTCTCTCGTTTTGAGTTTCGCTCAGGGGAATAACGCGAGGAATCTCTCTCGGTTCTGTGCGAATTCCGGCTGTCGATGGAGAAAACAATGCCGGAATCGATGGAAGCTACACCGTCTGTGTCTTCAAGCCTCGATCTCCAAGCAGTTCGCAGGTTTCTCTCTGTCTTGTTTCTGTGTGCACTAATTACGTGTAGCAATGTATAATCCGACGTCATTTTTACTTAAACTGAAGTCGCATCAGCGAGCTAGAAGAGTTGCAGAGATCTTTGGAGGAAGATGAAGCTTATTGCACGGATTCATTAGGTTCTGAGAAGTTACTGAAGGAATGCGCTCTCCATCTCGAGGTTTTCTATTGCCGATTATTTTACTTTTCTACATCCCTTACTTTTTCTCTCTCGTGTTGAAAGAAAAACTCGTGGCTCTTATTCTTTGAAATGTGCAGAGCAGGCTGCAGCAGGTTCTGTCAGAATGCTCTAACGTTGATAGTTTCTTGAGGATTGATGATTTAGGTAATTCTGTTAAGCTTACTTATGAATTTACTATGGTTTGAATGCATTTATGTGTGCATTTCACAATTTCGTCCTCCATTCTGTTAGATGAATTTTCCTTCATTTTAGTAACTGCTACTAGATTAATGATTTATGTTTCCTTTCCTGCGCTTTGATCGATTTCTCCAATGTTTTTTTTCCTTCACAACTTTTTAAATCTCTTTCTAGATGCATATGTGGAACACATGAAAGAGGAACTCGTTGCGGTGGAAGCTGAAAGCAGCAAAATCTCTAATGAGATAGAGGTTCTTAAGAGAACCAATATAGAAGGCAATCTCTTCGTTTATGTTTTGATTTATACTTTTTAGCCTCCTATTTGTACTGACGTCCTTCTTTTGCTTCCGATCACAGGTTCTAATAAATTAGAGGTGGATCTCGAATTATTAAATGTGTCGTTAGATCGTTTTACATCACAGGTTAGTCTCCATTGCAAAACAGTTTGTACGACACCAACAGTTACTAGACTAGGATGAGCGGAGGAAGACAATGATTTAACTTAAATTTTCTTAAAAGTTGCCTGACTTAAATGTTGAAAAGAATTAAATCCAACTATGGAAGGTGTTTGTGACTTGTATTCATTTCAGTAATTGGGCGAATGAGAAATTAGGGCAAAAACCAGATGGATGAAGGTCGAGGAAGTGTAATATCTGAAATTTTTAGACGAACTCATATTATGTAGGCTCTTGCTCCTTACGTTATTTAAAGAAAATAAACTGTACAATATATTGGATTAGATATTCTGCCAAGCTGTTTCGAGACATTACTACTAATGTTTGATTTTTCCTTCCGTCTTTGTTGTTGTTGATTTTTTTTTTTTTTNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGCAATGATTTAAGTTCACTTTTTCCTTATTTTCAATTTGATTTTGAGGTTTGATTTTCTCTATTATTTTTGGCGTATGATCAACATAGCTTGTTTATAGGCCTAATAAAAGTTCCTCCCTTGTGAGAATGTCTGTCACCATGTTCTCTTCATTACAATCTTGCAGGATCCTGAGAATGAAACATTTAATTTCTGCTCTATGAACGGTGAAGACCAAATGAACGTGATAGTTGACCCTGAATGCAATGCTTTTGAGGTCTTTTCTATTACCTTATTACTTCTAACTTTGTTTTTATTGTCACATGTTCTTTTTATTCAATTTTGATCAATTATAAGCACGTTTCTAGGGAGCAGTCGATCTAATGAGCTTAATTAAGCCATGGTTCCATTTACTGTTTCAATCATGGTATTGTAGCTCAATGAAAAACTATATGTGGTTTGACTAATTCTGAAGTTGTTCTTTTCTTCGAAACTCTTTATGCTTCTTCCAGGTTTTGGAACTTGATAGTCATATTGAGAAGAATAAAAGAATCCTGAAATCTTTGCAGGAAGTAGATGAGATATTTAAAAGGTAATATGTGTTTTGAGGTTTTAATTTTTGTATCCAAATCTCCCGACCTTCAAACCTCTTCAGTTATGCAACTTAGTTATGACTTTAAACCACTTCTCCCGATCCAATTGTTCATCTTGCCCTCAGGTTTGAAATATCATAAATCTGAAAATTTTCTCTTTGTTGAATCCTCCTTCCCTCGGAGCATGAAACCTTCGTTTTATAAAATGGATTATGATTATGCCACGATCCTAAAGGGAGTTGCCATGGCGCTGCCCTCGATTTAGGCAGAAATTAAAATTTTGGAGTGCCATGACCTCATAAGGGAATTGTCATGGTGTTGTGCTGTAGCATCACTGCGCCTTCTTGTAGTGCTCTTTTCTTGCCTTTTCTTTGTTCTGCAGAGCCCCACGATTCTGTAACAACCACCTAGGCACCATTCTCTTGCCGCCTTTGTATGGTCTTCTTCTTGCACACAGGCTGATTTTTCTCTAAATATCTCTTAAAACCTTCACTTTCTCTACTTGTTTTTGTTCTACACAATAAAAATTCACGTGCACAAATCTGACACAATCAACACAAAACTAACAGTAATCAAGTTATAATCCTTATGAATTATAGCCCACTAAGTGCTATCAATCTAGTAATGACTTCTCTCTCCCTCCCTCCTCATAACTCTCTCCCTCTAGTTTGGATGTTGTTGAACAGGTTGAGGACACAATTGGAGGTCTGAAGGTCATTGATGTTGCTGATAATTTCATTAGATTGTCATTACGTACACATATTCCAAACTTGGAAGATTTTTCAAGCTTACAGAAACTTGAAGGAATGATCGAGCCATCGGAATTGAATCACGAGTTGCTAATAGAAGTTTTGGAAGGAACAATGGAGCTAAAGAATGCCGAGGTATCCCTTTAATTGGTTATGTATCGTAACTAATTGTATCCACGGTTATCATGGTCTAGTTCTTATTAGAGTTTAAAATCCAAGTAATTCCATCTGTTGGCACCTCTCTCTCTCCCATCAATGTAAATTCTGTTGACGGAGCTTAATTCATTAGTATTTTCACAGATCTTTCCTGGTGATGTCCACTTGCACGATATCATCAATGCTTCAAAGTCAGTCAGGTTTGTTTTCTATAAGATTCATGTATACTCACACTGTACTCAAGAATAATTGAAGAGGTTGGTCTTTTATTTTACAGTAGTATAGCACTGACAATGTTACATACTTATAGTCTTATTCGTTTACATATTCTTGTTAGATGTCAATTTGATCTACCTGCTTTATTAACTGGCGTTGTACAACTTTCACCAATTATATTGGTTTGCTGGAATCATAGATGAAACTGTTTGAGTTGAATTGCAATGACATGAGGGATAATAATATGTAACTCAAAAAAGCATATAATAGTGACATCACCTCTATTTATACGTAATTGTAATGATTTGGTCATAAATTTTCCAGACTTTTTACTTATTGAACAGAATGCAATTTATCTGACCCGTTCATTTTGCTTTTGACTGAGCAGCAATTCATTGGAATGGTTTGTGAAAAAAGTACAAGATAGAATTGTTTTGTGTACTCTTAGGCGATTTGTTGTGAAGAGTGCAAACAAGTCAAGGTGAGACTCGTAAAATTTTCAGATCTTTTGTATAATTAGAAGTAATGTATAAGATAGGTTTTTCTTTTTTGTCTTTTTTCATTTTGTGGAGTACTCTCTATGTCCATAAAGGAATTTCCCAACCCATATTTCCCATTTTCGAAAATGACTCGGGATGGGGAATGCACTATCTTTCGCTTCCCTTACTTCCCTCTTTTCTCCTTCCTTCGTTCCTAGCCATCCAGTTAGATATGTACAGAAAATTCTTTCCGCTGGGATCATACTTTCTCCTTTCGAGTCACTTTTGGCTACCATTGAATTCTTGGTAGATAGGGTTTCCTAGGCCCCTAGGCTGGGATAGGTTGCTTTCAGTCCTCCTTCCTATACAACCGCTTAACAGCCTTTGCATACAAGTAGATCGCTATGCACGACTTCTTGGTAAGACTATCAGCAGCCACTAATCATCATTATTGCCTTACTTCATCATCGTATTTGAGAGTAAAGTAATGTAATTCTATATTTTAGTTGTTTTTGGGACTGAAATGCTTACTCTTGATGCAGTCATTCCTTTGATTACATAGACCAAGACGAAACGATAGTATGTTCTATGATTGGAGGGATTGATGCGTTTATTAAGGTGTCTCAAGGCTGGCCACTAGCCGATTCTCCACTGAAACTTGTATCACTCAAGAGCTCAGACCATTATACAAAAGGTGCTTCCTTAAGCCTCGTTTGCAAGGTGGAGGTAAGATGGATATTTTTAGTTTTTGTTCATTGTCTTTGCCTCTCTTTGATACCTAAAAAGGGCTTTGGTAAATCAAATTATTTTTACTTTTTTATTTTTATTTTTCTATGGTTATCATTAGGGTATTACGTGCATATTCATAATCCGATTATGTTTGCACCCTTCAGAAAATGGCAAATTCCTTGGACGCACGTATTCGCCAAAATCTATCAAGCTTTGCAGACGCTGTTGAAAAAATATTGAAGGAGCAAATGCATTTAGAACTCCAAGCTGACAGTGGTCTTTGACGATTAAGAACTTTGGTTCATCATGCAATTCAGGTTTCTCAATTCTACATCCTCTACTAGTATAAGTATCACGTGATATTGCTGTTGATGATTTTTCATGCCGAAAATTTTAGCTCGATTATTGATTGCTATTATTATTATTATTTGCCCTTTTGTGTGTATAGGCTTAAAATATTGTTGTTTTGTTTATTTATTGTTATTATTTTTGTATAATTGGGATATCCAAAGCCCATCCCAGTAGTTTATAGAGGACTTGGGGAGGCGACAAGTGGTTAATTTTAGGCTAAATCATAAAAATAAAAATAAAAAGTCC
mRNA sequence
AATGGTAAAAATTATATAAATTACGCGGTGAAAATTGATTTAAGTCAATAATTTAGAGGTAAAAATTGAATTTTTTACTATATATTTTTAAAACTCCTTCTGCTTCTCAATTACGCGGGCAAAATCTTGCATCCAAAGCCCATTATTCAAGCGCTCGCCACGGGGATACTAAAAATTCTCTCCACTTCTCTCTCGTTTTGAGTTTCGCTCAGGGGAATAACGCGAGGAATCTCTCTCGGTTCTGTGCGAATTCCGGCTGTCGATGGAGAAAACAATGCCGGAATCGATGGAAGCTACACCGTCTGTGTCTTCAAGCCTCGATCTCCAAGCAGTTCGCAGCTAGAAGAGTTGCAGAGATCTTTGGAGGAAGATGAAGCTTATTGCACGGATTCATTAGGTTCTGAGAAGTTACTGAAGGAATGCGCTCTCCATCTCGAGAGCAGGCTGCAGCAGGTTCTGTCAGAATGCTCTAACGTTGATAGTTTCTTGAGGATTGATGATTTAGATGCATATGTGGAACACATGAAAGAGGAACTCGTTGCGGTGGAAGCTGAAAGCAGCAAAATCTCTAATGAGATAGAGGTGGATCTCGAATTATTAAATGTGTCGTTAGATCGTTTTACATCACAGGATCCTGAGAATGAAACATTTAATTTCTGCTCTATGAACGGTGAAGACCAAATGAACGTGATAGTTGACCCTGAATGCAATGCTTTTGAGGTTTTGGAACTTGATAGTCATATTGAGAAGAATAAAAGAATCCTGAAATCTTTGCAGGAAGTAGATGAGATATTTAAAAGTTTGGATGTTGTTGAACAGGTTGAGGACACAATTGGAGGTCTGAAGGTCATTGATGTTGCTGATAATTTCATTAGATTGTCATTACGTACACATATTCCAAACTTGGAAGATTTTTCAAGCTTACAGAAACTTGAAGGAATGATCGAGCCATCGGAATTGAATCACGAGTTGCTAATAGAAGTTTTGGAAGGAACAATGGAGCTAAAGAATGCCGAGATCTTTCCTGGTGATGTCCACTTGCACGATATCATCAATGCTTCAAAGTCAGTCAGCAATTCATTGGAATGGTTTGTGAAAAAAGTACAAGATAGAATTGTTTTGTGTACTCTTAGGCGATTTGTTGTGAAGAGTGCAAACAAGTCAAGTCATTCCTTTGATTACATAGACCAAGACGAAACGATAGTATGTTCTATGATTGGAGGGATTGATGCGTTTATTAAGGTGTCTCAAGGCTGGCCACTAGCCGATTCTCCACTGAAACTTGTATCACTCAAGAGCTCAGACCATTATACAAAAGGTGCTTCCTTAAGCCTCGTTTGCAAGGTGGAGAAAATGGCAAATTCCTTGGACGCACGTATTCGCCAAAATCTATCAAGCTTTGCAGACGCTGTTGAAAAAATATTGAAGGAGCAAATGCATTTAGAACTCCAAGCTGACAGTGGTCTTTGACGATTAAGAACTTTGGTTCATCATGCAATTCAGGTTTCTCAATTCTACATCCTCTACTAGTATAAGTATCACGTGATATTGCTGTTGATGATTTTTCATGCCGAAAATTTTAGCTCGATTATTGATTGCTATTATTATTATTATTTGCCCTTTTGTGTGTATAGGCTTAAAATATTGTTGTTTTGTTTATTTATTGTTATTATTTTTGTATAATTGGGATATCCAAAGCCCATCCCAGTAGTTTATAGAGGACTTGGGGAGGCGACAAGTGGTTAATTTTAGGCTAAATCATAAAAATAAAAATAAAAAGTCC
Coding sequence (CDS)
ATGAAAGAGGAACTCGTTGCGGTGGAAGCTGAAAGCAGCAAAATCTCTAATGAGATAGAGGTGGATCTCGAATTATTAAATGTGTCGTTAGATCGTTTTACATCACAGGATCCTGAGAATGAAACATTTAATTTCTGCTCTATGAACGGTGAAGACCAAATGAACGTGATAGTTGACCCTGAATGCAATGCTTTTGAGGTTTTGGAACTTGATAGTCATATTGAGAAGAATAAAAGAATCCTGAAATCTTTGCAGGAAGTAGATGAGATATTTAAAAGTTTGGATGTTGTTGAACAGGTTGAGGACACAATTGGAGGTCTGAAGGTCATTGATGTTGCTGATAATTTCATTAGATTGTCATTACGTACACATATTCCAAACTTGGAAGATTTTTCAAGCTTACAGAAACTTGAAGGAATGATCGAGCCATCGGAATTGAATCACGAGTTGCTAATAGAAGTTTTGGAAGGAACAATGGAGCTAAAGAATGCCGAGATCTTTCCTGGTGATGTCCACTTGCACGATATCATCAATGCTTCAAAGTCAGTCAGCAATTCATTGGAATGGTTTGTGAAAAAAGTACAAGATAGAATTGTTTTGTGTACTCTTAGGCGATTTGTTGTGAAGAGTGCAAACAAGTCAAGTCATTCCTTTGATTACATAGACCAAGACGAAACGATAGTATGTTCTATGATTGGAGGGATTGATGCGTTTATTAAGGTGTCTCAAGGCTGGCCACTAGCCGATTCTCCACTGAAACTTGTATCACTCAAGAGCTCAGACCATTATACAAAAGGTGCTTCCTTAAGCCTCGTTTGCAAGGTGGAGAAAATGGCAAATTCCTTGGACGCACGTATTCGCCAAAATCTATCAAGCTTTGCAGACGCTGTTGAAAAAATATTGAAGGAGCAAATGCATTTAGAACTCCAAGCTGACAGTGGTCTTTGA
Protein sequence
MKEELVAVEAESSKISNEIEVDLELLNVSLDRFTSQDPENETFNFCSMNGEDQMNVIVDPECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGGLKVIDVADNFIRLSLRTHIPNLEDFSSLQKLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDETIVCSMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVEKILKEQMHLELQADSGL
Homology
BLAST of Cp4.1LG03g18000 vs. NCBI nr
Match:
XP_023528068.1 (uncharacterized protein LOC111791098 isoform X2 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 597 bits (1539), Expect = 4.90e-213
Identity = 315/329 (95.74%), Postives = 315/329 (95.74%), Query Frame = 0
Query: 1 MKEELVAVEAESSKISNEIEV--------------DLELLNVSLDRFTSQDPENETFNFC 60
MKEELVAVEAESSKISNEIEV DLELLNVSLDRFTSQDPENETFNFC
Sbjct: 89 MKEELVAVEAESSKISNEIEVLKRTNIEGSNKLEVDLELLNVSLDRFTSQDPENETFNFC 148
Query: 61 SMNGEDQMNVIVDPECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG 120
SMNGEDQMNVIVDPECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG
Sbjct: 149 SMNGEDQMNVIVDPECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG 208
Query: 121 LKVIDVADNFIRLSLRTHIPNLEDFSSLQKLEGMIEPSELNHELLIEVLEGTMELKNAEI 180
LKVIDVADNFIRLSLRTHIPNLEDFSSLQKLEGMIEPSELNHELLIEVLEGTMELKNAEI
Sbjct: 209 LKVIDVADNFIRLSLRTHIPNLEDFSSLQKLEGMIEPSELNHELLIEVLEGTMELKNAEI 268
Query: 181 FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET 240
FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET
Sbjct: 269 FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET 328
Query: 241 IVCSMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARI 300
IVCSMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARI
Sbjct: 329 IVCSMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARI 388
Query: 301 RQNLSSFADAVEKILKEQMHLELQADSGL 315
RQNLSSFADAVEKILKEQMHLELQADSGL
Sbjct: 389 RQNLSSFADAVEKILKEQMHLELQADSGL 417
BLAST of Cp4.1LG03g18000 vs. NCBI nr
Match:
XP_023528067.1 (uncharacterized protein LOC111791098 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023528069.1 uncharacterized protein LOC111791098 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 597 bits (1539), Expect = 5.47e-213
Identity = 315/329 (95.74%), Postives = 315/329 (95.74%), Query Frame = 0
Query: 1 MKEELVAVEAESSKISNEIEV--------------DLELLNVSLDRFTSQDPENETFNFC 60
MKEELVAVEAESSKISNEIEV DLELLNVSLDRFTSQDPENETFNFC
Sbjct: 92 MKEELVAVEAESSKISNEIEVLKRTNIEGSNKLEVDLELLNVSLDRFTSQDPENETFNFC 151
Query: 61 SMNGEDQMNVIVDPECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG 120
SMNGEDQMNVIVDPECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG
Sbjct: 152 SMNGEDQMNVIVDPECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG 211
Query: 121 LKVIDVADNFIRLSLRTHIPNLEDFSSLQKLEGMIEPSELNHELLIEVLEGTMELKNAEI 180
LKVIDVADNFIRLSLRTHIPNLEDFSSLQKLEGMIEPSELNHELLIEVLEGTMELKNAEI
Sbjct: 212 LKVIDVADNFIRLSLRTHIPNLEDFSSLQKLEGMIEPSELNHELLIEVLEGTMELKNAEI 271
Query: 181 FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET 240
FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET
Sbjct: 272 FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET 331
Query: 241 IVCSMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARI 300
IVCSMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARI
Sbjct: 332 IVCSMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARI 391
Query: 301 RQNLSSFADAVEKILKEQMHLELQADSGL 315
RQNLSSFADAVEKILKEQMHLELQADSGL
Sbjct: 392 RQNLSSFADAVEKILKEQMHLELQADSGL 420
BLAST of Cp4.1LG03g18000 vs. NCBI nr
Match:
XP_022980354.1 (uncharacterized protein LOC111479744 [Cucurbita maxima] >XP_022980355.1 uncharacterized protein LOC111479744 [Cucurbita maxima] >XP_022980356.1 uncharacterized protein LOC111479744 [Cucurbita maxima])
HSP 1 Score: 582 bits (1501), Expect = 1.19e-208
Identity = 308/329 (93.62%), Postives = 311/329 (94.53%), Query Frame = 0
Query: 1 MKEELVAVEAESSKISNEIEV--------------DLELLNVSLDRFTSQDPENETFNFC 60
MKEELVAVEAESSKISNEIEV DLELLNVSLDRFTSQDPENETFNFC
Sbjct: 1 MKEELVAVEAESSKISNEIEVLKSTNIEGSNKLEVDLELLNVSLDRFTSQDPENETFNFC 60
Query: 61 SMNGEDQMNVIVDPECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG 120
SMNGEDQMNVIVD ECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG
Sbjct: 61 SMNGEDQMNVIVDRECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG 120
Query: 121 LKVIDVADNFIRLSLRTHIPNLEDFSSLQKLEGMIEPSELNHELLIEVLEGTMELKNAEI 180
LKVIDVADNFIRLSLRTHIPNLEDFSSLQ+LEGMIEPSELNHELLIEVLEGTMELKNAEI
Sbjct: 121 LKVIDVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEI 180
Query: 181 FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET 240
FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET
Sbjct: 181 FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET 240
Query: 241 IVCSMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARI 300
IVC MIG IDAFIKVSQGWPLADSPLKLVSLK+SDHYTKGASLSLVCKVEKMANSLDARI
Sbjct: 241 IVCYMIGRIDAFIKVSQGWPLADSPLKLVSLKNSDHYTKGASLSLVCKVEKMANSLDARI 300
Query: 301 RQNLSSFADAVEKILKEQMHLELQADSGL 315
RQNLSSFADAV+ ILKEQMHLELQADSGL
Sbjct: 301 RQNLSSFADAVKNILKEQMHLELQADSGL 329
BLAST of Cp4.1LG03g18000 vs. NCBI nr
Match:
KAG6582484.1 (hypothetical protein SDJN03_22486, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 578 bits (1491), Expect = 1.11e-205
Identity = 307/329 (93.31%), Postives = 309/329 (93.92%), Query Frame = 0
Query: 1 MKEELVAVEAESSKISNEIEV--------------DLELLNVSLDRFTSQDPENETFNFC 60
MKEELVAVEAESSKISNEIEV DLELL+VSLDRFTSQD E ETFNFC
Sbjct: 92 MKEELVAVEAESSKISNEIEVLKRTNIEGSNKLEVDLELLDVSLDRFTSQDTEKETFNFC 151
Query: 61 SMNGEDQMNVIVDPECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG 120
SMNGEDQMNVIVD ECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG
Sbjct: 152 SMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG 211
Query: 121 LKVIDVADNFIRLSLRTHIPNLEDFSSLQKLEGMIEPSELNHELLIEVLEGTMELKNAEI 180
LKVI VADNFIRLSLRTHIPNLEDFSSLQ+LEGMIEPSELNHELLIEVLEGTMELKNAEI
Sbjct: 212 LKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEI 271
Query: 181 FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET 240
FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET
Sbjct: 272 FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET 331
Query: 241 IVCSMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARI 300
IVC MIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARI
Sbjct: 332 IVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARI 391
Query: 301 RQNLSSFADAVEKILKEQMHLELQADSGL 315
RQNLSSFADAVEKILKEQMHLELQAD GL
Sbjct: 392 RQNLSSFADAVEKILKEQMHLELQADGGL 420
BLAST of Cp4.1LG03g18000 vs. NCBI nr
Match:
XP_022924674.1 (uncharacterized protein LOC111432106 isoform X3 [Cucurbita moschata])
HSP 1 Score: 573 bits (1476), Expect = 2.13e-203
Identity = 304/329 (92.40%), Postives = 309/329 (93.92%), Query Frame = 0
Query: 1 MKEELVAVEAESSKISNEIEV--------------DLELLNVSLDRFTSQDPENETFNFC 60
MKEELVAVEAESS+ISNEIEV +LELL+VSLDRFTSQDPE ETFNFC
Sbjct: 92 MKEELVAVEAESSQISNEIEVLKRTNIEGSNKLEVNLELLDVSLDRFTSQDPEKETFNFC 151
Query: 61 SMNGEDQMNVIVDPECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG 120
SMNGEDQMNVIVD E NAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG
Sbjct: 152 SMNGEDQMNVIVDRERNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG 211
Query: 121 LKVIDVADNFIRLSLRTHIPNLEDFSSLQKLEGMIEPSELNHELLIEVLEGTMELKNAEI 180
LKVI VADNFIRLSLRTHIPNLEDFSSLQ+LEGMIEPSELNHELLIEVLEGTMELKNAEI
Sbjct: 212 LKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEI 271
Query: 181 FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET 240
FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET
Sbjct: 272 FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET 331
Query: 241 IVCSMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARI 300
IVC MIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARI
Sbjct: 332 IVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARI 391
Query: 301 RQNLSSFADAVEKILKEQMHLELQADSGL 315
RQNLSSFADAVEKILKEQMHLEL+AD GL
Sbjct: 392 RQNLSSFADAVEKILKEQMHLELEADGGL 420
BLAST of Cp4.1LG03g18000 vs. ExPASy TrEMBL
Match:
A0A6J1ITD7 (uncharacterized protein LOC111479744 OS=Cucurbita maxima OX=3661 GN=LOC111479744 PE=4 SV=1)
HSP 1 Score: 582 bits (1501), Expect = 5.78e-209
Identity = 308/329 (93.62%), Postives = 311/329 (94.53%), Query Frame = 0
Query: 1 MKEELVAVEAESSKISNEIEV--------------DLELLNVSLDRFTSQDPENETFNFC 60
MKEELVAVEAESSKISNEIEV DLELLNVSLDRFTSQDPENETFNFC
Sbjct: 1 MKEELVAVEAESSKISNEIEVLKSTNIEGSNKLEVDLELLNVSLDRFTSQDPENETFNFC 60
Query: 61 SMNGEDQMNVIVDPECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG 120
SMNGEDQMNVIVD ECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG
Sbjct: 61 SMNGEDQMNVIVDRECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG 120
Query: 121 LKVIDVADNFIRLSLRTHIPNLEDFSSLQKLEGMIEPSELNHELLIEVLEGTMELKNAEI 180
LKVIDVADNFIRLSLRTHIPNLEDFSSLQ+LEGMIEPSELNHELLIEVLEGTMELKNAEI
Sbjct: 121 LKVIDVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEI 180
Query: 181 FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET 240
FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET
Sbjct: 181 FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET 240
Query: 241 IVCSMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARI 300
IVC MIG IDAFIKVSQGWPLADSPLKLVSLK+SDHYTKGASLSLVCKVEKMANSLDARI
Sbjct: 241 IVCYMIGRIDAFIKVSQGWPLADSPLKLVSLKNSDHYTKGASLSLVCKVEKMANSLDARI 300
Query: 301 RQNLSSFADAVEKILKEQMHLELQADSGL 315
RQNLSSFADAV+ ILKEQMHLELQADSGL
Sbjct: 301 RQNLSSFADAVKNILKEQMHLELQADSGL 329
BLAST of Cp4.1LG03g18000 vs. ExPASy TrEMBL
Match:
A0A6J1E9V8 (uncharacterized protein LOC111432106 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111432106 PE=4 SV=1)
HSP 1 Score: 573 bits (1476), Expect = 1.03e-203
Identity = 304/329 (92.40%), Postives = 309/329 (93.92%), Query Frame = 0
Query: 1 MKEELVAVEAESSKISNEIEV--------------DLELLNVSLDRFTSQDPENETFNFC 60
MKEELVAVEAESS+ISNEIEV +LELL+VSLDRFTSQDPE ETFNFC
Sbjct: 92 MKEELVAVEAESSQISNEIEVLKRTNIEGSNKLEVNLELLDVSLDRFTSQDPEKETFNFC 151
Query: 61 SMNGEDQMNVIVDPECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG 120
SMNGEDQMNVIVD E NAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG
Sbjct: 152 SMNGEDQMNVIVDRERNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG 211
Query: 121 LKVIDVADNFIRLSLRTHIPNLEDFSSLQKLEGMIEPSELNHELLIEVLEGTMELKNAEI 180
LKVI VADNFIRLSLRTHIPNLEDFSSLQ+LEGMIEPSELNHELLIEVLEGTMELKNAEI
Sbjct: 212 LKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEI 271
Query: 181 FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET 240
FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET
Sbjct: 272 FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET 331
Query: 241 IVCSMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARI 300
IVC MIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARI
Sbjct: 332 IVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARI 391
Query: 301 RQNLSSFADAVEKILKEQMHLELQADSGL 315
RQNLSSFADAVEKILKEQMHLEL+AD GL
Sbjct: 392 RQNLSSFADAVEKILKEQMHLELEADGGL 420
BLAST of Cp4.1LG03g18000 vs. ExPASy TrEMBL
Match:
A0A6J1ED63 (uncharacterized protein LOC111432106 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111432106 PE=4 SV=1)
HSP 1 Score: 573 bits (1476), Expect = 1.07e-203
Identity = 304/329 (92.40%), Postives = 309/329 (93.92%), Query Frame = 0
Query: 1 MKEELVAVEAESSKISNEIEV--------------DLELLNVSLDRFTSQDPENETFNFC 60
MKEELVAVEAESS+ISNEIEV +LELL+VSLDRFTSQDPE ETFNFC
Sbjct: 93 MKEELVAVEAESSQISNEIEVLKRTNIEGSNKLEVNLELLDVSLDRFTSQDPEKETFNFC 152
Query: 61 SMNGEDQMNVIVDPECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG 120
SMNGEDQMNVIVD E NAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG
Sbjct: 153 SMNGEDQMNVIVDRERNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG 212
Query: 121 LKVIDVADNFIRLSLRTHIPNLEDFSSLQKLEGMIEPSELNHELLIEVLEGTMELKNAEI 180
LKVI VADNFIRLSLRTHIPNLEDFSSLQ+LEGMIEPSELNHELLIEVLEGTMELKNAEI
Sbjct: 213 LKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEI 272
Query: 181 FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET 240
FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET
Sbjct: 273 FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET 332
Query: 241 IVCSMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARI 300
IVC MIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARI
Sbjct: 333 IVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARI 392
Query: 301 RQNLSSFADAVEKILKEQMHLELQADSGL 315
RQNLSSFADAVEKILKEQMHLEL+AD GL
Sbjct: 393 RQNLSSFADAVEKILKEQMHLELEADGGL 421
BLAST of Cp4.1LG03g18000 vs. ExPASy TrEMBL
Match:
A0A6J1E9M5 (uncharacterized protein LOC111432106 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111432106 PE=4 SV=1)
HSP 1 Score: 573 bits (1476), Expect = 1.19e-203
Identity = 304/329 (92.40%), Postives = 309/329 (93.92%), Query Frame = 0
Query: 1 MKEELVAVEAESSKISNEIEV--------------DLELLNVSLDRFTSQDPENETFNFC 60
MKEELVAVEAESS+ISNEIEV +LELL+VSLDRFTSQDPE ETFNFC
Sbjct: 96 MKEELVAVEAESSQISNEIEVLKRTNIEGSNKLEVNLELLDVSLDRFTSQDPEKETFNFC 155
Query: 61 SMNGEDQMNVIVDPECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG 120
SMNGEDQMNVIVD E NAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG
Sbjct: 156 SMNGEDQMNVIVDRERNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG 215
Query: 121 LKVIDVADNFIRLSLRTHIPNLEDFSSLQKLEGMIEPSELNHELLIEVLEGTMELKNAEI 180
LKVI VADNFIRLSLRTHIPNLEDFSSLQ+LEGMIEPSELNHELLIEVLEGTMELKNAEI
Sbjct: 216 LKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEI 275
Query: 181 FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET 240
FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET
Sbjct: 276 FPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDET 335
Query: 241 IVCSMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARI 300
IVC MIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARI
Sbjct: 336 IVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARI 395
Query: 301 RQNLSSFADAVEKILKEQMHLELQADSGL 315
RQNLSSFADAVEKILKEQMHLEL+AD GL
Sbjct: 396 RQNLSSFADAVEKILKEQMHLELEADGGL 424
BLAST of Cp4.1LG03g18000 vs. ExPASy TrEMBL
Match:
A0A5A7U6L2 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold171G003710 PE=4 SV=1)
HSP 1 Score: 520 bits (1339), Expect = 5.88e-183
Identity = 274/329 (83.28%), Postives = 294/329 (89.36%), Query Frame = 0
Query: 1 MKEELVAVEAESSKISNEIEV--------------DLELLNVSLDRFTSQDPENETFNFC 60
MKEELVAVEAESSKISNEIEV DLE+L +SLDRF SQDPE TFN
Sbjct: 86 MKEELVAVEAESSKISNEIEVLKRTTIEDSNKLKMDLEVLKLSLDRFASQDPEEATFNCS 145
Query: 61 SMNGEDQMNVIVDPECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG 120
SMNGED+MNVIVD ECNAFEVLEL+S IEKNK+ILKSLQEVDEIFKSLDV+EQVE TIGG
Sbjct: 146 SMNGEDRMNVIVDRECNAFEVLELESQIEKNKKILKSLQEVDEIFKSLDVIEQVEGTIGG 205
Query: 121 LKVIDVADNFIRLSLRTHIPNLEDFSSLQKLEGMIEPSELNHELLIEVLEGTMELKNAEI 180
+KVIDVADN IRLSL THIPN+EDFS+LQ+LEG+IE SEL+HEL+IEV GTMELKNAEI
Sbjct: 206 MKVIDVADNSIRLSLHTHIPNVEDFSTLQRLEGLIEKSELDHELIIEVSNGTMELKNAEI 265
Query: 181 FPGDVHLHDIINASKSVSNS-LEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDE 240
FP DVHLHDIINASKS+SNS LEWFV+KVQDRIVLCTLRRF VKSANKSSHSF+Y+DQDE
Sbjct: 266 FPADVHLHDIINASKSISNSSLEWFVRKVQDRIVLCTLRRFAVKSANKSSHSFEYLDQDE 325
Query: 241 TIVCSMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDAR 300
I+CSMIGGIDA IKVSQGWPLADSPLKL+SLKSSDHYTKG SLSL+CKVEKMANSLD R
Sbjct: 326 MIMCSMIGGIDACIKVSQGWPLADSPLKLISLKSSDHYTKGISLSLICKVEKMANSLDGR 385
Query: 301 IRQNLSSFADAVEKILKEQMHLELQADSG 314
IRQNLSSFADAVEKILKEQMHLELQADSG
Sbjct: 386 IRQNLSSFADAVEKILKEQMHLELQADSG 414
BLAST of Cp4.1LG03g18000 vs. TAIR 10
Match:
AT3G23910.1 (BEST Arabidopsis thaliana protein match is: RNA-directed DNA polymerase (reverse transcriptase)-related family protein (TAIR:AT3G24255.2); Has 562 Blast hits to 532 proteins in 147 species: Archae - 28; Bacteria - 51; Metazoa - 157; Fungi - 82; Plants - 85; Viruses - 6; Other Eukaryotes - 153 (source: NCBI BLink). )
HSP 1 Score: 275.0 bits (702), Expect = 7.4e-74
Identity = 157/338 (46.45%), Postives = 219/338 (64.79%), Query Frame = 0
Query: 1 MKEELVAVEAESSKISNEIE--------------VDLELLNVSLDRFTSQDPENETFNFC 60
++ EL +VEAES+K+S EIE DLE L +SLD +SQD E N
Sbjct: 82 LRNELQSVEAESAKVSEEIERLSQSHAQDSSRLQRDLEGLLLSLDSMSSQDVEKSKENQP 141
Query: 61 SMNGEDQMNVIVDPECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG 120
S + + VI D + F++ EL++ +E+ + ILKSL+++D + K D EQVED + G
Sbjct: 142 SSSSMEVCEVIDD---DKFKMFELENQMEEKRMILKSLEDLDSLRKRFDAAEQVEDALTG 201
Query: 121 LKVIDVADNFIRLSLRTHIPNLEDFSSLQKLEGMIEPSELNHELLIEVLEGTMELKNAEI 180
LKV++ NFIRL LRT+I L+ F K + + EPSEL HELLI + + T E+ E+
Sbjct: 202 LKVLEFDGNFIRLQLRTYIQKLDGFLGQHKFDHITEPSELIHELLIYLKDKTTEITKFEM 261
Query: 181 FPGDVHLHDIINASKS------------VSNSLEWFVKKVQDRIVLCTLRRFVVKSANKS 240
FP D+++ DII A+ S +S++W V KVQD+I+ TLR+++V S+
Sbjct: 262 FPNDIYIGDIIEAADSFRQVRLHSAVLDTRSSVQWVVAKVQDKIISTTLRKYIVMSSKTI 321
Query: 241 SHSFDYIDQDETIVCSMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCK 300
++F+Y D+DETIV + GGIDAF+KVS GWPL ++PLKL SLK+SD+ +KG SLSL+CK
Sbjct: 322 RYTFEYYDKDETIVAHIAGGIDAFLKVSDGWPLLNTPLKLASLKNSDNQSKGISLSLICK 381
Query: 301 VEKMANSLDARIRQNLSSFADAVEKILKEQMHLELQAD 313
VE++ANSLD RQNLS F DA+EKIL EQ ELQ++
Sbjct: 382 VEELANSLDLETRQNLSGFMDAIEKILVEQTREELQSN 416
BLAST of Cp4.1LG03g18000 vs. TAIR 10
Match:
AT3G24255.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )
HSP 1 Score: 263.5 bits (672), Expect = 2.2e-70
Identity = 154/338 (45.56%), Postives = 216/338 (63.91%), Query Frame = 0
Query: 1 MKEELVAVEAESSKISNEIE--------------VDLELLNVSLDRFTSQDPENETFNFC 60
++ EL +VEAES+K+S EIE DLE L +SLD +SQD E N
Sbjct: 407 LRNELQSVEAESAKVSEEIERLSQSHALDSSRLQRDLEGLLLSLDSMSSQDVEKSKENQP 466
Query: 61 SMNGEDQMNVIVDPECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG 120
S + + VI D + F++ EL++ +E+ + ILKSL+++D + K D EQVED + G
Sbjct: 467 SSSSMEVCEVIDD---DKFKMFELENQMEEKRMILKSLEDLDSLRKRFDAAEQVEDALTG 526
Query: 121 LKVIDVADNFIRLSLRTHIPNLEDFSSLQKLEGMIEPSELNHELLIEVLEGTMELKNAEI 180
LKV++ NFIRL LRT+I L+ F K + + EPSEL HELLI + + T E+ E+
Sbjct: 527 LKVLEFDGNFIRLQLRTYIQKLDGFLGQHKFDHITEPSELIHELLIYLKDKTTEITKFEM 586
Query: 181 FPGDVHLHDIINASKS------------VSNSLEWFVKKVQDRIVLCTLRRFVVKSANKS 240
FP D+++ DII A+ S +S++W V KVQD+I+ TLR+ V S+
Sbjct: 587 FPNDIYIGDIIEAADSFRQVRLHSAVLDTRSSVQWVVAKVQDKIISTTLRKDFVMSSKTI 646
Query: 241 SHSFDYIDQDETIVCSMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCK 300
++F+Y D+DETIV + GGIDAF+KVS GWPL ++PLKL SLK+SD+ +KG SLSL+ K
Sbjct: 647 RYTFEYYDKDETIVAHIAGGIDAFLKVSDGWPLLNTPLKLASLKNSDNQSKGFSLSLISK 706
Query: 301 VEKMANSLDARIRQNLSSFADAVEKILKEQMHLELQAD 313
+E++ANSLD RQNLS F DAVEKIL +Q EL+++
Sbjct: 707 LEELANSLDLETRQNLSGFMDAVEKILVQQTREELKSN 741
BLAST of Cp4.1LG03g18000 vs. TAIR 10
Match:
AT3G24255.2 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )
HSP 1 Score: 263.5 bits (672), Expect = 2.2e-70
Identity = 154/338 (45.56%), Postives = 216/338 (63.91%), Query Frame = 0
Query: 1 MKEELVAVEAESSKISNEIE--------------VDLELLNVSLDRFTSQDPENETFNFC 60
++ EL +VEAES+K+S EIE DLE L +SLD +SQD E N
Sbjct: 89 LRNELQSVEAESAKVSEEIERLSQSHALDSSRLQRDLEGLLLSLDSMSSQDVEKSKENQP 148
Query: 61 SMNGEDQMNVIVDPECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSLDVVEQVEDTIGG 120
S + + VI D + F++ EL++ +E+ + ILKSL+++D + K D EQVED + G
Sbjct: 149 SSSSMEVCEVIDD---DKFKMFELENQMEEKRMILKSLEDLDSLRKRFDAAEQVEDALTG 208
Query: 121 LKVIDVADNFIRLSLRTHIPNLEDFSSLQKLEGMIEPSELNHELLIEVLEGTMELKNAEI 180
LKV++ NFIRL LRT+I L+ F K + + EPSEL HELLI + + T E+ E+
Sbjct: 209 LKVLEFDGNFIRLQLRTYIQKLDGFLGQHKFDHITEPSELIHELLIYLKDKTTEITKFEM 268
Query: 181 FPGDVHLHDIINASKS------------VSNSLEWFVKKVQDRIVLCTLRRFVVKSANKS 240
FP D+++ DII A+ S +S++W V KVQD+I+ TLR+ V S+
Sbjct: 269 FPNDIYIGDIIEAADSFRQVRLHSAVLDTRSSVQWVVAKVQDKIISTTLRKDFVMSSKTI 328
Query: 241 SHSFDYIDQDETIVCSMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCK 300
++F+Y D+DETIV + GGIDAF+KVS GWPL ++PLKL SLK+SD+ +KG SLSL+ K
Sbjct: 329 RYTFEYYDKDETIVAHIAGGIDAFLKVSDGWPLLNTPLKLASLKNSDNQSKGFSLSLISK 388
Query: 301 VEKMANSLDARIRQNLSSFADAVEKILKEQMHLELQAD 313
+E++ANSLD RQNLS F DAVEKIL +Q EL+++
Sbjct: 389 LEELANSLDLETRQNLSGFMDAVEKILVQQTREELKSN 423
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_023528068.1 | 4.90e-213 | 95.74 | uncharacterized protein LOC111791098 isoform X2 [Cucurbita pepo subsp. pepo] | [more] |
XP_023528067.1 | 5.47e-213 | 95.74 | uncharacterized protein LOC111791098 isoform X1 [Cucurbita pepo subsp. pepo] >XP... | [more] |
XP_022980354.1 | 1.19e-208 | 93.62 | uncharacterized protein LOC111479744 [Cucurbita maxima] >XP_022980355.1 uncharac... | [more] |
KAG6582484.1 | 1.11e-205 | 93.31 | hypothetical protein SDJN03_22486, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_022924674.1 | 2.13e-203 | 92.40 | uncharacterized protein LOC111432106 isoform X3 [Cucurbita moschata] | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1ITD7 | 5.78e-209 | 93.62 | uncharacterized protein LOC111479744 OS=Cucurbita maxima OX=3661 GN=LOC111479744... | [more] |
A0A6J1E9V8 | 1.03e-203 | 92.40 | uncharacterized protein LOC111432106 isoform X3 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1ED63 | 1.07e-203 | 92.40 | uncharacterized protein LOC111432106 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1E9M5 | 1.19e-203 | 92.40 | uncharacterized protein LOC111432106 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A5A7U6L2 | 5.88e-183 | 83.28 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold... | [more] |
Match Name | E-value | Identity | Description | |
AT3G23910.1 | 7.4e-74 | 46.45 | BEST Arabidopsis thaliana protein match is: RNA-directed DNA polymerase (reverse... | [more] |
AT3G24255.1 | 2.2e-70 | 45.56 | RNA-directed DNA polymerase (reverse transcriptase)-related family protein | [more] |
AT3G24255.2 | 2.2e-70 | 45.56 | RNA-directed DNA polymerase (reverse transcriptase)-related family protein | [more] |