Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AATAAAGATTAAAGAAGCATTTATTTGAAAGAAAGGTGACCGTGGGTAACGGTATCGTAGCCTTTTTTCCAATTCAAGAACTCTCTGTTCCATCTCCGGAGGGGTAGGAGTGTACTCTGGGGAAGAAAAATGGCGTTCACAGATCAAATCACTACCTGAAATTTACAATTCCATCACTTCAGTTCCGGATTAATGACGTTCCGATTTGGAGTTTGTCCACGCGCCACCGCTATCTCCAACTTTCCCCAACTCAATGACAAGAACTTTCTCGTTCTCCCTCTGGAACGAGAGGTTGAGATTCGGCGGGCGAGAGGCACGAGGAGATTGAGAATTAGGGTTTCGGATGGAGAAGAATCGTATCTCGGGATGTGGAAGAACGCGGTGGAGCGCCAGAGGAAAGCTAATGAGTTTCGGAAAGTTGTGGAGAATACTGTGGGAAATGACAATCGCAATGACGGTGATCGAAGTGATGATCAGTTGGAGGAGAAGAGCGAGGAGTTCAGTAAGATTCTTCAGGTCCCGACGGAGGAAAGGGATAAGATTCAGCGGATGCAGGTCATACATCGAGCCACTGCCGCCATTGCAGCTGCTCGTGCACTTGTTGGTGAGACTAGAATTGTAGCTGATTCGAATACTTCTTTGAATTTGAATAGTAGGAATGACGGTGGACTGCTCGATCGTGAAGAAGGTATTATTAACATTATTGAATCTTGAAGAACTGATCTATTTGTTGATTATATGAGTTGAAGTCAAGTTACACATTTATATAAATTCTAGGTTATTTTCTCTTCTGTATTTGACCTTCTTTGAAGTTGAACTTTGAATATTACAAGTAGTTGGTACATGTCCGTTTGGCTTGTTAGCGAATGAATAAACTTCCATGTTTGACCAGGCATATCTTCTCTGAGTTTCAAGCTCAGAAAATTGATTGGTTAACATAGATCTGCAATCAACATCTTAATACAGAAAATAATGGCAGTGTTTCTAGTGTTACAATCGCAATTATCTTGGTAGTGAACTTTTGTGATTGTGTGGCACTCACTCCCAATACTGAGTGAGTCAAGCCAATTTGGATTCTTGTCGCCTACGGTCAGGCAATGTATGCTCAAGTCTCTAGCCCCATTTTGAGAAGAAGGTTTTAGAAACGAGGGCAGAGTGATTTTCGAAAGAGAGTTTGAAAGCAAAGTTGAAAGAAGGTTTGAAAGTCGTTATAAGACCATATGCAAAGCAAGAAATACCACAACTTAAATATCATATAACTCGGGCGGGAAGAATTGACAATTAGTCACATTCTCCAATAGTACATTTTGAGGTGATTCAAGTCTGGCAGGCCTAGACGTGTTTTCGCCGAACGGTCATACAAAATCTAGGTGGGAGGAGTTAACGACTGATCACATCCCCCTATAGTATATTTTGGGGCGATTTAGGTCTGATAGGCCTAGACATGATTTTACCGAATAGTCATACAAACAGAAAATACAATATTAAGACAATACAACATGAAATCCAATAAAGGGTTTGGCATGGCGTGAGAAACAAGCCTTGGACGAACATGACCATGACAACTAGCACGCCTAGAAAAGAATGGAGAACTAGTGTTATTGTGGTTCAGTAATTTGAAGGAGACAATTTAATGGTTCATGTATTTAGACTTGTTAACTTCTTATGTGGTTGTTTAGGCTCTCTTACTAACTCTTTATTTTTCTATACTGTTTTTGATTCATCTAGCATCATCTGAATTTCAAAGCGAGAACACCCTGCTACCCAAGTCTGAAACTTCACCAACTGCGACCCCTGGTCCAGATTTCTGGTCTTGGACACCCCCTCCAGATAATGATGGGAATGGTAATGCTTTCAGTGAGTTGCGACCAATTGAAAAATCACAGGTATATCCAATGCTATCCAATTTTGTTAAGGAGAAAGAGCCTCCGGTGGGATTTCTCTCAATTCCTTTTCAGAGTGAACTTCCTGAAAGCGTCAAACCTCTTCTACCACCTTTTCAATCATTGATGGACATTGAAAAGTTAGAATCATTGGAAACCAATACAGAGACACATTCCTTGGAAGACGATGAAAATGTTGGGATGGAGTTTTCAGTACTTGCAGCAGAAGCATCTCAGGCGCTTAGTAGCATAGATAAAGAATCAACAAAAGGAATAGATTCAGATGGGTCACGATGGTGGAAGGAGATGGGAGTTGAGCAGAGACCGGATGGAGTGATTTGCAAGTGGACACTAACGAGGGGAGTTAGCGCAGACTTCGCCACCGAGTGGCAGAACAAGTATTGGGAAGCTGCTGATGAGTTTGGTTATAAGGAACTTGGTTCAGAGAAATCTGGCCGCGATGCCTATGGAAGTGTTTGGCGTGAATATTGGAGAGAATCTATGCAGCAGGTCAGTTTAAATTGTTAATGATTTGCTTGCTTCTGTTGCATTATCCTTGTAAAATATTTGCTTTATGCAATCAGTTAAAGTCTCAATAGTCTTTGTTTTATGCACTCACTTAAAGTGCTTAATGGTTAGTTCGAATCCATATGCTAGTTTGCCAAATTTGAAAACGAAGTTAAACCATTTCATTGGATATTGCGAGAAGATGGAAATGCCTTTATGAGTGTGTAATGTACAATCTGTCTGTTGATATCTAACAAATTCTTTTGTGTCTCTCTCTCTGATACCTGAGAAGGATTCGCTGATGTTTCGATGTCGGGAAATGATCTCTAATGTGGATCCTGGAAATTGCTTCATGTGGTTCCTAATTTCAATCATGGCAGCAAGTAGTCCTTATTATGACCATAACTGAACTTACTTTCTTTCAGGAGCAAGGCCTTGTTCACCTTGAAAAAACTGCAGACAAATGGGGGAAAAATGGAAGTGGCACCGAGTGGCAAGAGAAATGGTGGGAATATTATAATACCTCTGGTCAAGTTGAAAAAAATGCTCATAAATGGTGTAAAATTGACCCGAACACATATGTCGATCCTGGTCATGCTCATATCTGGCATGAAAGGTATCAAAATTTTTCAAATGTTGGAGATGCTAATCACCTGTTGTTCTTGCTGTCCTCTTCACTTGGTCTTTTGTGATGCATTCTCAATCCTCTGAATTCCTTCTTCGAGATTTCGTACGTTTAATCTGGTGACACGCTTCTTAAGAAATTGTCTATTTGAGTTTGCTTTAGCATGTAACTTTCTTCTTGCAGATGGGGTGAAAAGTATGATGGACAAGGTGGCAGCATCAAGTACACAGATAAATGGGCTGAAGGATGTGAAGGTGATGGTTGGACGAAGTGGGGCGACAAATGGGACGAAAACTTCGATTCGAATGGTCATGGCGTCAAACAGGGGGAAACATGGTGGGAAGGTAAACATGGAGAACGGTGGAACCGTACGTGGGGCGAGGGCCACAGTGGTTCAGGCTGGGTTCACAAGTATGGCAAGAGCAGCAGTGGGGAGCATTGGGACACACATGTTCAGCAGGAAACCTGGTATGAGAGATTCCCACATTTCGGCTTTTATCACTGCTTCAACAATTCAGTCCAGCTCCGGGAAGTTCAGAAGCCATCTGAGACGTCTTTGTAAGTCTTCTTTTCCTACTGCTAAGTGAATATAAAAACTTGCAGTATTATAGCTCATTTTGAAGAAAATTGAATGCTTTATCGGAGTGGATACCGTTTTTCTGGCATTGTACATAAAATAAAAGCTTTTAAGTTAATAAATTAGCTTACTTCTTTCTTTCCTGAG
mRNA sequence
AATAAAGATTAAAGAAGCATTTATTTGAAAGAAAGGTGACCGTGGGTAACGGTATCGTAGCCTTTTTTCCAATTCAAGAACTCTCTGTTCCATCTCCGGAGGGGTAGGAGTGTACTCTGGGGAAGAAAAATGGCGTTCACAGATCAAATCACTACCTGAAATTTACAATTCCATCACTTCAGTTCCGGATTAATGACGTTCCGATTTGGAGTTTGTCCACGCGCCACCGCTATCTCCAACTTTCCCCAACTCAATGACAAGAACTTTCTCGTTCTCCCTCTGGAACGAGAGGTTGAGATTCGGCGGGCGAGAGGCACGAGGAGATTGAGAATTAGGGTTTCGGATGGAGAAGAATCGTATCTCGGGATGTGGAAGAACGCGGTGGAGCGCCAGAGGAAAGCTAATGAGTTTCGGAAAGTTGTGGAGAATACTGTGGGAAATGACAATCGCAATGACGGTGATCGAAGTGATGATCAGTTGGAGGAGAAGAGCGAGGAGTTCAGTAAGATTCTTCAGGTCCCGACGGAGGAAAGGGATAAGATTCAGCGGATGCAGGTCATACATCGAGCCACTGCCGCCATTGCAGCTGCTCGTGCACTTGTTGGTGAGACTAGAATTGTAGCTGATTCGAATACTTCTTTGAATTTGAATAGTAGGAATGACGGTGGACTGCTCGATCGTGAAGAAGCATCATCTGAATTTCAAAGCGAGAACACCCTGCTACCCAAGTCTGAAACTTCACCAACTGCGACCCCTGGTCCAGATTTCTGGTCTTGGACACCCCCTCCAGATAATGATGGGAATGGTAATGCTTTCAGTGAGTTGCGACCAATTGAAAAATCACAGGTATATCCAATGCTATCCAATTTTGTTAAGGAGAAAGAGCCTCCGGTGGGATTTCTCTCAATTCCTTTTCAGAGTGAACTTCCTGAAAGCGTCAAACCTCTTCTACCACCTTTTCAATCATTGATGGACATTGAAAAGTTAGAATCATTGGAAACCAATACAGAGACACATTCCTTGGAAGACGATGAAAATGTTGGGATGGAGTTTTCAGTACTTGCAGCAGAAGCATCTCAGGCGCTTAGTAGCATAGATAAAGAATCAACAAAAGGAATAGATTCAGATGGGTCACGATGGTGGAAGGAGATGGGAGTTGAGCAGAGACCGGATGGAGTGATTTGCAAGTGGACACTAACGAGGGGAGTTAGCGCAGACTTCGCCACCGAGTGGCAGAACAAGTATTGGGAAGCTGCTGATGAGTTTGGTTATAAGGAACTTGGTTCAGAGAAATCTGGCCGCGATGCCTATGGAAGTGTTTGGCGTGAATATTGGAGAGAATCTATGCAGCAGGAGCAAGGCCTTGTTCACCTTGAAAAAACTGCAGACAAATGGGGGAAAAATGGAAGTGGCACCGAGTGGCAAGAGAAATGGTGGGAATATTATAATACCTCTGGTCAAGTTGAAAAAAATGCTCATAAATGGTGTAAAATTGACCCGAACACATATGTCGATCCTGGTCATGCTCATATCTGGCATGAAAGATGGGGTGAAAAGTATGATGGACAAGGTGGCAGCATCAAGTACACAGATAAATGGGCTGAAGGATGTGAAGGTGATGGTTGGACGAAGTGGGGCGACAAATGGGACGAAAACTTCGATTCGAATGGTCATGGCGTCAAACAGGGGGAAACATGGTGGGAAGGTAAACATGGAGAACGGTGGAACCGTACGTGGGGCGAGGGCCACAGTGGTTCAGGCTGGGTTCACAAGTATGGCAAGAGCAGCAGTGGGGAGCATTGGGACACACATGTTCAGCAGGAAACCTGGTATGAGAGATTCCCACATTTCGGCTTTTATCACTGCTTCAACAATTCAGTCCAGCTCCGGGAAGTTCAGAAGCCATCTGAGACGTCTTTGTAAGTCTTCTTTTCCTACTGCTAAGTGAATATAAAAACTTGCAGTATTATAGCTCATTTTGAAGAAAATTGAATGCTTTATCGGAGTGGATACCGTTTTTCTGGCATTGTACATAAAATAAAAGCTTTTAAGTTAATAAATTAGCTTACTTCTTTCTTTCCTGAG
Coding sequence (CDS)
ATGACGTTCCGATTTGGAGTTTGTCCACGCGCCACCGCTATCTCCAACTTTCCCCAACTCAATGACAAGAACTTTCTCGTTCTCCCTCTGGAACGAGAGGTTGAGATTCGGCGGGCGAGAGGCACGAGGAGATTGAGAATTAGGGTTTCGGATGGAGAAGAATCGTATCTCGGGATGTGGAAGAACGCGGTGGAGCGCCAGAGGAAAGCTAATGAGTTTCGGAAAGTTGTGGAGAATACTGTGGGAAATGACAATCGCAATGACGGTGATCGAAGTGATGATCAGTTGGAGGAGAAGAGCGAGGAGTTCAGTAAGATTCTTCAGGTCCCGACGGAGGAAAGGGATAAGATTCAGCGGATGCAGGTCATACATCGAGCCACTGCCGCCATTGCAGCTGCTCGTGCACTTGTTGGTGAGACTAGAATTGTAGCTGATTCGAATACTTCTTTGAATTTGAATAGTAGGAATGACGGTGGACTGCTCGATCGTGAAGAAGCATCATCTGAATTTCAAAGCGAGAACACCCTGCTACCCAAGTCTGAAACTTCACCAACTGCGACCCCTGGTCCAGATTTCTGGTCTTGGACACCCCCTCCAGATAATGATGGGAATGGTAATGCTTTCAGTGAGTTGCGACCAATTGAAAAATCACAGGTATATCCAATGCTATCCAATTTTGTTAAGGAGAAAGAGCCTCCGGTGGGATTTCTCTCAATTCCTTTTCAGAGTGAACTTCCTGAAAGCGTCAAACCTCTTCTACCACCTTTTCAATCATTGATGGACATTGAAAAGTTAGAATCATTGGAAACCAATACAGAGACACATTCCTTGGAAGACGATGAAAATGTTGGGATGGAGTTTTCAGTACTTGCAGCAGAAGCATCTCAGGCGCTTAGTAGCATAGATAAAGAATCAACAAAAGGAATAGATTCAGATGGGTCACGATGGTGGAAGGAGATGGGAGTTGAGCAGAGACCGGATGGAGTGATTTGCAAGTGGACACTAACGAGGGGAGTTAGCGCAGACTTCGCCACCGAGTGGCAGAACAAGTATTGGGAAGCTGCTGATGAGTTTGGTTATAAGGAACTTGGTTCAGAGAAATCTGGCCGCGATGCCTATGGAAGTGTTTGGCGTGAATATTGGAGAGAATCTATGCAGCAGGAGCAAGGCCTTGTTCACCTTGAAAAAACTGCAGACAAATGGGGGAAAAATGGAAGTGGCACCGAGTGGCAAGAGAAATGGTGGGAATATTATAATACCTCTGGTCAAGTTGAAAAAAATGCTCATAAATGGTGTAAAATTGACCCGAACACATATGTCGATCCTGGTCATGCTCATATCTGGCATGAAAGATGGGGTGAAAAGTATGATGGACAAGGTGGCAGCATCAAGTACACAGATAAATGGGCTGAAGGATGTGAAGGTGATGGTTGGACGAAGTGGGGCGACAAATGGGACGAAAACTTCGATTCGAATGGTCATGGCGTCAAACAGGGGGAAACATGGTGGGAAGGTAAACATGGAGAACGGTGGAACCGTACGTGGGGCGAGGGCCACAGTGGTTCAGGCTGGGTTCACAAGTATGGCAAGAGCAGCAGTGGGGAGCATTGGGACACACATGTTCAGCAGGAAACCTGGTATGAGAGATTCCCACATTTCGGCTTTTATCACTGCTTCAACAATTCAGTCCAGCTCCGGGAAGTTCAGAAGCCATCTGAGACGTCTTTGTAA
Protein sequence
MTFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRARGTRRLRIRVSDGEESYLGMWKNAVERQRKANEFRKVVENTVGNDNRNDGDRSDDQLEEKSEEFSKILQVPTEERDKIQRMQVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSENTLLPKSETSPTATPGPDFWSWTPPPDNDGNGNAFSELRPIEKSQVYPMLSNFVKEKEPPVGFLSIPFQSELPESVKPLLPPFQSLMDIEKLESLETNTETHSLEDDENVGMEFSVLAAEASQALSSIDKESTKGIDSDGSRWWKEMGVEQRPDGVICKWTLTRGVSADFATEWQNKYWEAADEFGYKELGSEKSGRDAYGSVWREYWRESMQQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNTSGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTKWGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTHVQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL
Homology
BLAST of CmoCh17G001940 vs. ExPASy TrEMBL
Match:
A0A6J1H328 (uncharacterized protein LOC111460038 OS=Cucurbita moschata OX=3662 GN=LOC111460038 PE=4 SV=1)
HSP 1 Score: 1188.3 bits (3073), Expect = 0.0e+00
Identity = 576/576 (100.00%), Postives = 576/576 (100.00%), Query Frame = 0
Query: 1 MTFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRARGTRRLRIRVSDGEESYLGMW 60
MTFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRARGTRRLRIRVSDGEESYLGMW
Sbjct: 1 MTFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRARGTRRLRIRVSDGEESYLGMW 60
Query: 61 KNAVERQRKANEFRKVVENTVGNDNRNDGDRSDDQLEEKSEEFSKILQVPTEERDKIQRM 120
KNAVERQRKANEFRKVVENTVGNDNRNDGDRSDDQLEEKSEEFSKILQVPTEERDKIQRM
Sbjct: 61 KNAVERQRKANEFRKVVENTVGNDNRNDGDRSDDQLEEKSEEFSKILQVPTEERDKIQRM 120
Query: 121 QVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSENTLLPKS 180
QVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSENTLLPKS
Sbjct: 121 QVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSENTLLPKS 180
Query: 181 ETSPTATPGPDFWSWTPPPDNDGNGNAFSELRPIEKSQVYPMLSNFVKEKEPPVGFLSIP 240
ETSPTATPGPDFWSWTPPPDNDGNGNAFSELRPIEKSQVYPMLSNFVKEKEPPVGFLSIP
Sbjct: 181 ETSPTATPGPDFWSWTPPPDNDGNGNAFSELRPIEKSQVYPMLSNFVKEKEPPVGFLSIP 240
Query: 241 FQSELPESVKPLLPPFQSLMDIEKLESLETNTETHSLEDDENVGMEFSVLAAEASQALSS 300
FQSELPESVKPLLPPFQSLMDIEKLESLETNTETHSLEDDENVGMEFSVLAAEASQALSS
Sbjct: 241 FQSELPESVKPLLPPFQSLMDIEKLESLETNTETHSLEDDENVGMEFSVLAAEASQALSS 300
Query: 301 IDKESTKGIDSDGSRWWKEMGVEQRPDGVICKWTLTRGVSADFATEWQNKYWEAADEFGY 360
IDKESTKGIDSDGSRWWKEMGVEQRPDGVICKWTLTRGVSADFATEWQNKYWEAADEFGY
Sbjct: 301 IDKESTKGIDSDGSRWWKEMGVEQRPDGVICKWTLTRGVSADFATEWQNKYWEAADEFGY 360
Query: 361 KELGSEKSGRDAYGSVWREYWRESMQQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNT 420
KELGSEKSGRDAYGSVWREYWRESMQQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNT
Sbjct: 361 KELGSEKSGRDAYGSVWREYWRESMQQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNT 420
Query: 421 SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK 480
SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK
Sbjct: 421 SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK 480
Query: 481 WGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH 540
WGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH
Sbjct: 481 WGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH 540
Query: 541 VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL 577
VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL
Sbjct: 541 VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL 576
BLAST of CmoCh17G001940 vs. ExPASy TrEMBL
Match:
A0A6J1KWK6 (uncharacterized protein LOC111499367 OS=Cucurbita maxima OX=3661 GN=LOC111499367 PE=4 SV=1)
HSP 1 Score: 1164.8 bits (3012), Expect = 0.0e+00
Identity = 563/576 (97.74%), Postives = 570/576 (98.96%), Query Frame = 0
Query: 1 MTFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRARGTRRLRIRVSDGEESYLGMW 60
M FRFGVCPRATAISNFPQLNDKNFL+LP EREVEIRRARGTRRLRIRVSDGEESYLGMW
Sbjct: 1 MAFRFGVCPRATAISNFPQLNDKNFLILPHEREVEIRRARGTRRLRIRVSDGEESYLGMW 60
Query: 61 KNAVERQRKANEFRKVVENTVGNDNRNDGDRSDDQLEEKSEEFSKILQVPTEERDKIQRM 120
KNAVERQRKA EFRKVVENTVGND+RNDGDRSDDQL EKSEEFSKILQVPT+ERDKIQRM
Sbjct: 61 KNAVERQRKAIEFRKVVENTVGNDDRNDGDRSDDQLAEKSEEFSKILQVPTKERDKIQRM 120
Query: 121 QVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSENTLLPKS 180
QVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSENTLLPKS
Sbjct: 121 QVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSENTLLPKS 180
Query: 181 ETSPTATPGPDFWSWTPPPDNDGNGNAFSELRPIEKSQVYPMLSNFVKEKEPPVGFLSIP 240
ETSPTATPGPDFWSWTPPPDNDGNGNAFSELRPIEKSQ YPMLSNFVKEKEPPVGFLSIP
Sbjct: 181 ETSPTATPGPDFWSWTPPPDNDGNGNAFSELRPIEKSQAYPMLSNFVKEKEPPVGFLSIP 240
Query: 241 FQSELPESVKPLLPPFQSLMDIEKLESLETNTETHSLEDDENVGMEFSVLAAEASQALSS 300
FQSELPESVKPLLPPFQSLMDIEKLESLET+TETHSLE+DENVGMEFSVLAAEASQALSS
Sbjct: 241 FQSELPESVKPLLPPFQSLMDIEKLESLETSTETHSLEEDENVGMEFSVLAAEASQALSS 300
Query: 301 IDKESTKGIDSDGSRWWKEMGVEQRPDGVICKWTLTRGVSADFATEWQNKYWEAADEFGY 360
IDKESTKGIDSDGSRWWKEMGVEQRPDGVICKWTLTRGVSAD ATEWQNKYWEAADEFGY
Sbjct: 301 IDKESTKGIDSDGSRWWKEMGVEQRPDGVICKWTLTRGVSADLATEWQNKYWEAADEFGY 360
Query: 361 KELGSEKSGRDAYGSVWREYWRESMQQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNT 420
KELGSEKSGRDAYGSVWREYWRESM+QEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNT
Sbjct: 361 KELGSEKSGRDAYGSVWREYWRESMRQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNT 420
Query: 421 SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK 480
SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK
Sbjct: 421 SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK 480
Query: 481 WGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH 540
WGDKWDENFDSNGHG+KQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH
Sbjct: 481 WGDKWDENFDSNGHGIKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH 540
Query: 541 VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL 577
VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL
Sbjct: 541 VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL 576
BLAST of CmoCh17G001940 vs. ExPASy TrEMBL
Match:
A0A5A7UQT1 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold216G001440 PE=4 SV=1)
HSP 1 Score: 1018.1 bits (2631), Expect = 1.5e-293
Identity = 493/577 (85.44%), Postives = 525/577 (90.99%), Query Frame = 0
Query: 1 MTFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRARGTRRLRIRVSDGEESYLGMW 60
M FR GV PR T ++FP L NFL+LPLERE++IR A GTR LRIR SDG ESYLGMW
Sbjct: 1 MPFRLGVSPRPTLSNHFPHLYHHNFLLLPLEREIDIRHATGTRSLRIRASDGGESYLGMW 60
Query: 61 KNAVERQRKANEFRKVVENTVGNDNRNDGDRSDDQLEEKSEEFSKILQVPTEERDKIQRM 120
KNAVERQRKA EF+KVVENTVGND+RN GD S DQLE+KSEEFSKILQVP EERD+IQRM
Sbjct: 61 KNAVERQRKAIEFQKVVENTVGNDDRNAGDPSSDQLEKKSEEFSKILQVPPEERDRIQRM 120
Query: 121 QVIHRATAAIAAARALVGETRIVA--DSNTSLNLNSRNDGGLLDREEASSEFQSENTLLP 180
QVIHRA AAIAAARALVGET +A DS+ S+NLNS NDGGLLDREEA EFQSEN+LLP
Sbjct: 121 QVIHRAAAAIAAARALVGETGTLAVGDSDASVNLNSTNDGGLLDREEALPEFQSENSLLP 180
Query: 181 KSETSPTATPGPDFWSWTPPPDNDGNGNAFSELRPIEKSQVYPMLSNFVKEKEPPVGFLS 240
+SETS + TPGPDFWSWTPPPDND NGNAF EL+PI KSQ YP LSNFV+EKE P+ LS
Sbjct: 181 ESETSRSWTPGPDFWSWTPPPDNDENGNAFGELQPIGKSQAYPKLSNFVEEKERPIDSLS 240
Query: 241 IPFQSELPESVKPLLPPFQSLMDIEKLESLETNTETHSLEDDENVGMEFSVLAAEASQAL 300
IPFQSE+ ESV PLLPPFQSL+ +EKLES ET+TETHSLE+DENVG+EFSV AAEASQAL
Sbjct: 241 IPFQSEISESVNPLLPPFQSLVGMEKLESSETSTETHSLEEDENVGIEFSVHAAEASQAL 300
Query: 301 SSIDKESTKGIDSDGSRWWKEMGVEQRPDGVICKWTLTRGVSADFATEWQNKYWEAADEF 360
SS+DKESTKGID DGSRWWKE G+EQRPDGVIC+WTLTRGVSAD ATEWQNKYWEAADEF
Sbjct: 301 SSVDKESTKGIDPDGSRWWKETGIEQRPDGVICRWTLTRGVSADLATEWQNKYWEAADEF 360
Query: 361 GYKELGSEKSGRDAYGSVWREYWRESMQQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYY 420
GYKELGSEKSGRDAYG+VWREYWRESM+QEQGLVHLEKTADKWG NGSGTEWQEKWWEYY
Sbjct: 361 GYKELGSEKSGRDAYGNVWREYWRESMRQEQGLVHLEKTADKWGINGSGTEWQEKWWEYY 420
Query: 421 NTSGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGW 480
NTSGQ EKNAHKWCKIDPNTYVDPGHAHIW+ERWGEKYDGQGGSIKYTDKWAE CEGDGW
Sbjct: 421 NTSGQAEKNAHKWCKIDPNTYVDPGHAHIWNERWGEKYDGQGGSIKYTDKWAERCEGDGW 480
Query: 481 TKWGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWD 540
TKWGDKWDENFD NGHGVKQGETWWEGKHGERWNRTWGEGH+GSGWVHKYGKSSSGEHWD
Sbjct: 481 TKWGDKWDENFDPNGHGVKQGETWWEGKHGERWNRTWGEGHNGSGWVHKYGKSSSGEHWD 540
Query: 541 THVQQETWYERFPHFGFYHCFNNSVQLREVQKPSETS 576
TH QQETWYERFPHFGFYHCFNNSVQLREVQKPSET+
Sbjct: 541 THAQQETWYERFPHFGFYHCFNNSVQLREVQKPSETA 577
BLAST of CmoCh17G001940 vs. ExPASy TrEMBL
Match:
A0A1S3BMD1 (uncharacterized protein LOC103491223 OS=Cucumis melo OX=3656 GN=LOC103491223 PE=4 SV=1)
HSP 1 Score: 1018.1 bits (2631), Expect = 1.5e-293
Identity = 493/577 (85.44%), Postives = 525/577 (90.99%), Query Frame = 0
Query: 1 MTFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRARGTRRLRIRVSDGEESYLGMW 60
M FR GV PR T ++FP L NFL+LPLERE++IR A GTR LRIR SDG ESYLGMW
Sbjct: 1 MPFRLGVSPRPTLSNHFPHLYHHNFLLLPLEREIDIRHATGTRSLRIRASDGGESYLGMW 60
Query: 61 KNAVERQRKANEFRKVVENTVGNDNRNDGDRSDDQLEEKSEEFSKILQVPTEERDKIQRM 120
KNAVERQRKA EF+KVVENTVGND+RN GD S DQLE+KSEEFSKILQVP EERD+IQRM
Sbjct: 61 KNAVERQRKAIEFQKVVENTVGNDDRNAGDPSSDQLEKKSEEFSKILQVPPEERDRIQRM 120
Query: 121 QVIHRATAAIAAARALVGETRIVA--DSNTSLNLNSRNDGGLLDREEASSEFQSENTLLP 180
QVIHRA AAIAAARALVGET +A DS+ S+NLNS NDGGLLDREEA EFQSEN+LLP
Sbjct: 121 QVIHRAAAAIAAARALVGETGTLAVGDSDASVNLNSTNDGGLLDREEALPEFQSENSLLP 180
Query: 181 KSETSPTATPGPDFWSWTPPPDNDGNGNAFSELRPIEKSQVYPMLSNFVKEKEPPVGFLS 240
+SETS + TPGPDFWSWTPPPDND NGNAF EL+PI KSQ YP LSNFV+EKE P+ LS
Sbjct: 181 ESETSRSWTPGPDFWSWTPPPDNDENGNAFGELQPIGKSQAYPKLSNFVEEKERPIDSLS 240
Query: 241 IPFQSELPESVKPLLPPFQSLMDIEKLESLETNTETHSLEDDENVGMEFSVLAAEASQAL 300
IPFQSE+ ESV PLLPPFQSL+ +EKLES ET+TETHSLE+DENVG+EFSV AAEASQAL
Sbjct: 241 IPFQSEISESVNPLLPPFQSLVGMEKLESSETSTETHSLEEDENVGIEFSVHAAEASQAL 300
Query: 301 SSIDKESTKGIDSDGSRWWKEMGVEQRPDGVICKWTLTRGVSADFATEWQNKYWEAADEF 360
SS+DKESTKGID DGSRWWKE G+EQRPDGVIC+WTLTRGVSAD ATEWQNKYWEAADEF
Sbjct: 301 SSVDKESTKGIDPDGSRWWKETGIEQRPDGVICRWTLTRGVSADLATEWQNKYWEAADEF 360
Query: 361 GYKELGSEKSGRDAYGSVWREYWRESMQQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYY 420
GYKELGSEKSGRDAYG+VWREYWRESM+QEQGLVHLEKTADKWG NGSGTEWQEKWWEYY
Sbjct: 361 GYKELGSEKSGRDAYGNVWREYWRESMRQEQGLVHLEKTADKWGINGSGTEWQEKWWEYY 420
Query: 421 NTSGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGW 480
NTSGQ EKNAHKWCKIDPNTYVDPGHAHIW+ERWGEKYDGQGGSIKYTDKWAE CEGDGW
Sbjct: 421 NTSGQAEKNAHKWCKIDPNTYVDPGHAHIWNERWGEKYDGQGGSIKYTDKWAERCEGDGW 480
Query: 481 TKWGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWD 540
TKWGDKWDENFD NGHGVKQGETWWEGKHGERWNRTWGEGH+GSGWVHKYGKSSSGEHWD
Sbjct: 481 TKWGDKWDENFDPNGHGVKQGETWWEGKHGERWNRTWGEGHNGSGWVHKYGKSSSGEHWD 540
Query: 541 THVQQETWYERFPHFGFYHCFNNSVQLREVQKPSETS 576
TH QQETWYERFPHFGFYHCFNNSVQLREVQKPSET+
Sbjct: 541 THAQQETWYERFPHFGFYHCFNNSVQLREVQKPSETA 577
BLAST of CmoCh17G001940 vs. ExPASy TrEMBL
Match:
A0A0A0LNS6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G215500 PE=4 SV=1)
HSP 1 Score: 988.8 bits (2555), Expect = 9.6e-285
Identity = 479/575 (83.30%), Postives = 514/575 (89.39%), Query Frame = 0
Query: 1 MTFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRARGTRRLRIRVSDGEESYLGMW 60
M R + PR T +FP+L NFL+LPL+ ++IR A R LRIR SD ESYLGMW
Sbjct: 2 MPLRLPLSPRPTLHHHFPRLYHHNFLLLPLQPHIQIRHATPARTLRIRASDEGESYLGMW 61
Query: 61 KNAVERQRKANEFRKVVENTVGNDNRNDGDRSDDQLEEKSEEFSKILQVPTEERDKIQRM 120
KNAVERQRKA EF+KVVENT GND+RN GD S DQLE+KSEEFSKILQVP EERD+IQRM
Sbjct: 62 KNAVERQRKAVEFQKVVENTEGNDDRNAGDPSSDQLEKKSEEFSKILQVPPEERDRIQRM 121
Query: 121 QVIHRATAAIAAARALVGETRIVA--DSNTSLNLNSRNDGGLLDREEASSEFQSENTLLP 180
QVIHRA AAIAAARALVGET +A DS+T +NLNS ND GLLDREEA SEFQSEN LLP
Sbjct: 122 QVIHRAAAAIAAARALVGETGTLAVGDSDTCVNLNSTNDEGLLDREEALSEFQSENALLP 181
Query: 181 KSETSPTATPGPDFWSWTPPPDNDGNGNAFSELRPIEKSQVYPMLSNFVKEKEPPVGFLS 240
+ ETS + TPGPDFWSWTPPPD+DGN NAF EL+P+ KSQ YP LSNFV+EKE P+ FLS
Sbjct: 182 EFETSQSWTPGPDFWSWTPPPDDDGNDNAFGELQPLGKSQAYPKLSNFVEEKERPIDFLS 241
Query: 241 IPFQSELPESVKPLLPPFQSLMDIEKLESLETNTETHSLEDDENVGMEFSVLAAEASQAL 300
IPFQSE+ ESV PLLPPFQSL+ +EKLES ET+TETHSLE+DENVG+EFSV AAEASQAL
Sbjct: 242 IPFQSEISESVNPLLPPFQSLVGMEKLESSETSTETHSLEEDENVGIEFSVHAAEASQAL 301
Query: 301 SSIDKESTKGIDSDGSRWWKEMGVEQRPDGVICKWTLTRGVSADFATEWQNKYWEAADEF 360
SS+DKESTKGID DGSRWWKE G+EQRPDGVICKWTLTRGVSAD ATEWQNKYWEAADEF
Sbjct: 302 SSVDKESTKGIDPDGSRWWKETGIEQRPDGVICKWTLTRGVSADLATEWQNKYWEAADEF 361
Query: 361 GYKELGSEKSGRDAYGSVWREYWRESMQQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYY 420
GYKELGSEKSGRDAYG+VWREYWRESM+QEQGLVHLEKTADKWG NGSGTEWQEKWWEYY
Sbjct: 362 GYKELGSEKSGRDAYGNVWREYWRESMRQEQGLVHLEKTADKWGINGSGTEWQEKWWEYY 421
Query: 421 NTSGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGW 480
NTSGQ EKNAHKWCKIDPNTYVDPGHAHIW+ERWGEKYDGQGGSIKYTDKWAE CEGDGW
Sbjct: 422 NTSGQAEKNAHKWCKIDPNTYVDPGHAHIWNERWGEKYDGQGGSIKYTDKWAERCEGDGW 481
Query: 481 TKWGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWD 540
TKWGDKWDENFD NGHG+KQGETWWEG+HGERWNRTWGEGH+GSGWVHKYGKSSSGEHWD
Sbjct: 482 TKWGDKWDENFDPNGHGIKQGETWWEGRHGERWNRTWGEGHNGSGWVHKYGKSSSGEHWD 541
Query: 541 THVQQETWYERFPHFGFYHCFNNSVQLREVQKPSE 574
TH QQETWYERFPHFGFYHCFNNSVQLREVQKPSE
Sbjct: 542 THAQQETWYERFPHFGFYHCFNNSVQLREVQKPSE 576
BLAST of CmoCh17G001940 vs. TAIR 10
Match:
AT3G55760.1 (unknown protein; LOCATED IN: chloroplast stroma, chloroplast; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 10 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G42430.2); Has 176 Blast hits to 125 proteins in 40 species: Archae - 0; Bacteria - 3; Metazoa - 19; Fungi - 9; Plants - 81; Viruses - 0; Other Eukaryotes - 64 (source: NCBI BLink). )
HSP 1 Score: 635.6 bits (1638), Expect = 4.0e-182
Identity = 316/550 (57.45%), Postives = 398/550 (72.36%), Query Frame = 0
Query: 29 PLEREVEIRRAR-GTRRLRIRVSDGEESYLGMWKNAVERQRKANEFRKVVENTVGNDNRN 88
P+ +R +R G R LR+ ++G ESYL MWKNAV+R++K F K+ EN V D
Sbjct: 38 PVTSRRSLRGSRTGVRILRVS-NEGRESYLDMWKNAVDREKKEKAFEKIAENVVAVDGEK 97
Query: 89 DGDRSDDQLEEKSEEFSKILQVPTEERDKIQRMQVIHRATAAIAAARALVGETRIVADSN 148
+ LE+KS+EF KIL+V EERD+IQRMQV+ RA AAI+AARA++
Sbjct: 98 E---KGGDLEKKSDEFQKILEVSVEERDRIQRMQVVDRAAAAISAARAILASNNSGDGKE 157
Query: 149 TSLNLNSRNDGGLLDR-EEASSEFQSENTLLPKSETSPTATPGPDFWSWTPPPDNDGNGN 208
N ++ + + + A S +P+SETS T TPGPDFWSWTPP G+
Sbjct: 158 GFPNEDNTVTSEVTETPKNAKLGMWSRTVYVPRSETSGTETPGPDFWSWTPP---QGSEI 217
Query: 209 AFSELRPIEKSQVYPMLSNFVKEKEPPVGFLSIPFQSEL-PESVKPLLPPFQSLMDIEKL 268
+ +L+ +EK +P L N V EK+ LSIP++S L E +PPF+SL+++ K
Sbjct: 218 SSVDLQAVEKPAEFPTLPNPVLEKDKSADSLSIPYESMLSSERHSFTIPPFESLIEVRKE 277
Query: 269 ESLETNTETHSLEDDENVGMEFSVLAAEASQALSSIDKESTKGIDSDGSRWWKEMGVEQR 328
+ ++ET S E D + + S A E ++ L S+D+ ST G+ DG +WWK+ GVE+R
Sbjct: 278 AETKPSSETLSTEHD--LDLISSANAEEVARVLDSLDESSTHGVSEDGLKWWKQTGVEKR 337
Query: 329 PDGVICKWTLTRGVSADFATEWQNKYWEAADEFGYKELGSEKSGRDAYGSVWREYWRESM 388
PDGV+C+WT+ RGV+AD EWQ+KYWEA+D+FG+KELGSEKSGRDA G+VWRE+WRESM
Sbjct: 338 PDGVVCRWTMIRGVTADGVVEWQDKYWEASDDFGFKELGSEKSGRDATGNVWREFWRESM 397
Query: 389 QQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNTSGQVEKNAHKWCKIDPNTYVDPGHA 448
QE G+VH+EKTADKWGK+G G EWQEKWWE+Y+ +G+ EK AHKWC ID NT +D GHA
Sbjct: 398 SQENGVVHMEKTADKWGKSGQGDEWQEKWWEHYDATGKSEKWAHKWCSIDRNTPLDAGHA 457
Query: 449 HIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTKWGDKWDENFDSNGHGVKQGETWWEG 508
H+WHERWGEKYDGQGGS KYTDKWAE GDGW KWGDKWDENF+ + GVKQGETWWEG
Sbjct: 458 HVWHERWGEKYDGQGGSTKYTDKWAERWVGDGWDKWGDKWDENFNPSAQGVKQGETWWEG 517
Query: 509 KHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTHVQQETWYERFPHFGFYHCFNNSVQL 568
KHG+RWNR+WGEGH+GSGWVHKYGKSSSGEHWDTHV QETWYE+FPHFGF+HCF+NSVQL
Sbjct: 518 KHGDRWNRSWGEGHNGSGWVHKYGKSSSGEHWDTHVPQETWYEKFPHFGFFHCFDNSVQL 577
Query: 569 REVQKPSETS 576
R V+KPS+ S
Sbjct: 578 RAVKKPSDMS 578
BLAST of CmoCh17G001940 vs. TAIR 10
Match:
AT3G55760.2 (unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 10 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G42430.2); Has 176 Blast hits to 125 proteins in 40 species: Archae - 0; Bacteria - 3; Metazoa - 19; Fungi - 9; Plants - 81; Viruses - 0; Other Eukaryotes - 64 (source: NCBI BLink). )
HSP 1 Score: 635.6 bits (1638), Expect = 4.0e-182
Identity = 316/550 (57.45%), Postives = 398/550 (72.36%), Query Frame = 0
Query: 29 PLEREVEIRRAR-GTRRLRIRVSDGEESYLGMWKNAVERQRKANEFRKVVENTVGNDNRN 88
P+ +R +R G R LR+ ++G ESYL MWKNAV+R++K F K+ EN V D
Sbjct: 38 PVTSRRSLRGSRTGVRILRVS-NEGRESYLDMWKNAVDREKKEKAFEKIAENVVAVDGEK 97
Query: 89 DGDRSDDQLEEKSEEFSKILQVPTEERDKIQRMQVIHRATAAIAAARALVGETRIVADSN 148
+ LE+KS+EF KIL+V EERD+IQRMQV+ RA AAI+AARA++
Sbjct: 98 E---KGGDLEKKSDEFQKILEVSVEERDRIQRMQVVDRAAAAISAARAILASNNSGDGKE 157
Query: 149 TSLNLNSRNDGGLLDR-EEASSEFQSENTLLPKSETSPTATPGPDFWSWTPPPDNDGNGN 208
N ++ + + + A S +P+SETS T TPGPDFWSWTPP G+
Sbjct: 158 GFPNEDNTVTSEVTETPKNAKLGMWSRTVYVPRSETSGTETPGPDFWSWTPP---QGSEI 217
Query: 209 AFSELRPIEKSQVYPMLSNFVKEKEPPVGFLSIPFQSEL-PESVKPLLPPFQSLMDIEKL 268
+ +L+ +EK +P L N V EK+ LSIP++S L E +PPF+SL+++ K
Sbjct: 218 SSVDLQAVEKPAEFPTLPNPVLEKDKSADSLSIPYESMLSSERHSFTIPPFESLIEVRKE 277
Query: 269 ESLETNTETHSLEDDENVGMEFSVLAAEASQALSSIDKESTKGIDSDGSRWWKEMGVEQR 328
+ ++ET S E D + + S A E ++ L S+D+ ST G+ DG +WWK+ GVE+R
Sbjct: 278 AETKPSSETLSTEHD--LDLISSANAEEVARVLDSLDESSTHGVSEDGLKWWKQTGVEKR 337
Query: 329 PDGVICKWTLTRGVSADFATEWQNKYWEAADEFGYKELGSEKSGRDAYGSVWREYWRESM 388
PDGV+C+WT+ RGV+AD EWQ+KYWEA+D+FG+KELGSEKSGRDA G+VWRE+WRESM
Sbjct: 338 PDGVVCRWTMIRGVTADGVVEWQDKYWEASDDFGFKELGSEKSGRDATGNVWREFWRESM 397
Query: 389 QQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNTSGQVEKNAHKWCKIDPNTYVDPGHA 448
QE G+VH+EKTADKWGK+G G EWQEKWWE+Y+ +G+ EK AHKWC ID NT +D GHA
Sbjct: 398 SQENGVVHMEKTADKWGKSGQGDEWQEKWWEHYDATGKSEKWAHKWCSIDRNTPLDAGHA 457
Query: 449 HIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTKWGDKWDENFDSNGHGVKQGETWWEG 508
H+WHERWGEKYDGQGGS KYTDKWAE GDGW KWGDKWDENF+ + GVKQGETWWEG
Sbjct: 458 HVWHERWGEKYDGQGGSTKYTDKWAERWVGDGWDKWGDKWDENFNPSAQGVKQGETWWEG 517
Query: 509 KHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTHVQQETWYERFPHFGFYHCFNNSVQL 568
KHG+RWNR+WGEGH+GSGWVHKYGKSSSGEHWDTHV QETWYE+FPHFGF+HCF+NSVQL
Sbjct: 518 KHGDRWNRSWGEGHNGSGWVHKYGKSSSGEHWDTHVPQETWYEKFPHFGFFHCFDNSVQL 577
Query: 569 REVQKPSETS 576
R V+KPS+ S
Sbjct: 578 RAVKKPSDMS 578
BLAST of CmoCh17G001940 vs. TAIR 10
Match:
AT3G55760.3 (unknown protein; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 10 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G42430.2). )
HSP 1 Score: 635.6 bits (1638), Expect = 4.0e-182
Identity = 316/550 (57.45%), Postives = 398/550 (72.36%), Query Frame = 0
Query: 29 PLEREVEIRRAR-GTRRLRIRVSDGEESYLGMWKNAVERQRKANEFRKVVENTVGNDNRN 88
P+ +R +R G R LR+ ++G ESYL MWKNAV+R++K F K+ EN V D
Sbjct: 38 PVTSRRSLRGSRTGVRILRVS-NEGRESYLDMWKNAVDREKKEKAFEKIAENVVAVDGEK 97
Query: 89 DGDRSDDQLEEKSEEFSKILQVPTEERDKIQRMQVIHRATAAIAAARALVGETRIVADSN 148
+ LE+KS+EF KIL+V EERD+IQRMQV+ RA AAI+AARA++
Sbjct: 98 E---KGGDLEKKSDEFQKILEVSVEERDRIQRMQVVDRAAAAISAARAILASNNSGDGKE 157
Query: 149 TSLNLNSRNDGGLLDR-EEASSEFQSENTLLPKSETSPTATPGPDFWSWTPPPDNDGNGN 208
N ++ + + + A S +P+SETS T TPGPDFWSWTPP G+
Sbjct: 158 GFPNEDNTVTSEVTETPKNAKLGMWSRTVYVPRSETSGTETPGPDFWSWTPP---QGSEI 217
Query: 209 AFSELRPIEKSQVYPMLSNFVKEKEPPVGFLSIPFQSEL-PESVKPLLPPFQSLMDIEKL 268
+ +L+ +EK +P L N V EK+ LSIP++S L E +PPF+SL+++ K
Sbjct: 218 SSVDLQAVEKPAEFPTLPNPVLEKDKSADSLSIPYESMLSSERHSFTIPPFESLIEVRKE 277
Query: 269 ESLETNTETHSLEDDENVGMEFSVLAAEASQALSSIDKESTKGIDSDGSRWWKEMGVEQR 328
+ ++ET S E D + + S A E ++ L S+D+ ST G+ DG +WWK+ GVE+R
Sbjct: 278 AETKPSSETLSTEHD--LDLISSANAEEVARVLDSLDESSTHGVSEDGLKWWKQTGVEKR 337
Query: 329 PDGVICKWTLTRGVSADFATEWQNKYWEAADEFGYKELGSEKSGRDAYGSVWREYWRESM 388
PDGV+C+WT+ RGV+AD EWQ+KYWEA+D+FG+KELGSEKSGRDA G+VWRE+WRESM
Sbjct: 338 PDGVVCRWTMIRGVTADGVVEWQDKYWEASDDFGFKELGSEKSGRDATGNVWREFWRESM 397
Query: 389 QQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNTSGQVEKNAHKWCKIDPNTYVDPGHA 448
QE G+VH+EKTADKWGK+G G EWQEKWWE+Y+ +G+ EK AHKWC ID NT +D GHA
Sbjct: 398 SQENGVVHMEKTADKWGKSGQGDEWQEKWWEHYDATGKSEKWAHKWCSIDRNTPLDAGHA 457
Query: 449 HIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTKWGDKWDENFDSNGHGVKQGETWWEG 508
H+WHERWGEKYDGQGGS KYTDKWAE GDGW KWGDKWDENF+ + GVKQGETWWEG
Sbjct: 458 HVWHERWGEKYDGQGGSTKYTDKWAERWVGDGWDKWGDKWDENFNPSAQGVKQGETWWEG 517
Query: 509 KHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTHVQQETWYERFPHFGFYHCFNNSVQL 568
KHG+RWNR+WGEGH+GSGWVHKYGKSSSGEHWDTHV QETWYE+FPHFGF+HCF+NSVQL
Sbjct: 518 KHGDRWNRSWGEGHNGSGWVHKYGKSSSGEHWDTHVPQETWYEKFPHFGFFHCFDNSVQL 577
Query: 569 REVQKPSETS 576
R V+KPS+ S
Sbjct: 578 RAVKKPSDMS 578
BLAST of CmoCh17G001940 vs. TAIR 10
Match:
AT1G42430.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G55760.3); Has 186 Blast hits to 143 proteins in 47 species: Archae - 0; Bacteria - 23; Metazoa - 14; Fungi - 6; Plants - 87; Viruses - 0; Other Eukaryotes - 56 (source: NCBI BLink). )
HSP 1 Score: 236.9 bits (603), Expect = 4.1e-62
Identity = 120/265 (45.28%), Postives = 167/265 (63.02%), Query Frame = 0
Query: 308 GIDSDGSRWWKEMGVEQRPDGVICKWTLTRGVSADFATEWQNKYWEAADEFGYKELGSEK 367
G + DGS W++E G + +G C+W+ G S D ++EW +WE +D GYKELG EK
Sbjct: 145 GTNEDGSSWFRESGHDLGDNGYRCRWSRMGGRSHDGSSEWTETWWEKSDWTGYKELGVEK 204
Query: 368 SGRDAYGSVWREYWRESMQQEQ--GLVHLEKTADKWGKNGS-GTEWQEKWWEYYNTSGQV 427
SG+++ G W E W+E + Q++ L +E++A K K+G+ W EKWWE Y+ G
Sbjct: 205 SGKNSEGDSWWETWQEVLHQDEWSNLARIERSAQKQAKSGTENAGWYEKWWEKYDAKGWT 264
Query: 428 EKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTKWGDK 487
EK AHK+ +++ + W E+WGE YDG+G +K+TDKWAE G TKWGDK
Sbjct: 265 EKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWTDKWAETELG---TKWGDK 324
Query: 488 WDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTHVQQE 547
W+E F S G G +QGETW + +RW+RTWGE H G+G VHKYGKS++GE WD V +E
Sbjct: 325 WEEKFFS-GIGSRQGETWHVSPNSDRWSRTWGEEHFGNGKVHKYGKSTTGESWDIVVDEE 384
Query: 548 TWYERFPHFGFYHCFNNSVQLREVQ 570
T+YE PH+G+ +S QL +Q
Sbjct: 385 TYYEAEPHYGWADVVGDSTQLLSIQ 396
BLAST of CmoCh17G001940 vs. TAIR 10
Match:
AT1G42430.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G55760.3). )
HSP 1 Score: 236.9 bits (603), Expect = 4.1e-62
Identity = 120/265 (45.28%), Postives = 167/265 (63.02%), Query Frame = 0
Query: 308 GIDSDGSRWWKEMGVEQRPDGVICKWTLTRGVSADFATEWQNKYWEAADEFGYKELGSEK 367
G + DGS W++E G + +G C+W+ G S D ++EW +WE +D GYKELG EK
Sbjct: 128 GTNEDGSSWFRESGHDLGDNGYRCRWSRMGGRSHDGSSEWTETWWEKSDWTGYKELGVEK 187
Query: 368 SGRDAYGSVWREYWRESMQQEQ--GLVHLEKTADKWGKNGS-GTEWQEKWWEYYNTSGQV 427
SG+++ G W E W+E + Q++ L +E++A K K+G+ W EKWWE Y+ G
Sbjct: 188 SGKNSEGDSWWETWQEVLHQDEWSNLARIERSAQKQAKSGTENAGWYEKWWEKYDAKGWT 247
Query: 428 EKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTKWGDK 487
EK AHK+ +++ + W E+WGE YDG+G +K+TDKWAE G TKWGDK
Sbjct: 248 EKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWTDKWAETELG---TKWGDK 307
Query: 488 WDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTHVQQE 547
W+E F S G G +QGETW + +RW+RTWGE H G+G VHKYGKS++GE WD V +E
Sbjct: 308 WEEKFFS-GIGSRQGETWHVSPNSDRWSRTWGEEHFGNGKVHKYGKSTTGESWDIVVDEE 367
Query: 548 TWYERFPHFGFYHCFNNSVQLREVQ 570
T+YE PH+G+ +S QL +Q
Sbjct: 368 TYYEAEPHYGWADVVGDSTQLLSIQ 379
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1H328 | 0.0e+00 | 100.00 | uncharacterized protein LOC111460038 OS=Cucurbita moschata OX=3662 GN=LOC1114600... | [more] |
A0A6J1KWK6 | 0.0e+00 | 97.74 | uncharacterized protein LOC111499367 OS=Cucurbita maxima OX=3661 GN=LOC111499367... | [more] |
A0A5A7UQT1 | 1.5e-293 | 85.44 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |
A0A1S3BMD1 | 1.5e-293 | 85.44 | uncharacterized protein LOC103491223 OS=Cucumis melo OX=3656 GN=LOC103491223 PE=... | [more] |
A0A0A0LNS6 | 9.6e-285 | 83.30 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G215500 PE=4 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
AT3G55760.1 | 4.0e-182 | 57.45 | unknown protein; LOCATED IN: chloroplast stroma, chloroplast; EXPRESSED IN: 16 p... | [more] |
AT3G55760.2 | 4.0e-182 | 57.45 | unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 16 plant structures; EXP... | [more] |
AT3G55760.3 | 4.0e-182 | 57.45 | unknown protein; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 10 growth ... | [more] |
AT1G42430.1 | 4.1e-62 | 45.28 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |
AT1G42430.2 | 4.1e-62 | 45.28 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |