Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CAGAAGTGGAAGTGATGAAATGGGCATAACCCATCTTCCACGGCTCTTCATTTAAGTGCGCAATTAGTGCCATCAATTCCCTTTTTCAATTTCCTTGTTCCCCCATTTCTCTGTTTTTGGCGCTTCTTCTCCATTTCTTACCTTTTCCCCTTCTACTGTCTTGTCATCTTCTTCCATTTTCCCTTTCGATTTATAAATTTTGGTGAAGAAAACGGATAGATGGATGGTTGATGGATGAGTAATAAACCCCCCTCTCTATATCCACATTTACTTCGAAGACTGTGGCTGTCATTTGTAGTTGTTGCGCCAGTTAAATCAGACCGAATCGCCCCCAATTTCTCTCTCTCTCTCTCTCTCTCTCTCTCGAACCCTAGATTTTCTTTTTGAATTTGGGTTCTGGGTTTGCGGTGGAGAAGCTCCACGAGGTGGGATTTTGAGCTTTTGTGGTTCTGGGATCGATTTGCTTTGAGTTTGAGGTGTAATGGGGGGAGATAATTTGGACCCGATTGAGGGAGGTGGCAATGGCGGTTGTTAACCCACTTCTCATGCATTGCCTCCATGGGAAAGAGCGAGGAAGAACAGCCGCTGCCGGTTGGAGTGAGCTCCTCCGAGCTTTCTGATTGGACTGTGCAGAGTAGATGTGGCGGCGGTGGGTGCTTTGCGATTCGTAGACTGATTGCTGTGAGATGTGTCTTCTTCCTGTTACTGTCGGCGGCTGTGTTTCTTTCTGCTATTTTTTGGCTGCCGCCGTTCCTTTCATACGGAGATTGGCCGGATCAGGCGGCTGATTCTACTTATAGAGGTAGGATCCTTCGTCAATTTTCTTTTTCTTTTTCTTTCATGATTTTCCTGTTCAGGCTTCCTGGGTTTTGAGTGTTTGTAGACCTAGAATCATGTTGGTGGGTGTTGGCTGGTGATATCAAATTATCATCCCCTTTTTGTTGGCTGGTGATATCAACTTTGATTGTTTTGAAGAGTAGAATGTTATGGATTTTTGAATGTTTGTAGCTGTTCTTGTGTTTAGATAGCTCCTCGTGGAAGAATTATAGTCTTGTTTTGTTGGATTCGATGGGATTGCGAGTAAGGAAACTCACTCTTGATTGATTCTGGAAGATTTGATCGATTAAATGAGAAGTTTATGATGAAAACTGACTCACTTTTCCATGTATCCATTTAGTTTCCCTGGATTCATGTGAATTCTGTTCATCAAGTTGCTGTTCATATGCATTAGAACTTTCCATTTTGATAATGTTTATAGATTTGTGTTATATTGAAGAACCCTTTTGGCTATCCGAAGGCTTTATGAGCTTGATTGTTCGTTTTTTTCCTCTCTCGGTATACAAACTCGACATGTTTACGCCCATTGATGCATGAAATGTACTGCCTAAAAGCTCATAGTGCTCTGGTTTTAAACGGATCGCTCTACTTTTGTACATTGGAATTTGAAAGAGCGATACATATAAACGATATGATCGTATAACTTAACTGAAATTAATTGCGTCGTAATCAGATGGGTTTAGAAGTTGGGTGATATCTTATAAACTCAATGAGTACTCCATTTGCTTGTTGGGTGTGTCTAGATTTGTTTGATTGTGGTGAACTTTGTGTTAGAAATTGTGAAGTTTTATTGGATTTTATCGATATCAGTTCGTCTTGGTTGAATTATTGTAACGCTATCATATCATGCTAGCCATCCCTTTCGTATATTATATAATCGTTTCGCTGAAGTTATATGATTCTATCCCGTTTCCAGATCATGAAATCGTAGCGTGTTTTCGTGCTCGGAAGCCAGTTCCTTTTCTGAAAAACCATATTTTTGAGCTTGAAGACAACATTTTTGGAGAAATTCCTGTTCCTTTTGTCAAGGTATTCAATCCAGTTTCTCGATTCATCTAGCTTGACTAGTCACGTTTTCGGTCGTTTGATCTGATACTCTGTGAATCAGGTGGCCGTCCTCTCGCTACAATCATTAGGCGGATCGAACGTAACAGATATCATTTTCTCCGTAGATCCTGATGCCAAGTATTCAAAAATTCCACCAACTTCTCAAAGTTTAATCAAGGAAACGTTTGAAACATTGGTTATAAACGACCCTCCTCTCAGATTGAACGCATCGTTATTCGGCAATACTTCGTTGTTCGAGGTGTTGAAATTTCCTGGTGGGATAACTATTATTCCTCCTCAGAGTGCATTTCTTCTGCAGACAGCACAGATCTATTTCAATTTTACGTTGAATTATTCTATCTATCAAATTCAAGTGAATTTCGACGATCTTACCAGCCAGCTGAGGTCGGGATTACGTCTATCTCGTTATGAGGTTTGGCTGTCTTCCTTACTATCTTCTGATGGATATTGTTTTTAGCTGTATAGATTTCACTGACAGCATTTTGTATGATGCAGAATTTATATGTTAGCCTATCGAACGAACGAGGTTCGACAATGCAGGCTCCTACTATTGTTCAGTCGTCTGTTCTGATGGCTATTGGGACGAATTCATCGAATCAAAGACTCAAACAGTTGGCTCAAACCATCACGAATTCTCATTCGGGAAATCTTGGCCTGAACAACACTGTTTTTGGCAAGGTCAAGCAGGTGCGTCTTTCGTCGGTCCTAAACCACTCTCTTAGTGGCGGTCAAGCACGGTCACCTTCACCTGCTCCTCTGCCTCATTCTCACCCTCACCACCACCACCACCACCACCACCACCACCACCACCACNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCCACCACCATCAACACCACCATCACCATCACCATCACCATCACCACCACCACCACCACCACCACCACCACCACCAGGACGCTACATATTCACCGAGTCCTGGAACAGAGGAGCACAAATATGCACCGAAGAATGGGATCTCATCAGCTCCTGAAGCTGGTTCATCCCCAGTGGAAAGTCCAGCTTCAAAGAAACGAAACTATGAAGCAACTCCGCCTGGTTTTCGATACGGATATAAAGGGTTGTCAGCAAAAGTCAGAAAACGATCTCATTTAGGCTCTATTCTGTCTCCAAGCAGTCCTCCATCGTCGCCATACTTACGAGTAGGCCTACCAGCACCGGTCACGGTTTCTATATCTGCTTCAAGTCCACTGCAAGGGGTAGCTCTATCTAATGTACAGCCTCCAGAAAAAGGCGACAGAAGTGCCCCTTCAGTCTTGCCACCACAATTTTCTTGTGAGTATGCCATCCTTCATTCTCACATTGTTTCATAGCTCCATCAAGAAACTATTAGTAATGGTTCGAATATACCGACGCGTAATGGAACGGACTCTGATTAGAACCTACTAGAGGAATTTCTGCATCCGTTTTTAACAGATTCAGATAGGCGAAGAAGCCAGACTGGTTAGGGAGTTTGTAGTTTTTGAAAATTAAGTTTATAAGCCAGACTGATCGTGTTGTAATCCGATATGTTGATTTGCAGTTTCTGTAGGCGTTCGTGTTCATACAATTCGATGGACACTCGGGCTGTTTCTTGTTGTATGGCATGTATAACCAAGGAGATAGAACCTACATGCGTACTTCTGGGTAACAACAGGACTCGTAATCGATATCAGAGTTGTGATAGCGATAGCGAGACCGACGCAAAGGCGTGCTCCTGCTAGAGTTGATATTAATGTAAATATGAGATGAAGCAAGTTATTAGGAGATGCATTTTTCCAGGTCAAAGTCACAGAGGTGGCAGGCCTTGTGTTGTTTTTCTTCTGCAGAAAATGTAAAGTAGAGAAGAAATCAGCAAATGGATCTTGTTCTCAACTTCTTAATCAACCAATCACAATTTTTCACCCCCGTTTTTTTAATTTTAATATCCCCGTCTCATATTGTCGGTCTCTTTGATCCTTTGGCAACGTTCATCTGTGTCAACGTTCATCCTGT
mRNA sequence
CAGAAGTGGAAGTGATGAAATGGGCATAACCCATCTTCCACGGCTCTTCATTTAAGTGCGCAATTAGTGCCATCAATTCCCTTTTTCAATTTCCTTGTTCCCCCATTTCTCTGTTTTTGGCGCTTCTTCTCCATTTCTTACCTTTTCCCCTTCTACTGTCTTGTCATCTTCTTCCATTTTCCCTTTCGATTTATAAATTTTGGTGAAGAAAACGGATAGATGGATGGTTGATGGATGAGTAATAAACCCCCCTCTCTATATCCACATTTACTTCGAAGACTGTGGCTGTCATTTGTAGTTGTTGCGCCAGTTAAATCAGACCGAATCGCCCCCAATTTCTCTCTCTCTCTCTCTCTCTCTCTCTCGAACCCTAGATTTTCTTTTTGAATTTGGGTTCTGGGTTTGCGGTGGAGAAGCTCCACGAGGTGGGATTTTGAGCTTTTGTGGTTCTGGGATCGATTTGCTTTGAGTTTGAGGTGTAATGGGGGGAGATAATTTGGACCCGATTGAGGGAGGTGGCAATGGCGGTTGTTAACCCACTTCTCATGCATTGCCTCCATGGGAAAGAGCGAGGAAGAACAGCCGCTGCCGGTTGGAGTGAGCTCCTCCGAGCTTTCTGATTGGACTGTGCAGAGTAGATGTGGCGGCGGTGGGTGCTTTGCGATTCGTAGACTGATTGCTGTGAGATGTGTCTTCTTCCTGTTACTGTCGGCGGCTGTGTTTCTTTCTGCTATTTTTTGGCTGCCGCCGTTCCTTTCATACGGAGATTGGCCGGATCAGGCGGCTGATTCTACTTATAGAGATCATGAAATCGTAGCGTGTTTTCGTGCTCGGAAGCCAGTTCCTTTTCTGAAAAACCATATTTTTGAGCTTGAAGACAACATTTTTGGAGAAATTCCTGTTCCTTTTGTCAAGGTGGCCGTCCTCTCGCTACAATCATTAGGCGGATCGAACGTAACAGATATCATTTTCTCCGTAGATCCTGATGCCAAGTATTCAAAAATTCCACCAACTTCTCAAAGTTTAATCAAGGAAACGTTTGAAACATTGGTTATAAACGACCCTCCTCTCAGATTGAACGCATCGTTATTCGGCAATACTTCGTTGTTCGAGGTGTTGAAATTTCCTGGTGGGATAACTATTATTCCTCCTCAGAGTGCATTTCTTCTGCAGACAGCACAGATCTATTTCAATTTTACGTTGAATTATTCTATCTATCAAATTCAAGTGAATTTCGACGATCTTACCAGCCAGCTGAGGTCGGGATTACGTCTATCTCGTTATGAGAATTTATATGTTAGCCTATCGAACGAACGAGGTTCGACAATGCAGGCTCCTACTATTGTTCAGTCGTCTGTTCTGATGGCTATTGGGACGAATTCATCGAATCAAAGACTCAAACAGTTGGCTCAAACCATCACGAATTCTCATTCGGGAAATCTTGGCCTGAACAACACTGTTTTTGGCAAGGTCAAGCAGGACGCTACATATTCACCGAGTCCTGGAACAGAGGAGCACAAATATGCACCGAAGAATGGGATCTCATCAGCTCCTGAAGCTGGTTCATCCCCAGTGGAAAGTCCAGCTTCAAAGAAACGAAACTATGAAGCAACTCCGCCTGGTTTTCGATACGGATATAAAGGGTTGTCAGCAAAAGTCAGAAAACGATCTCATTTAGGCTCTATTCTGTCTCCAAGCAGTCCTCCATCGTCGCCATACTTACGAGTAGGCCTACCAGCACCGGTCACGGTTTCTATATCTGCTTCAAGTCCACTGCAAGGGGTAGCTCTATCTAATGTACAGCCTCCAGAAAAAGGCGACAGAAGTGCCCCTTCAGTCTTGCCACCACAATTTTCTTTTTCTGTAGGCGTTCGTGTTCATACAATTCGATGGACACTCGGGCTGTTTCTTGTTGTATGGCATGTATAACCAAGGAGATAGAACCTACATGCGTACTTCTGGGTAACAACAGGACTCGTAATCGATATCAGAGTTGTGATAGCGATAGCGAGACCGACGCAAAGGCGTGCTCCTGCTAGAGTTGATATTAATGTAAATATGAGATGAAGCAAGTTATTAGGAGATGCATTTTTCCAGGTCAAAGTCACAGAGGTGGCAGGCCTTGTGTTGTTTTTCTTCTGCAGAAAATGTAAAGTAGAGAAGAAATCAGCAAATGGATCTTGTTCTCAACTTCTTAATCAACCAATCACAATTTTTCACCCCCGTTTTTTTAATTTTAATATCCCCGTCTCATATTGTCGGTCTCTTTGATCCTTTGGCAACGTTCATCTGTGTCAACGTTCATCCTGT
Coding sequence (CDS)
ATGGGAAAGAGCGAGGAAGAACAGCCGCTGCCGGTTGGAGTGAGCTCCTCCGAGCTTTCTGATTGGACTGTGCAGAGTAGATGTGGCGGCGGTGGGTGCTTTGCGATTCGTAGACTGATTGCTGTGAGATGTGTCTTCTTCCTGTTACTGTCGGCGGCTGTGTTTCTTTCTGCTATTTTTTGGCTGCCGCCGTTCCTTTCATACGGAGATTGGCCGGATCAGGCGGCTGATTCTACTTATAGAGATCATGAAATCGTAGCGTGTTTTCGTGCTCGGAAGCCAGTTCCTTTTCTGAAAAACCATATTTTTGAGCTTGAAGACAACATTTTTGGAGAAATTCCTGTTCCTTTTGTCAAGGTGGCCGTCCTCTCGCTACAATCATTAGGCGGATCGAACGTAACAGATATCATTTTCTCCGTAGATCCTGATGCCAAGTATTCAAAAATTCCACCAACTTCTCAAAGTTTAATCAAGGAAACGTTTGAAACATTGGTTATAAACGACCCTCCTCTCAGATTGAACGCATCGTTATTCGGCAATACTTCGTTGTTCGAGGTGTTGAAATTTCCTGGTGGGATAACTATTATTCCTCCTCAGAGTGCATTTCTTCTGCAGACAGCACAGATCTATTTCAATTTTACGTTGAATTATTCTATCTATCAAATTCAAGTGAATTTCGACGATCTTACCAGCCAGCTGAGGTCGGGATTACGTCTATCTCGTTATGAGAATTTATATGTTAGCCTATCGAACGAACGAGGTTCGACAATGCAGGCTCCTACTATTGTTCAGTCGTCTGTTCTGATGGCTATTGGGACGAATTCATCGAATCAAAGACTCAAACAGTTGGCTCAAACCATCACGAATTCTCATTCGGGAAATCTTGGCCTGAACAACACTGTTTTTGGCAAGGTCAAGCAGGACGCTACATATTCACCGAGTCCTGGAACAGAGGAGCACAAATATGCACCGAAGAATGGGATCTCATCAGCTCCTGAAGCTGGTTCATCCCCAGTGGAAAGTCCAGCTTCAAAGAAACGAAACTATGAAGCAACTCCGCCTGGTTTTCGATACGGATATAAAGGGTTGTCAGCAAAAGTCAGAAAACGATCTCATTTAGGCTCTATTCTGTCTCCAAGCAGTCCTCCATCGTCGCCATACTTACGAGTAGGCCTACCAGCACCGGTCACGGTTTCTATATCTGCTTCAAGTCCACTGCAAGGGGTAGCTCTATCTAATGTACAGCCTCCAGAAAAAGGCGACAGAAGTGCCCCTTCAGTCTTGCCACCACAATTTTCTTTTTCTGTAGGCGTTCGTGTTCATACAATTCGATGGACACTCGGGCTGTTTCTTGTTGTATGGCATGTATAA
Protein sequence
MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNTVFGKVKQDATYSPSPGTEEHKYAPKNGISSAPEAGSSPVESPASKKRNYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQPPEKGDRSAPSVLPPQFSFSVGVRVHTIRWTLGLFLVVWHV
Homology
BLAST of CmoCh14G005120 vs. ExPASy TrEMBL
Match:
A0A6J1F409 (uncharacterized protein LOC111441963 OS=Cucurbita moschata OX=3662 GN=LOC111441963 PE=4 SV=1)
HSP 1 Score: 863.6 bits (2230), Expect = 3.7e-247
Identity = 456/521 (87.52%), Postives = 456/521 (87.52%), Query Frame = 0
Query: 1 MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIF 60
MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIF
Sbjct: 1 MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIF 60
Query: 61 WLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKV 120
WLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKV
Sbjct: 61 WLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKV 120
Query: 121 AVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGN 180
AVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGN
Sbjct: 121 AVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGN 180
Query: 181 TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLS 240
TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLS
Sbjct: 181 TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLS 240
Query: 241 RYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT 300
RYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT
Sbjct: 241 RYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT 300
Query: 301 VFGKVK------------------------------------------------------ 360
VFGKVK
Sbjct: 301 VFGKVKQVRLSSVLNHSLSGGQARSPSPAPLPHSHPHHHHHHHHHHHHHHQHHHHHHHHH 360
Query: 361 -----------QDATYSPSPGTEEHKYAPKNGISSAPEAGSSPVESPASKKRNYEATPPG 420
QDATYSPSPGTEEHKYAPKNGISSAPEAGSSPVESPASKKRNYEATPPG
Sbjct: 361 HHHHHHHHHHHQDATYSPSPGTEEHKYAPKNGISSAPEAGSSPVESPASKKRNYEATPPG 420
Query: 421 FRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQ 457
FRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQ
Sbjct: 421 FRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQ 480
BLAST of CmoCh14G005120 vs. ExPASy TrEMBL
Match:
A0A6J1J074 (uncharacterized protein LOC111482272 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111482272 PE=4 SV=1)
HSP 1 Score: 845.9 bits (2184), Expect = 7.9e-242
Identity = 449/529 (84.88%), Postives = 451/529 (85.26%), Query Frame = 0
Query: 1 MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIF 60
MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIF
Sbjct: 1 MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIF 60
Query: 61 WLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKV 120
WLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKV
Sbjct: 61 WLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKV 120
Query: 121 AVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGN 180
AVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGN
Sbjct: 121 AVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGN 180
Query: 181 TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLS 240
TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNF+DLTSQLRSGLRLS
Sbjct: 181 TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFNDLTSQLRSGLRLS 240
Query: 241 RYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT 300
RYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT
Sbjct: 241 RYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT 300
Query: 301 VFGKVK------------------------------------------------------ 360
VFGKVK
Sbjct: 301 VFGKVKQVRLSSVLNHSLSGGQARSPSPAPLPHSHPHHPHHHHHHHHHHHQHHHQHHHHH 360
Query: 361 -------------------QDATYSPSPGTEEHKYAPKNGISSAPEAGSSPVESPASKKR 420
QDA YSPSPGTEEHK+APKNGISSAPEAGSSPVESPASKKR
Sbjct: 361 HHHHHHHHHHHHHHHQHHHQDAAYSPSPGTEEHKHAPKNGISSAPEAGSSPVESPASKKR 420
Query: 421 NYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQ 457
NYEATPPGFRYGYKGLS KVRKRSHLGSI SPSSPPSSPYLRVGLPAPVTVSISASSPL
Sbjct: 421 NYEATPPGFRYGYKGLSTKVRKRSHLGSIPSPSSPPSSPYLRVGLPAPVTVSISASSPLP 480
BLAST of CmoCh14G005120 vs. ExPASy TrEMBL
Match:
A0A6J1J390 (uncharacterized protein LOC111482272 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111482272 PE=4 SV=1)
HSP 1 Score: 835.1 bits (2156), Expect = 1.4e-238
Identity = 448/551 (81.31%), Postives = 450/551 (81.67%), Query Frame = 0
Query: 1 MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIF 60
MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIF
Sbjct: 1 MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIF 60
Query: 61 WLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKV 120
WLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKV
Sbjct: 61 WLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKV 120
Query: 121 AVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGN 180
AVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGN
Sbjct: 121 AVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGN 180
Query: 181 TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLS 240
TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNF+DLTSQLRSGLRLS
Sbjct: 181 TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFNDLTSQLRSGLRLS 240
Query: 241 RYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT 300
RYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT
Sbjct: 241 RYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT 300
Query: 301 VFGKVKQ----------------------------------------------------- 360
VFGKVKQ
Sbjct: 301 VFGKVKQVRLSSVLNHSLSGGQARSPSPAPLPHSHPHHPHHHHHHHHHHHQHHHQHHHHH 360
Query: 361 ------------------------------------------DATYSPSPGTEEHKYAPK 420
A YSPSPGTEEHK+APK
Sbjct: 361 HHHHHHHHHHHHHHHQHHHQDAAYSPSPGTEEHKHAPKNGISSAAYSPSPGTEEHKHAPK 420
Query: 421 NGISSAPEAGSSPVESPASKKRNYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSS 457
NGISSAPEAGSSPVESPASKKRNYEATPPGFRYGYKGLS KVRKRSHLGSI SPSSPPSS
Sbjct: 421 NGISSAPEAGSSPVESPASKKRNYEATPPGFRYGYKGLSTKVRKRSHLGSIPSPSSPPSS 480
BLAST of CmoCh14G005120 vs. ExPASy TrEMBL
Match:
A0A0A0LHD1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G819880 PE=4 SV=1)
HSP 1 Score: 719.5 bits (1856), Expect = 8.6e-204
Identity = 385/511 (75.34%), Postives = 413/511 (80.82%), Query Frame = 0
Query: 1 MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIF 60
MGKSEEEQPLPVG SSSELSD V++RCGGGGC IRRLIAVRCVFFLLLSAAVFLSAIF
Sbjct: 1 MGKSEEEQPLPVGGSSSELSDRNVENRCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIF 60
Query: 61 WLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKV 120
WLPPFLSYG+WPD+ DS YRDH+IVA F A KPVPFL+ HIFELEDNIFGEIP+P VKV
Sbjct: 61 WLPPFLSYGNWPDRPVDSAYRDHDIVASFHASKPVPFLQKHIFELEDNIFGEIPIPSVKV 120
Query: 121 AVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGN 180
A+LSLQSLGG NVT I+F+VD DAKYSKIPPTSQSLIKETFETLVIN+PPLRLN SLFGN
Sbjct: 121 AILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGN 180
Query: 181 TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLS 240
TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDL+SQLRSGLRLS
Sbjct: 181 TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLS 240
Query: 241 RYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTN--SSNQRLKQLAQTITNSHSGNLGLN 300
YENLYVSLSNERGST+ APT+VQSSVLMAIGTN SS QRLKQLA TITNSHSGNLGLN
Sbjct: 241 PYENLYVSLSNERGSTIDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN 300
Query: 301 NTVFGKVKQ----------------------------------------------DATYS 360
NTVFGKVKQ DA YS
Sbjct: 301 NTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPLPHSHHHRHHHHHHHHHHHHHHRDAAYS 360
Query: 361 PSPGTEEHKYAPKNGISSAPEAGSSPVESPASKKRNYEATPPGFRYGYKGLSAKVRKRSH 420
PSPGTEEHK+APKNG+SSAPEAGSSP+E P S+KRNYEATPP FRYGYK K+RK +
Sbjct: 361 PSPGTEEHKHAPKNGVSSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSLTKLRKH-N 420
Query: 421 LGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQPPEKG-------DRSAP 457
LG I SPSS PSSPYLRVG PAPV+ SISASSPL GV LSNVQPP G +RS+P
Sbjct: 421 LGPIPSPSSSPSSPYLRVGQPAPVSDSISASSPLSGVVLSNVQPPNTGSGHAENFERSSP 480
BLAST of CmoCh14G005120 vs. ExPASy TrEMBL
Match:
A0A1S3B8E9 (uncharacterized protein LOC103487165 OS=Cucumis melo OX=3656 GN=LOC103487165 PE=4 SV=1)
HSP 1 Score: 713.8 bits (1841), Expect = 4.7e-202
Identity = 387/544 (71.14%), Postives = 414/544 (76.10%), Query Frame = 0
Query: 1 MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIF 60
MGKSEEEQPLPVGVSSSELSD V++RCGGGGC IR+LIAVRCVFFLLLSAAVFLSAIF
Sbjct: 1 MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIF 60
Query: 61 WLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKV 120
WLPPFLSYG+WPD+ DS YRDH+IVA F A KPVPFL+NHIFELEDNIFGEIP+P VKV
Sbjct: 61 WLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQNHIFELEDNIFGEIPIPSVKV 120
Query: 121 AVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGN 180
A+LSLQSL G NVT I+F+VD DAKYSKIPPTSQSLIKETFETLVIN+PPLRLN SLFGN
Sbjct: 121 AILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGN 180
Query: 181 TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLS 240
TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDL+SQLRSGLRLS
Sbjct: 181 TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLS 240
Query: 241 RYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTN--SSNQRLKQLAQTITNSHSGNLGLN 300
YENLYVSLSNERGSTM APT+VQSSVLMAIGTN SS QRLKQLA TITNSHSGNLGLN
Sbjct: 241 PYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN 300
Query: 301 NTVFGKVK---------------------------------------------------- 360
NTVFGKVK
Sbjct: 301 NTVFGKVKQVRLSFLNHSLGGGGNAWSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHHHH 360
Query: 361 ---------------------------QDATYSPSPGTEEHKYAPKNGISSAPEAGSSPV 420
Q A YSPSPGTEEHK+APKNG+SSAPEAGSSP+
Sbjct: 361 HHHHHHHHHHHHRHHHHHHHHHHHNHHQHAAYSPSPGTEEHKHAPKNGVSSAPEAGSSPM 420
Query: 421 ESPASKKRNYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVS 457
E P S+KRNYEATPP FRYGYK S K+RK+ HLG I SPSS P SPYLRVGLPAPV+ S
Sbjct: 421 EGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLRVGLPAPVSDS 480
BLAST of CmoCh14G005120 vs. TAIR 10
Match:
AT3G56590.2 (hydroxyproline-rich glycoprotein family protein )
HSP 1 Score: 326.2 bits (835), Expect = 4.1e-89
Identity = 217/482 (45.02%), Postives = 276/482 (57.26%), Query Frame = 0
Query: 1 MGKSE-EEQPLPVGVSSSELSDWTVQSRCGGGG-------CFAIRRLIAVRCVFFLLLSA 60
MGK+ EEQ LPV SD +R GGG C I ++RCV L SA
Sbjct: 1 MGKNTVEEQNLPV-------SDGAASARNNGGGGISTCCCCDWISSYFSLRCVLILAFSA 60
Query: 61 AVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGE 120
AVFLSA+FWLPPFL + D D D ++DH IVA F KP+ F+++++ +LE++I E
Sbjct: 61 AVFLSALFWLPPFLGFADPGDLDLDPRFKDHRIVASFDVGKPISFMEDNLMQLENDITDE 120
Query: 121 IPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLR 180
I P KV VL+L+ LG N T +IF++DP+ + SKIP +SLIK FETLV R
Sbjct: 121 ISFPMTKVVVLALERLGDLNRTMVIFAIDPEKENSKIPAEIESLIKAAFETLVQKQLSFR 180
Query: 181 LNASLFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQ 240
L SLFG FEVLKFPGGIT+IPPQ F LQ AQ+ FNFTLN+SIYQIQ NF++L SQ
Sbjct: 181 LTESLFGEPFFFEVLKFPGGITVIPPQPIFPLQKAQLLFNFTLNFSIYQIQSNFEELASQ 240
Query: 241 LRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHS 300
L+ G+ L+ YENLY++LSN RGST+ PTIV SSVL+ G++S RLKQLAQTIT+SHS
Sbjct: 241 LKKGINLASYENLYITLSNSRGSTVAPPTIVHSSVLLTFGSSS---RLKQLAQTITSSHS 300
Query: 301 GNLGLNNTVFGKVKQ-------------DATYSPSPGTEEHKY--------------APK 360
NLGLN+TVFGKVKQ +T SPSP E H+Y AP+
Sbjct: 301 KNLGLNHTVFGKVKQVRLSSILPHSPATSSTPSPSPQPETHQYPHHHPHHHHHHHELAPE 360
Query: 361 NGISSAPEAGSSPVESPASKKRNYEATP--PGFRYGYKGLSAKVRKRSHLGSILSPSSPP 420
+ S P G +P +P P P + KG SA +H + +P+
Sbjct: 361 PSL-SPPTKGFAPASAPTKHSPLPPRNPPCPYEQRRPKGNSA----LNHHTAPPTPAPHR 420
Query: 421 SSPYLRVGLPAPVT-VSISASSPLQGVALSNVQPPEKGD-------RSAPSVLPPQFSFS 438
S P+ PAP +I SSPL V +++ PP K +PS P S S
Sbjct: 421 SQPHPPAPNPAPPRHHAIPVSSPLPHVVFAHIPPPSKSSPESEPTGEKSPSPAPTPSSAS 467
BLAST of CmoCh14G005120 vs. TAIR 10
Match:
AT3G56590.1 (hydroxyproline-rich glycoprotein family protein )
HSP 1 Score: 322.8 bits (826), Expect = 4.5e-88
Identity = 211/457 (46.17%), Postives = 268/457 (58.64%), Query Frame = 0
Query: 1 MGKSE-EEQPLPVGVSSSELSDWTVQSRCGGGG-------CFAIRRLIAVRCVFFLLLSA 60
MGK+ EEQ LPV SD +R GGG C I ++RCV L SA
Sbjct: 1 MGKNTVEEQNLPV-------SDGAASARNNGGGGISTCCCCDWISSYFSLRCVLILAFSA 60
Query: 61 AVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGE 120
AVFLSA+FWLPPFL + D D D ++DH IVA F KP+ F+++++ +LE++I E
Sbjct: 61 AVFLSALFWLPPFLGFADPGDLDLDPRFKDHRIVASFDVGKPISFMEDNLMQLENDITDE 120
Query: 121 IPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLR 180
I P KV VL+L+ LG N T +IF++DP+ + SKIP +SLIK FETLV R
Sbjct: 121 ISFPMTKVVVLALERLGDLNRTMVIFAIDPEKENSKIPAEIESLIKAAFETLVQKQLSFR 180
Query: 181 LNASLFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQ 240
L SLFG FEVLKFPGGIT+IPPQ F LQ AQ+ FNFTLN+SIYQIQ NF++L SQ
Sbjct: 181 LTESLFGEPFFFEVLKFPGGITVIPPQPIFPLQKAQLLFNFTLNFSIYQIQSNFEELASQ 240
Query: 241 LRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHS 300
L+ G+ L+ YENLY++LSN RGST+ PTIV SSVL+ G++S RLKQLAQTIT+SHS
Sbjct: 241 LKKGINLASYENLYITLSNSRGSTVAPPTIVHSSVLLTFGSSS---RLKQLAQTITSSHS 300
Query: 301 GNLGLNNTVFGKVKQ-------------DATYSPSPGTEEHKY--------------APK 360
NLGLN+TVFGKVKQ +T SPSP E H+Y AP+
Sbjct: 301 KNLGLNHTVFGKVKQVRLSSILPHSPATSSTPSPSPQPETHQYPHHHPHHHHHHHELAPE 360
Query: 361 NGISSAPEAGSSPVESPASKKRNYEATP--PGFRYGYKGLSAKVRKRSHLGSILSPSSPP 420
+ S P G +P +P P P + KG SA +H + +P+
Sbjct: 361 PSL-SPPTKGFAPASAPTKHSPLPPRNPPCPYEQRRPKGNSA----LNHHTAPPTPAPHR 420
BLAST of CmoCh14G005120 vs. TAIR 10
Match:
AT3G10810.1 (zinc finger (C3HC4-type RING finger) family protein )
HSP 1 Score: 298.9 bits (764), Expect = 6.9e-81
Identity = 206/501 (41.12%), Postives = 277/501 (55.29%), Query Frame = 0
Query: 1 MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIF 60
MGK+E++ L V + +RC G C I + +C+F LLLS A+FLSA+F
Sbjct: 1 MGKTEDDVSLRVAGGEATGDSTVRNARC--GCCKWISSFVGFKCLFVLLLSVALFLSALF 60
Query: 61 WLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKV 120
L PF D D D +R H IVA F + FL + +L+++IF E+ +KV
Sbjct: 61 LLLPFPM--DREDSNLDPRFRGHAIVASFSINRSASFLNENTLQLQNDIFQEMSYISIKV 120
Query: 121 AVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGN 180
+L+++ N+T ++F +DPD Y +I P S S IKE FE+++IN L+L SLFG
Sbjct: 121 TILAVEPSDELNITKVVFGIDPDTGYREILPLSLSSIKEMFESVLINQSTLQLTKSLFGE 180
Query: 181 TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLS 240
T LFEVLKFPGGIT+IPPQSAF LQ +I FNFTLNYSI+QIQ+NF+ L SQL++GL L+
Sbjct: 181 TFLFEVLKFPGGITVIPPQSAFPLQKFKIVFNFTLNYSIHQIQINFNTLASQLKNGLNLA 240
Query: 241 RYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT 300
YENLYVSLSN GST+ PT V SSVL+ +GT++S+ RLKQL TIT S S NLGLNNT
Sbjct: 241 PYENLYVSLSNSEGSTVSPPTTVHSSVLLRVGTSNSSPRLKQLTDTITGSRSKNLGLNNT 300
Query: 301 VFGKVKQ------------DATYSPSPGT---------------------EEHKYAPKNG 360
+FGKVKQ +T SPSP H + +
Sbjct: 301 IFGKVKQVRLSSFLPNSSDSSTKSPSPSPSPHSKHHHHHHHHHHHHHHHHHNHHHHHHHN 360
Query: 361 ISSAPEAGSSPVESPA---SKKRNYEATP---PGFRYGYKGLSAKVRKRSHLGSILSPSS 420
+S SPV SPA S+KR A P PG R +K KR S +P+
Sbjct: 361 LSPKMAPEVSPVASPAPHRSRKRAPSAPPPCNPGNRVHFK------EKRVQFSSTPAPAP 420
Query: 421 PPSSPYLRVGLPAPVTVS----ISASSPLQGVALSN-VQPP--EKGDRSAPSVL--PPQF 454
+P+ ++ PAP++ + + S+PL V ++ QPP E + A V PQ
Sbjct: 421 SAGAPHHQLHSPAPISAAKSHIVPISAPLPHVVFAHAAQPPITEPREPHANEVAHPQPQS 480
BLAST of CmoCh14G005120 vs. TAIR 10
Match:
AT1G10790.1 (BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2); Has 78 Blast hits to 78 proteins in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 78; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 154.1 bits (388), Expect = 2.8e-37
Identity = 114/315 (36.19%), Postives = 164/315 (52.06%), Query Frame = 0
Query: 1 MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGC-FAIRRLIAVRCVFFLLLSAAVFLSAI 60
M K +E L + + +L + R G C A RL+ +RC+ L+LS A+ LSAI
Sbjct: 1 MKKHSKENALALQQETLDLENPESSPRSSGRSCSSAFSRLVGLRCLIVLVLSCAILLSAI 60
Query: 61 FWLPPFLSYGDWPDQAADSTYR-DHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVP-F 120
FWL P S ++ AD T + + + A FR +KPV + H ++E +I I +
Sbjct: 61 FWLFPRRSVSEF---KADGTVKLNASVQASFRLQKPVSEVVRHKGKIEHDILRSIGLSNN 120
Query: 121 VKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASL 180
KV VLSL G SN TD+ F+V P +I S SL++ +F L L+L S
Sbjct: 121 SKVTVLSLNQSGASNYTDVEFAVLPVPPDHEISKHSLSLLRSSFVKLFAKRSKLKLTTSG 180
Query: 181 FGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGL 240
FG + F+VLKFPGGIT+ P + A + A + F+ T+ SI +Q D L L
Sbjct: 181 FGKPTSFQVLKFPGGITVDPLEPAPVSGVALVLFSVTIKTSISTVQDRLDLLNGLFEHML 240
Query: 241 RLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGL 300
L YE+++ L+N++GST+ P Q V + +QRL Q I S + NLGL
Sbjct: 241 SLEPYESVHFQLTNKQGSTISPPLTFQVYVAFTM-RKYLHQRLNHFTQIIQTSRAKNLGL 300
Query: 301 NNTVFGKVKQDATYS 313
+ VFG+VK D T+S
Sbjct: 301 DEAVFGEVK-DITFS 310
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1F409 | 3.7e-247 | 87.52 | uncharacterized protein LOC111441963 OS=Cucurbita moschata OX=3662 GN=LOC1114419... | [more] |
A0A6J1J074 | 7.9e-242 | 84.88 | uncharacterized protein LOC111482272 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1J390 | 1.4e-238 | 81.31 | uncharacterized protein LOC111482272 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A0A0LHD1 | 8.6e-204 | 75.34 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G819880 PE=4 SV=1 | [more] |
A0A1S3B8E9 | 4.7e-202 | 71.14 | uncharacterized protein LOC103487165 OS=Cucumis melo OX=3656 GN=LOC103487165 PE=... | [more] |
Match Name | E-value | Identity | Description | |
AT3G56590.2 | 4.1e-89 | 45.02 | hydroxyproline-rich glycoprotein family protein | [more] |
AT3G56590.1 | 4.5e-88 | 46.17 | hydroxyproline-rich glycoprotein family protein | [more] |
AT3G10810.1 | 6.9e-81 | 41.12 | zinc finger (C3HC4-type RING finger) family protein | [more] |
AT1G10790.1 | 2.8e-37 | 36.19 | BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein fam... | [more] |