CmoCh01G011620.1 (mRNA) Cucurbita moschata (Rifu)

Name: CmoCh01G011620.1
Type: mRNA
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: Extracellular ligand-gated ion channel
Location: Cmo_Chr01 : 9491669 .. 9494323 (-)
Sequence length: 1901
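
The location and sequence length above can be related with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the listed sequence length refers to the spliced mRNA; the function name `parse_location` is hypothetical and only illustrates the calculation.

```python
import re

def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its parts."""
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Cmo_Chr01 : 9491669 .. 9494323 (-)")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
genomic_span = end - start + 1

# Assumption: the record's "Sequence length" is the spliced mRNA length.
mrna_length = 1901
intron_total = genomic_span - mrna_length

print(seqid, strand, genomic_span, intron_total)
```

Under those assumptions the difference between the genomic span and the mRNA length is the total intron length removed by splicing.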
The following sequences are available for this feature:

Gene sequence (with intron)

GCCACGTGTCATTTTCTTGCCTGTGATCCTTCACTTTGTTTATGTTGTCCACATACGCTCCTCCATTTGTATCTTTTAAGAAAATAAAAATGAAAATGAAAATAAAAATAAAATAAAATAAAATAAAATAAAAATAAAAAACCCCTAAACCCAATTGTTGCGAAGTTTTTTCGTTCTTCTCTGCAAATCCCTTCTCCTTCAACCTTCAACAAATCAAACCTATAAAAGGTGAAGTGAAAGAAATTCCAATTTCCTTATCATCATCATCTTCAATCAAGCAATCCACAATCGGAATCCTGTTCGTCCCTATCAATTTGTACAAAAAGCGAAGAAATCAAGCAGGGGAGGGAGGAACAATCAACAATCCTGAGTTTCGCTCGCAATCGCCATGGGCGACACCAGCAGAGAGGCGTTGATTAGCAGAAAGGCGAAGGTGTTCAATCGCTCAGCCTCTCACGCTCAGGACGAATTGCATAGCTTCAGATCGTATTTGCGATGGATGTGCGTGGATCAATCGGACATTTGGAGCGCTGGATTGTCGTGGTCGGTGTTCTTCCTCTTCGCTATCGTCGTTCCAGCGACCTCGCATTTCCATCTCGCCTGTTCTTCCTGTGATAGCCATCACGCCAGGCCGTTCGATCGCGTCGTTCAATTGTCGCTCAGCAGTGTGGCGACGGTTTCCTTCCTCTGCCTTTCGAATTTCATCAGGAGGTACGGACTTAGGAGATTCTTGTTCTTCGATAGGCTCTGCGATGAAAGCGAAACTGTGAGAAGAGGATACACCAACAAGTTCAATGTAAGTTATATTCCAACTCGCAACTCATCTATCTATCTCCTTCTTCGCATTCCACTGATTGGATCTGGATTCTGATTCCTTGATTGTTCAATCGTTTTTGTTCTTGAATTTGAATTCAAGGTAAATTGTTAAGAGAAATTTCCATGGAAGTTAAGAGAGAGCTTGAATTTGTGAAAGATTTAGGGATTAATATCTGGCTTTTGGACAAATCTGCTGGTTTAATTCGGTTTTCTGTCCATTCAATTGCCTGTCAACTTCTTTGGAATTTTTTTGCCTTTGATTGCGGTATTTTTCTGTTTCTGGTAAGAACTCATTCAGCGTTTAGACTCGTTTTAATATGAACTTCTCCGAATTTGTTTTATTTTTTCAAATTCTATTTTTAGAAATAATATCCTTTTTTTTTTTAATTAAAGAACAATTTAAAAAATTATTTATAGGGACAAATAATGCTTAGAAATGAACTTAAACAAAAAGAAAAAGAAAAAGACAATTTAATTTAATTTAATTATTTTTATTTTTATTTATTTATTGCAGAGATCACTGAGAGTACTATCAGCATTTGTGGTCCCATGCTTCGCGGCGGAGAGCGCGTACAAAATCTGGTGGTACGCGTCAGGCGCGTCGCAAATCCCATTCCTGGGGAACGTTATAGTGAGCGACGCAGTAGCGTGCTCCATGGAGCTGCTGTCGTGGCTGTACAGAACCACAGTGATCTTCCTTGTGTGCATCCTCTTCCGTCTGATCTGCGACCTCCAGATCCTACGGCTGCAGGACTTCGCCACCGTGTTCCAGGTAGACTCCGACGTGGGGTCGGTCCTTTCCGAGCATTTGAGAATCAGACGGCACTTGAGAATCATCAGCCACCGGTACCGCGCCTTCATATTGTGGTCGTTGATCCTCGTCACCGGCAGCCAGTTCACTTCTCTACTCATGACTACCAAGTCCTCCTCCCTCAACATCTACATCGCCGGCGAACTTGCGGTACGCGAAATCAAACCCCTCTTAACAAATTCGTTTTAAAACGGAAATGCTGAAAGCGAAATTGGAGAATATCTGTTTGATTGCAATTAATTTTAGATTGAATTAAAAACAAGTATGAAAATGGGTGCAGCTGTGCTCGATGACGCTACTGACAAGTCTGATGATACTTCTACGGAGCGCCACAAAAATCACCCACAAAGCGCAGTCAGTGACGGCGCTAGCT
GCCAAGTGGCACGTGTGTGCGACGTTGGATTCTTTCGACGTGACAGACGGGGAGACCCCCATGGCTGCCGCCGCTCCCACCGACGGAAACCTTATGTTTCCGGTGACACCGCGCGGCGATGAGGAATCAGAGGGGGAGGAAGGTTGCGACGAAGAAGATGAACTGGACAACACCAAGTTGATCCCAGCCTACGCGTACAGCACCATCTCATTCCAAAAGAGACAGGCCTTAGGTAATCCCAAAATTCCTAATCTAAATCGAAAATTAAGTAGAAATTAAGTATTGATTAATGGGTTTGGTTTGGTTTGGTTGGTGAATGCAGTGACGTACTTCGAGAACAACAGAGCTGGGATAACCATATACGGGTTTACCCTGGATAGGACTACACTCCACACCATCTTTGGAATAGAGTTATCCTTGGTTCTTTGGCTGCTTGGCAAAACCATCGGTTTTTCTTAAACCCCAAATTCAACTTCTTTATTATTATTATTATTATCGTAATTAATTAATTAATTCCCACTTCCCACTGCCTTGTATTTGTGTGCTGTATTCTTCCATCTTTGTTTAATTACTGTCCCCTCCTCCCCCTCCCACGTATTGTTTTCTGCGTTTTCTTATAATTTACATTTTACTTTGTATTTTTTTTTTATTTTCT

mRNA sequence

GCCACGTGTCATTTTCTTGCCTGTGATCCTTCACTTTGTTTATGTTGTCCACATACGCTCCTCCATTTGTATCTTTTAAGAAAATAAAAATGAAAATGAAAATAAAAATAAAATAAAATAAAATAAAATAAAAATAAAAAACCCCTAAACCCAATTGTTGCGAAGTTTTTTCGTTCTTCTCTGCAAATCCCTTCTCCTTCAACCTTCAACAAATCAAACCTATAAAAGGTGAAGTGAAAGAAATTCCAATTTCCTTATCATCATCATCTTCAATCAAGCAATCCACAATCGGAATCCTGTTCGTCCCTATCAATTTGTACAAAAAGCGAAGAAATCAAGCAGGGGAGGGAGGAACAATCAACAATCCTGAGTTTCGCTCGCAATCGCCATGGGCGACACCAGCAGAGAGGCGTTGATTAGCAGAAAGGCGAAGGTGTTCAATCGCTCAGCCTCTCACGCTCAGGACGAATTGCATAGCTTCAGATCGTATTTGCGATGGATGTGCGTGGATCAATCGGACATTTGGAGCGCTGGATTGTCGTGGTCGGTGTTCTTCCTCTTCGCTATCGTCGTTCCAGCGACCTCGCATTTCCATCTCGCCTGTTCTTCCTGTGATAGCCATCACGCCAGGCCGTTCGATCGCGTCGTTCAATTGTCGCTCAGCAGTGTGGCGACGGTTTCCTTCCTCTGCCTTTCGAATTTCATCAGGAGGTACGGACTTAGGAGATTCTTGTTCTTCGATAGGCTCTGCGATGAAAGCGAAACTGTGAGAAGAGGATACACCAACAAGTTCAATAGATCACTGAGAGTACTATCAGCATTTGTGGTCCCATGCTTCGCGGCGGAGAGCGCGTACAAAATCTGGTGGTACGCGTCAGGCGCGTCGCAAATCCCATTCCTGGGGAACGTTATAGTGAGCGACGCAGTAGCGTGCTCCATGGAGCTGCTGTCGTGGCTGTACAGAACCACAGTGATCTTCCTTGTGTGCATCCTCTTCCGTCTGATCTGCGACCTCCAGATCCTACGGCTGCAGGACTTCGCCACCGTGTTCCAGGTAGACTCCGACGTGGGGTCGGTCCTTTCCGAGCATTTGAGAATCAGACGGCACTTGAGAATCATCAGCCACCGGTACCGCGCCTTCATATTGTGGTCGTTGATCCTCGTCACCGGCAGCCAGTTCACTTCTCTACTCATGACTACCAAGTCCTCCTCCCTCAACATCTACATCGCCGGCGAACTTGCGCTGTGCTCGATGACGCTACTGACAAGTCTGATGATACTTCTACGGAGCGCCACAAAAATCACCCACAAAGCGCAGTCAGTGACGGCGCTAGCTGCCAAGTGGCACGTGTGTGCGACGTTGGATTCTTTCGACGTGACAGACGGGGAGACCCCCATGGCTGCCGCCGCTCCCACCGACGGAAACCTTATGTTTCCGGTGACACCGCGCGGCGATGAGGAATCAGAGGGGGAGGAAGGTTGCGACGAAGAAGATGAACTGGACAACACCAAGTTGATCCCAGCCTACGCGTACAGCACCATCTCATTCCAAAAGAGACAGGCCTTAGTGACGTACTTCGAGAACAACAGAGCTGGGATAACCATATACGGGTTTACCCTGGATAGGACTACACTCCACACCATCTTTGGAATAGAGTTATCCTTGGTTCTTTGGCTGCTTGGCAAAACCATCGGTTTTTCTTAAACCCCAAATTCAACTTCTTTATTATTATTATTATTATCGTAATTAATTAATTAATTCCCACTTCCCACTGCCTTGTATTTGTGTGCTGTATTCTTCCATCTTTGTTTAATTACTGTCCCCTCCTCCCCCTCCCACGTATTGTTTTCTGCGTTTTCTTATAATTTACATTTTACTTTGTATTTTTTTTTTATTTTCT

Coding sequence (CDS)

ATGGGCGACACCAGCAGAGAGGCGTTGATTAGCAGAAAGGCGAAGGTGTTCAATCGCTCAGCCTCTCACGCTCAGGACGAATTGCATAGCTTCAGATCGTATTTGCGATGGATGTGCGTGGATCAATCGGACATTTGGAGCGCTGGATTGTCGTGGTCGGTGTTCTTCCTCTTCGCTATCGTCGTTCCAGCGACCTCGCATTTCCATCTCGCCTGTTCTTCCTGTGATAGCCATCACGCCAGGCCGTTCGATCGCGTCGTTCAATTGTCGCTCAGCAGTGTGGCGACGGTTTCCTTCCTCTGCCTTTCGAATTTCATCAGGAGGTACGGACTTAGGAGATTCTTGTTCTTCGATAGGCTCTGCGATGAAAGCGAAACTGTGAGAAGAGGATACACCAACAAGTTCAATAGATCACTGAGAGTACTATCAGCATTTGTGGTCCCATGCTTCGCGGCGGAGAGCGCGTACAAAATCTGGTGGTACGCGTCAGGCGCGTCGCAAATCCCATTCCTGGGGAACGTTATAGTGAGCGACGCAGTAGCGTGCTCCATGGAGCTGCTGTCGTGGCTGTACAGAACCACAGTGATCTTCCTTGTGTGCATCCTCTTCCGTCTGATCTGCGACCTCCAGATCCTACGGCTGCAGGACTTCGCCACCGTGTTCCAGGTAGACTCCGACGTGGGGTCGGTCCTTTCCGAGCATTTGAGAATCAGACGGCACTTGAGAATCATCAGCCACCGGTACCGCGCCTTCATATTGTGGTCGTTGATCCTCGTCACCGGCAGCCAGTTCACTTCTCTACTCATGACTACCAAGTCCTCCTCCCTCAACATCTACATCGCCGGCGAACTTGCGCTGTGCTCGATGACGCTACTGACAAGTCTGATGATACTTCTACGGAGCGCCACAAAAATCACCCACAAAGCGCAGTCAGTGACGGCGCTAGCTGCCAAGTGGCACGTGTGTGCGACGTTGGATTCTTTCGACGTGACAGACGGGGAGACCCCCATGGCTGCCGCCGCTCCCACCGACGGAAACCTTATGTTTCCGGTGACACCGCGCGGCGATGAGGAATCAGAGGGGGAGGAAGGTTGCGACGAAGAAGATGAACTGGACAACACCAAGTTGATCCCAGCCTACGCGTACAGCACCATCTCATTCCAAAAGAGACAGGCCTTAGTGACGTACTTCGAGAACAACAGAGCTGGGATAACCATATACGGGTTTACCCTGGATAGGACTACACTCCACACCATCTTTGGAATAGAGTTATCCTTGGTTCTTTGGCTGCTTGGCAAAACCATCGGTTTTTCTTAA
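
As a quick sanity check, the CDS can be translated and compared with the query protein shown in the BLAST alignments. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the `translate` helper is hypothetical and only the first and last few codons of the CDS are used here.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First eight and last four codons of the CDS listed above.
cds_start = "ATGGGCGACACCAGCAGAGAGGCG"
cds_end = "GGTTTTTCTTAA"

print(translate(cds_start))  # start of the predicted protein
print(translate(cds_end))    # end of the predicted protein plus stop codon
```

The two fragments translate to MGDTSREA and GFS*, which agrees with the start and end of the 439-residue query in the alignments.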
BLAST of CmoCh01G011620.1 vs. TrEMBL
Match: A0A0A0K352_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G179620 PE=4 SV=1)

HSP 1 Score: 756.5 bits (1952), Expect = 1.8e-215
Identity = 396/439 (90.21%), Positives = 413/439 (94.08%), Query Frame = 1

Query: 1   MGDTSREALISRKAKVFNRSASHAQDELHSFRSYLRWMCVDQSDIWSAGLSWSVFFLFAI 60
           MGD++REALISRK+ VF RS SHA DELHSFRSYLRWMCVDQSDIW+AGLSWS+FFLFAI
Sbjct: 1   MGDSNREALISRKSSVFKRSVSHAHDELHSFRSYLRWMCVDQSDIWTAGLSWSMFFLFAI 60

Query: 61  VVPATSHFHLACSSCDSHHARPFDRVVQLSLSSVATVSFLCLSNFIRRYGLRRFLFFDRL 120
           +VPATSHF LACSSCDS+HARPFDRVVQLSLSSVATVSFLCLS+FIRRYGLRRFLFFD+L
Sbjct: 61  IVPATSHFVLACSSCDSNHARPFDRVVQLSLSSVATVSFLCLSSFIRRYGLRRFLFFDKL 120

Query: 121 CDESETVRRGYTNKFNRSLRVLSAFVVPCFAAESAYKIWWYASGASQIPFLGNVIVSDAV 180
           CDESETVRRGYT KFNRSLRVLS FV+PCFAAESAYKIWWYASGASQIPFLGNVIVSDAV
Sbjct: 121 CDESETVRRGYTIKFNRSLRVLSTFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAV 180

Query: 181 ACSMELLSWLYRTTVIFLVCILFRLICDLQILRLQDFATVFQVDSDVGSVLSEHLRIRRH 240
           AC+MELLSWLYRTTVIFLVCILFRLICDLQILRLQDFATVFQVDSDV SVLSEHLRIRRH
Sbjct: 181 ACAMELLSWLYRTTVIFLVCILFRLICDLQILRLQDFATVFQVDSDVASVLSEHLRIRRH 240

Query: 241 LRIISHRYRAFILWSLILVTGSQFTSLLMTTKSSS-LNIYIAGELALCSMTLLTSLMILL 300
           LRIISHRYR FIL SL+LVTGSQFTSLL+TTKSSS LNIYIAGELALCSMTLLTSLMILL
Sbjct: 241 LRIISHRYRVFILGSLVLVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL 300

Query: 301 RSATKITHKAQSVTALAAKWHVCATLDSFDVTDGETPMAAAAPTDGNLMFPVTPRGDEES 360
           RSATKITHKAQSVTALAAKWHVCATLDSFDVTDGETPMA+      +  FP      EES
Sbjct: 301 RSATKITHKAQSVTALAAKWHVCATLDSFDVTDGETPMASTI----HQAFPPHHGVGEES 360

Query: 361 EGEEGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTI 420
           EG+EGCD ED+LDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTI
Sbjct: 361 EGDEGCD-EDDLDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTI 420

Query: 421 FGIELSLVLWLLGKTIGFS 439
           FGIELSLVLWLLGKTIGFS
Sbjct: 421 FGIELSLVLWLLGKTIGFS 434
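
The percentages in an HSP summary line are simple ratios of matched positions to alignment length. The sketch below parses the summary line for the first hit and recomputes them; the function name `parse_hsp_stats` is hypothetical, and the pattern is written for the line layout used on this page (it accepts either spelling of the positives label).

```python
import re

HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return recomputed identity and positives percentages for an HSP line."""
    m = HSP_LINE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return {
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        "alignment_length": int(ident_len),
    }

stats = parse_hsp_stats(
    "Identity = 396/439 (90.21%), Postives = 413/439 (94.08%), Query Frame = 1"
)
print(stats)
```

Here 396 of 439 aligned positions are identical and 413 of 439 are identical or conservative substitutions, which reproduces the reported 90.21% and 94.08%.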

BLAST of CmoCh01G011620.1 vs. TrEMBL
Match: A0A061F5U3_THECC (F11F12.5 protein OS=Theobroma cacao GN=TCM_030801 PE=4 SV=1)

HSP 1 Score: 651.0 bits (1678), Expect = 1.0e-183
Identity = 337/439 (76.77%), Positives = 381/439 (86.79%), Query Frame = 1

Query: 1   MGDTSREALISRKAKVFNRSASHAQDELHSFRSYLRWMCVDQSDIWSAGLSWSVFFLFAI 60
           MGD +RE L+ R    F R+ SHA DEL SFR+YLRWMCVDQSD+W+A LSW VF L  +
Sbjct: 1   MGD-NREPLVDRTR--FMRTISHAHDELQSFRTYLRWMCVDQSDVWTACLSWFVFILLGL 60

Query: 61  VVPATSHFHLACSSCDSHHARPFDRVVQLSLSSVATVSFLCLSNFIRRYGLRRFLFFDRL 120
           VVPA SHF LACS+CD+ HARP+D VVQLSLSSVA++SF+CL+ F+++YGLRRFLFFD+L
Sbjct: 61  VVPAMSHFLLACSTCDARHARPYDWVVQLSLSSVASLSFVCLTRFVKKYGLRRFLFFDKL 120

Query: 121 CDESETVRRGYTNKFNRSLRVLSAFVVPCFAAESAYKIWWYASGASQIPFLGNVIVSDAV 180
           CDESETVR+GYT + NRSL+++S FV+PCF AESAYK+WWYASGASQIPFLG V +SDA+
Sbjct: 121 CDESETVRKGYTGQLNRSLKIVSIFVLPCFVAESAYKVWWYASGASQIPFLGIVWLSDAM 180

Query: 181 ACSMELLSWLYRTTVIFLVCILFRLICDLQILRLQDFATVFQVDSDVGSVLSEHLRIRRH 240
           AC MEL SWLYRTTV FLVC+LFRLIC+LQ+LR+QDFA VFQVDSDVGSVLSEHLRIRRH
Sbjct: 181 ACMMELCSWLYRTTVFFLVCVLFRLICNLQVLRIQDFAQVFQVDSDVGSVLSEHLRIRRH 240

Query: 241 LRIISHRYRAFILWSLILVTGSQFTSLLMTTKSSS-LNIYIAGELALCSMTLLTSLMILL 300
           LRIISHRYRAFILW LILVTGSQFTSLL+TTK+++ LNIY AGELA+CS+TLLT L ILL
Sbjct: 241 LRIISHRYRAFILWCLILVTGSQFTSLLITTKANAELNIYKAGELAMCSITLLTGLSILL 300

Query: 301 RSATKITHKAQSVTALAAKWHVCATLDSFDVTDGETPMAAAAPTDGNLMFPVTPRGDEES 360
           RSATKITHKAQSVT LAAKWHVCATLDSFD  DGETP   A    G   FP     D ES
Sbjct: 301 RSATKITHKAQSVTCLAAKWHVCATLDSFDANDGETPRIPAI--HGCQSFPYVGT-DGES 360

Query: 361 EGEEGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTI 420
           +GE+  DEED++DN+KLIPAYAYSTIS+QKRQALVTYFENNRAGITIYGFTLDR+TLHTI
Sbjct: 361 DGEDAGDEEDDIDNSKLIPAYAYSTISYQKRQALVTYFENNRAGITIYGFTLDRSTLHTI 420

Query: 421 FGIELSLVLWLLGKTIGFS 439
           FGIELSLVLWLLGKTIG S
Sbjct: 421 FGIELSLVLWLLGKTIGIS 433

BLAST of CmoCh01G011620.1 vs. TrEMBL
Match: D7LAY2_ARALL (Extracellular ligand-gated ion channel OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_479553 PE=4 SV=1)

HSP 1 Score: 636.7 bits (1641), Expect = 2.0e-179
Identity = 331/438 (75.57%), Positives = 374/438 (85.39%), Query Frame = 1

Query: 2   GDTSREALISRKAKVFNRSASHAQDELHSFRSYLRWMCVDQSDIWSAGLSWSVFFLFAIV 61
           G  +RE LI+R+ K F RS SHAQDEL SFR YLRWMCVDQS  W+A LSWS+F +F +V
Sbjct: 17  GRGTRERLINRETK-FTRSVSHAQDELQSFRKYLRWMCVDQSSPWTAVLSWSMFVVFTLV 76

Query: 62  VPATSHFHLACSSCDSHHARPFDRVVQLSLSSVATVSFLCLSNFIRRYGLRRFLFFDRLC 121
           VPATSHF LAC+ CDSHH+RP+D VVQLSLSS A +SFLCLS F+ +YGLRRFLFFD+L 
Sbjct: 77  VPATSHFMLACADCDSHHSRPYDSVVQLSLSSFAALSFLCLSRFVSKYGLRRFLFFDKLW 136

Query: 122 DESETVRRGYTNKFNRSLRVLSAFVVPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVA 181
           DESETVRRGYTN+ NRSL++LS FV PCF A S+YKIWWYASGASQIPFLGNVI+SD VA
Sbjct: 137 DESETVRRGYTNQLNRSLKILSYFVTPCFLAMSSYKIWWYASGASQIPFLGNVILSDTVA 196

Query: 182 CSMELLSWLYRTTVIFLVCILFRLICDLQILRLQDFATVFQVDSDVGSVLSEHLRIRRHL 241
           C MEL SWLYRTTVIFLVC+LFRLIC LQILRLQDFA VFQ+DSDVGS+LSEHLRIRRHL
Sbjct: 197 CLMELCSWLYRTTVIFLVCVLFRLICHLQILRLQDFAQVFQMDSDVGSILSEHLRIRRHL 256

Query: 242 RIISHRYRAFILWSLILVTGSQFTSLLMTTKS-SSLNIYIAGELALCSMTLLTSLMILLR 301
           RIISHRYR FIL SLILVTGSQF SLL+TTK+ + LNIY AGELALCSMTL+T+L+ILLR
Sbjct: 257 RIISHRYRTFILLSLILVTGSQFYSLLITTKAYAELNIYRAGELALCSMTLVTALLILLR 316

Query: 302 SATKITHKAQSVTALAAKWHVCATLDSFDVTDGETPMAAAAPTDGNLMFPVTPRGDEESE 361
           SA+KITHKAQ+VT LAAKWHVCAT++SF+  DGETP        G+  +P T   + ES+
Sbjct: 317 SASKITHKAQAVTCLAAKWHVCATIESFETVDGETPRLVDR-ASGHGYYP-TDDDNGESD 376

Query: 362 GEEGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTIF 421
            E+  DEED+ DN  LIPAYAYSTISFQKRQALV YFENNRAGIT++GFTLDR+TLHTIF
Sbjct: 377 SEDYGDEEDDFDNNNLIPAYAYSTISFQKRQALVNYFENNRAGITVFGFTLDRSTLHTIF 436

Query: 422 GIELSLVLWLLGKTIGFS 439
           GIE+SLVLWLLGKTIG S
Sbjct: 437 GIEMSLVLWLLGKTIGIS 451

BLAST of CmoCh01G011620.1 vs. TrEMBL
Match: B9GJ40_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s36620g PE=4 SV=1)

HSP 1 Score: 636.3 bits (1640), Expect = 2.7e-179
Identity = 328/437 (75.06%), Positives = 379/437 (86.73%), Query Frame = 1

Query: 3   DTSREALISRKAKVFNRSASHAQDELHSFRSYLRWMCVDQSDIWSAGLSWSVFFLFAIVV 62
           + +RE+LI+R  K+ +RS SHA DEL SFRSYLRWMCVDQS + +A LSW++F LF +VV
Sbjct: 4   NNNRESLINRNNKL-SRSQSHAYDELKSFRSYLRWMCVDQSSMGTACLSWTMFVLFGLVV 63

Query: 63  PATSHFHLACSSCDSHHARPFDRVVQLSLSSVATVSFLCLSNFIRRYGLRRFLFFDRLCD 122
           PATSHF LACSSCDS H RP+D VVQLSLSSVAT+SF+CLS F+R+YGLRRFLFFD+L D
Sbjct: 64  PATSHFVLACSSCDSRHGRPYDSVVQLSLSSVATLSFVCLSRFVRKYGLRRFLFFDKLWD 123

Query: 123 ESETVRRGYTNKFNRSLRVLSAFVVPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVAC 182
           ESETVRRGYTN+ N SL++L  FV+PCF AE AYKIWWYASGASQIPFLGNV++SD VAC
Sbjct: 124 ESETVRRGYTNQLNSSLKLLLIFVIPCFVAECAYKIWWYASGASQIPFLGNVVLSDTVAC 183

Query: 183 SMELLSWLYRTTVIFLVCILFRLICDLQILRLQDFATVFQVDSDVGSVLSEHLRIRRHLR 242
            MEL SWLYRT VIFLVC+LF LIC LQILRLQDFA VFQVDSDVGSVLSEH RIRRHLR
Sbjct: 184 IMELCSWLYRTIVIFLVCVLFHLICHLQILRLQDFARVFQVDSDVGSVLSEHFRIRRHLR 243

Query: 243 IISHRYRAFILWSLILVTGSQFTSLLMTTKSSS-LNIYIAGELALCSMTLLTSLMILLRS 302
           IISHRYR F+L SLIL+TGSQF+SLL+TTK+ + ++IY AGELALCS+TL+T L+I+LRS
Sbjct: 244 IISHRYRGFVLCSLILITGSQFSSLLITTKAHAVVDIYRAGELALCSITLVTGLLIILRS 303

Query: 303 ATKITHKAQSVTALAAKWHVCATLDSFDVTDGETPMAAAAPTDGNLMFPVTPRGDEESEG 362
           ATKITHKAQ+VT+LAAKWH+CATLD+FD T+GETP       D   +FPV    D ES+G
Sbjct: 304 ATKITHKAQAVTSLAAKWHICATLDTFDATEGETPR-----HDSGQVFPVVGT-DGESDG 363

Query: 363 EEGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTIFG 422
           ++  DEEDELDN+KLIPAYAYSTISFQKRQALVTYFENNRAGIT+YGF LDR+TLH+IFG
Sbjct: 364 DDAGDEEDELDNSKLIPAYAYSTISFQKRQALVTYFENNRAGITVYGFILDRSTLHSIFG 423

Query: 423 IELSLVLWLLGKTIGFS 439
           +EL+LVLWLLGKTIG S
Sbjct: 424 VELALVLWLLGKTIGIS 433

BLAST of CmoCh01G011620.1 vs. TrEMBL
Match: A0A0D2QY93_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_009G443500 PE=4 SV=1)

HSP 1 Score: 635.2 bits (1637), Expect = 6.0e-179
Identity = 325/439 (74.03%), Positives = 376/439 (85.65%), Query Frame = 1

Query: 1   MGDTSREALISRKAKVFNRSASHAQDELHSFRSYLRWMCVDQSDIWSAGLSWSVFFLFAI 60
           M DT+RE L+ R  + F R+ SHAQDELHSFR+YLRWMCVDQS+IW+A LSW +F +  +
Sbjct: 1   MEDTTREPLMDRSTR-FQRTISHAQDELHSFRTYLRWMCVDQSNIWTASLSWFLFIVLGL 60

Query: 61  VVPATSHFHLACSSCDSHHARPFDRVVQLSLSSVATVSFLCLSNFIRRYGLRRFLFFDRL 120
           VVP  SHF LACSSCD+ HARP+D VVQLSLSSV+T+SF+CL++F+++YGL+RFLFFD+L
Sbjct: 61  VVPCLSHFFLACSSCDAKHARPYDWVVQLSLSSVSTMSFVCLTSFVKKYGLKRFLFFDKL 120

Query: 121 CDESETVRRGYTNKFNRSLRVLSAFVVPCFAAESAYKIWWYASGASQIPFLGNVIVSDAV 180
           C+ESE VR+GYT K NRSL+++S+FV+PCF AE+AYK+WWYASGASQIPFLG V +SD+V
Sbjct: 121 CNESEIVRKGYTAKLNRSLKIVSSFVIPCFLAETAYKVWWYASGASQIPFLGIVWLSDSV 180

Query: 181 ACSMELLSWLYRTTVIFLVCILFRLICDLQILRLQDFATVFQVDSDVGSVLSEHLRIRRH 240
           AC MEL SWLYRTTV FLVC+LF L+C+LQ+LR+QDFA VFQVDSDVGSVLSEHLRIRRH
Sbjct: 181 ACFMELCSWLYRTTVFFLVCVLFHLVCNLQVLRIQDFAQVFQVDSDVGSVLSEHLRIRRH 240

Query: 241 LRIISHRYRAFILWSLILVTGSQFTSLLMTTKSSS-LNIYIAGELALCSMTLLTSLMILL 300
           LRIISHRYR+FILW LILVTGSQFTSLL+T K++S LNIY AGEL +CS+TL+T L ILL
Sbjct: 241 LRIISHRYRSFILWCLILVTGSQFTSLLVTLKATSELNIYKAGELLMCSLTLVTGLCILL 300

Query: 301 RSATKITHKAQSVTALAAKWHVCATLDSFDVTDGETPMAAAAPTDGNLMFPVTPRGDEES 360
           RSATKITHKAQSVT LA+KWHVCATLDSFDV DGETP   A        FP     D ES
Sbjct: 301 RSATKITHKAQSVTCLASKWHVCATLDSFDVNDGETPRTPAIHV--RQSFP-NVGTDGES 360

Query: 361 EGEEGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTI 420
           E E+  +EED+LDN KLIPAYAYSTIS+QKR ALVTYFENNRAGITIYGF LDR+TLHTI
Sbjct: 361 ESEDVGEEEDDLDNNKLIPAYAYSTISYQKRHALVTYFENNRAGITIYGFMLDRSTLHTI 420

Query: 421 FGIELSLVLWLLGKTIGFS 439
           FGIELSLVLWLL KTIG S
Sbjct: 421 FGIELSLVLWLLSKTIGIS 435

BLAST of CmoCh01G011620.1 vs. TAIR10
Match: AT3G20300.1 (AT3G20300.1 Protein of unknown function (DUF3537))

HSP 1 Score: 634.8 bits (1636), Expect = 3.9e-182
Identity = 330/435 (75.86%), Positives = 373/435 (85.75%), Query Frame = 1

Query: 5   SREALISRKAKVFNRSASHAQDELHSFRSYLRWMCVDQSDIWSAGLSWSVFFLFAIVVPA 64
           +RE LI+R+ K F RS SHAQDELHSFR YLRWMCVDQS  W+A LSWS+F +F +VVPA
Sbjct: 21  TRERLINRENK-FTRSVSHAQDELHSFRKYLRWMCVDQSSPWTAVLSWSMFVVFTLVVPA 80

Query: 65  TSHFHLACSSCDSHHARPFDRVVQLSLSSVATVSFLCLSNFIRRYGLRRFLFFDRLCDES 124
           TSHF LACS CDSHH+RP+D VVQLSLSS A +SFLCLS F+ +YGLRRFLFFD+L DES
Sbjct: 81  TSHFMLACSDCDSHHSRPYDSVVQLSLSSFAALSFLCLSRFVSKYGLRRFLFFDKLWDES 140

Query: 125 ETVRRGYTNKFNRSLRVLSAFVVPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACSM 184
           ETVR GYTN+ NRSL++LS FV PCF A S+YKIWWYASGASQIPFLGNVI+SD VAC M
Sbjct: 141 ETVRLGYTNQLNRSLKILSYFVSPCFLAMSSYKIWWYASGASQIPFLGNVILSDTVACLM 200

Query: 185 ELLSWLYRTTVIFLVCILFRLICDLQILRLQDFATVFQVDSDVGSVLSEHLRIRRHLRII 244
           EL SWLYRTTVIFLVC+LFRLIC LQILRLQDFA VFQ+DSDVGS+LSEHLRIRRHLRII
Sbjct: 201 ELCSWLYRTTVIFLVCVLFRLICHLQILRLQDFAQVFQMDSDVGSILSEHLRIRRHLRII 260

Query: 245 SHRYRAFILWSLILVTGSQFTSLLMTTKS-SSLNIYIAGELALCSMTLLTSLMILLRSAT 304
           SHRYR FIL SLILVTGSQF SLL+TTK+ + LNIY AGELALCSMTL+T+L+ILLRSA+
Sbjct: 261 SHRYRTFILLSLILVTGSQFYSLLITTKAYAELNIYRAGELALCSMTLVTALLILLRSAS 320

Query: 305 KITHKAQSVTALAAKWHVCATLDSFDVTDGETPMAAAAPTDGNLMFPVTPRGDEESEGEE 364
           KITHKAQ+VT LAAKWHVCAT++SF+  DGETP        G+  +P T   + ES+ E+
Sbjct: 321 KITHKAQAVTCLAAKWHVCATIESFETVDGETPRLVDR-ASGHGYYP-TDDDNGESDSED 380

Query: 365 GCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTIFGIE 424
             DEED+ DN  LIPAYAYSTISFQKRQALV YFENNR+GIT++GFTLDR+TLHTIFGIE
Sbjct: 381 YGDEEDDFDNNNLIPAYAYSTISFQKRQALVNYFENNRSGITVFGFTLDRSTLHTIFGIE 440

Query: 425 LSLVLWLLGKTIGFS 439
           +SLVLWLLGKTIG S
Sbjct: 441 MSLVLWLLGKTIGIS 452

BLAST of CmoCh01G011620.1 vs. TAIR10
Match: AT1G50630.1 (AT1G50630.1 Protein of unknown function (DUF3537))

HSP 1 Score: 605.9 bits (1561), Expect = 2.0e-173
Identity = 313/432 (72.45%), Positives = 362/432 (83.80%), Query Frame = 1

Query: 15  KVFNRSASHAQDELHSFRSYLRWMCVDQSDIWSAGLSWSVFFLFAIVVPATSHFHLACSS 74
           K+FNR  SH QDELHSFR YLRWMCVD S  W+A LSW++F +F +VVPA SHF LAC+ 
Sbjct: 24  KIFNRCVSHQQDELHSFRKYLRWMCVDHSSPWTAILSWTMFIVFTLVVPAISHFLLACAD 83

Query: 75  CDSHHARPFDRVVQLSLSSVATVSFLCLSNFIRRYGLRRFLFFDRLCDESETVRRGYTNK 134
           CDS+H+RP+D VVQLSLSSVATVSFLCL+ F+ +YGLRRFLFFD+L DESETVRR YTN+
Sbjct: 84  CDSYHSRPYDSVVQLSLSSVATVSFLCLTRFVSKYGLRRFLFFDKLWDESETVRRNYTNQ 143

Query: 135 FNRSLRVLSAFVVPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACSMELLSWLYRTT 194
            N SL ++S FV+PCF+A SAYKIWWYASG S+IPFLGN ++SD VAC MEL SWLYRTT
Sbjct: 144 LNTSLHIVSYFVIPCFSAMSAYKIWWYASGGSRIPFLGNAVLSDTVACIMELCSWLYRTT 203

Query: 195 VIFLVCILFRLICDLQILRLQDFATVFQVDSDVGSVLSEHLRIRRHLRIISHRYRAFILW 254
           VIFLVC+LFRLIC LQILRLQDFA +FQ+DSDVGS+LSEHLRIRRHLRIISHRYR+FIL 
Sbjct: 204 VIFLVCVLFRLICHLQILRLQDFAKLFQIDSDVGSILSEHLRIRRHLRIISHRYRSFILC 263

Query: 255 SLILVTGSQFTSLLMTTKS-SSLNIYIAGELALCSMTLLTSLMILLRSATKITHKAQSVT 314
            LILVTGSQF+SLL+TTK+ + +NIY AGELALCSMTL+T+L+ILLRSA+KITHKAQ+VT
Sbjct: 264 LLILVTGSQFSSLLITTKAYTEVNIYRAGELALCSMTLVTALLILLRSASKITHKAQAVT 323

Query: 315 ALAAKWHVCATLDSFDVT------DGETP-MAAAAPTDGNLMFPVTPRGDEESEGEEGCD 374
            LAAKWHVCATL+SFD T        ETP + A    D N +  V      ES+ +E  D
Sbjct: 324 CLAAKWHVCATLESFDQTVESFDQTVETPTLVARNNNDNNNVHDVVTL--TESDSDEYGD 383

Query: 375 EEDELDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTIFGIELSL 434
           EED+LDN  +IP YA+ST+SFQKRQALV+YFENN AGIT+YGFTLDR TLHTIFG+ELSL
Sbjct: 384 EEDDLDNNDIIPVYAFSTMSFQKRQALVSYFENNSAGITVYGFTLDRGTLHTIFGLELSL 443

Query: 435 VLWLLGKTIGFS 439
           VLWLLGKTIG S
Sbjct: 444 VLWLLGKTIGIS 453

BLAST of CmoCh01G011620.1 vs. TAIR10
Match: AT4G22270.1 (AT4G22270.1 Protein of unknown function (DUF3537))

HSP 1 Score: 427.9 bits (1099), Expect = 7.3e-120
Identity = 237/429 (55.24%), Positives = 311/429 (72.49%), Query Frame = 1

Query: 11  SRKAKVFNRSASHAQDELHSFRSYLRWMCVDQSDIWSAGLSWSVFFLFAIVVPATSHFHL 70
           +R + + N+          +F S + W   DQS+  +A LSWSVFFL  ++VP  SHF L
Sbjct: 15  TRPSAIINKQQPEFPASRFTFMSLVLWF--DQSNFGTALLSWSVFFLLVVIVPLISHFLL 74

Query: 71  ACSSCDSHHARPFDRVVQLSLSSVATVSFLCLSNFIRRYGLRRFLFFDRLCDESETVRRG 130
            CS CD HH RP+D +VQLSLS  A +SF+ LS + R++G+RRFLF D+L D S+ VR  
Sbjct: 75  VCSDCDFHHRRPYDVIVQLSLSIFAGISFVSLSIWSRKFGMRRFLFLDKLWDVSDKVRIE 134

Query: 131 YTNKFNRSLRVLSAFVVPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACSMELLSWL 190
           Y  +  RSL+ L  FV+P    E+ Y+IWWY SG +QIP++ N I+S  VAC+++L SWL
Sbjct: 135 YEAEIQRSLKRLMIFVLPSLTLEATYRIWWYISGFNQIPYIINPILSHVVACTLQLSSWL 194

Query: 191 YRTTVIFLVCILFRLICDLQILRLQDFATVFQVD-SDVGSVLSEHLRIRRHLRIISHRYR 250
           YR ++  +VCIL+++ C LQ LRL DFA  F  + +DV S L EH +IRR+LRI+SHR+R
Sbjct: 195 YRNSLFIIVCILYKITCHLQTLRLDDFARCFASEITDVRSALGEHQKIRRNLRIVSHRFR 254

Query: 251 AFILWSLILVTGSQFTSLLMTTKSS-SLNIYIAGELALCSMTLLTSLMILLRSATKITHK 310
            FIL SLILVT +QF +LL TT++S ++NIY  GELALCS++L+T + I LRSATKITHK
Sbjct: 255 RFILLSLILVTATQFMALLTTTRASVAVNIYEVGELALCSLSLVTGVFICLRSATKITHK 314

Query: 311 AQSVTALAAKWHVCATLDSFDVTDGETPMAAAAPTDGNLMFPVTPRGD--EESEGEEGCD 370
           AQSVT+LAAKW+VCAT+DSFD  DGET      PT   +   V+ RG+  E S+ EEG +
Sbjct: 315 AQSVTSLAAKWNVCATVDSFDHLDGET------PTGSIIESQVSLRGNAIETSDDEEG-E 374

Query: 371 EEDELDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTIFGIELSL 430
            +D+LDNTK+ P YA +TIS+QKRQALVTY ENN+AGIT+YGF +DR+ L+TIFGIEL+L
Sbjct: 375 GDDDLDNTKIHPIYA-NTISYQKRQALVTYLENNKAGITVYGFLVDRSWLNTIFGIELAL 433

Query: 431 VLWLLGKTI 436
           +LWLL KTI
Sbjct: 435 LLWLLNKTI 433

BLAST of CmoCh01G011620.1 vs. TAIR10
Match: AT4G03820.2 (AT4G03820.2 Protein of unknown function (DUF3537))

HSP 1 Score: 382.5 bits (981), Expect = 3.5e-106
Identity = 213/412 (51.70%), Positives = 286/412 (69.42%), Query Frame = 1

Query: 30  SFRSYLRWMCVDQSDIWSAGLSWSVFFLFAIVVPATSHFHLACSSCDSHHARPFDRVVQL 89
           SF     W   DQS+     LSWS+FFL A++VP  SHF L C+ CD  H RP+D +VQL
Sbjct: 31  SFPRLFLWF--DQSNRIKTLLSWSIFFLLAVIVPMISHFVLICADCDFKHRRPYDGLVQL 90

Query: 90  SLSSVATVSFLCLSNFIRRYGLRRFLFFDRLCDESETVRRGYTNKFNRSLRVLSAFVVPC 149
           SLS  A +SF+ LS++ ++YG+RRFLFFD+L D S+ VR GY  K  RS+++L+ FV+P 
Sbjct: 91  SLSIFAGISFVSLSDWSKKYGIRRFLFFDKLKDVSDKVRIGYEAKIQRSMKLLAIFVLPS 150

Query: 150 FAAESAYKIWWYASGASQIPFLGNVIVSDAVACSMELLSWLYRTTVIFLVCILFRLICDL 209
              ++ Y+IWWYASG +QIP++ N  +S  +AC+++L SWLYRT++  + CIL++ IC L
Sbjct: 151 TTLQAIYRIWWYASGFNQIPYIINPTLSHVLACTLQLSSWLYRTSLFIIACILYQNICHL 210

Query: 210 QILRLQDFATVFQVD-SDVGSVLSEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLL 269
           Q+LRL +FA  F  +  D  S+L+EHL+IRR L+I+SHR+R FIL SL  VT +QF +LL
Sbjct: 211 QVLRLDEFARCFASEIKDFSSILAEHLKIRRELKIVSHRFRRFILLSLFFVTATQFMALL 270

Query: 270 MTTKSS-SLNIYIAGELALCSMTLLTSLMILLRSATKITHKAQSVTALAAKWHVCATLDS 329
            T ++S   NIY  GELALCS +L++ L I L+SAT++THKAQSVT++A KW+VCA+LD+
Sbjct: 271 TTIRASVPFNIYEVGELALCSTSLVSGLFICLKSATQMTHKAQSVTSIATKWNVCASLDT 330

Query: 330 FDVT-DGETPMAAAAPTDGNLMF---PVTPRGDEESEGEEGCDEEDELDNTKLIPAYAYS 389
           FDV  DGETP          ++     V    D++ EG EG D + E+      P +A  
Sbjct: 331 FDVLYDGETPKCPTTTQHSQILSRRRNVVQSSDDDEEG-EGDDNDLEIH-----PIFA-R 390

Query: 390 TISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTI 436
            IS QKRQALVTY ENNRAGIT+YGF +D+T L  IF IEL+L+LWLL KTI
Sbjct: 391 AISSQKRQALVTYLENNRAGITVYGFLVDKTWLRMIFSIELALLLWLLKKTI 433

BLAST of CmoCh01G011620.1 vs. TAIR10
Match: AT1G67570.1 (AT1G67570.1 Protein of unknown function (DUF3537))

HSP 1 Score: 226.5 bits (576), Expect = 3.2e-59
Identity = 141/414 (34.06%), Positives = 239/414 (57.73%), Query Frame = 1

Query: 28  LHSFRSYLRWMCVDQSDIWSAGLSWSVFFLFAIVVPATSHFHLACSSCDSHHARPFDRVV 87
           L    ++L  +  +QS   S  LSW VF    +V+P T      C  C+ +  + F+  +
Sbjct: 51  LEWLETFLTLLGFNQSSKQSLVLSWIVFLSIGLVLPVTVLELGHCLGCERYQYKSFELNI 110

Query: 88  QLSLSSVATVSFLCLSNFIRRYGLRRFLFFDRLCDESETVRRGYTNKFNRSLRVLSAFVV 147
            +S + +A VS LC+S+ +R++G+R+FLF D+L    + ++  Y  +   S+R+L+ + +
Sbjct: 111 VVSQALLAGVSLLCVSHNLRKHGIRKFLFVDQLSGRMDRLKAQYIQQILNSVRLLAVWSL 170

Query: 148 PCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACSMELLSWLYRTTVIFLVCILFRLIC 207
           PCFA +   +I          P+L     S A+  SM +LSW Y +T+      +F L+C
Sbjct: 171 PCFALKGVREIIRMYYVPHDQPWL-----SVAILLSM-ILSWTYLSTIFLAASAMFHLVC 230

Query: 208 DLQILRLQDFATVFQVDSDVGSVLSEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSL 267
           +LQ++  +D+A + + +S++   + EH+R+R +L  ISHR+R F+L   ++VT SQFT+L
Sbjct: 231 NLQVIHFEDYAKLLEGESEISLFIYEHMRLRHYLSKISHRFRIFLLLQFLVVTASQFTTL 290

Query: 268 LMTTKSSSLNIYI-AGELALCSMTLLTSLMILLRSATKITHKAQSVTALAAKWHV---CA 327
             TT  S    YI  G+ A+ ++  +  +++ L +ATKI+H+AQ++ ++A++WH    C+
Sbjct: 291 FQTTAYSGRITYINGGDFAVSAVVQVVGIILCLHAATKISHRAQAIASVASRWHAMMSCS 350

Query: 328 TLDSFDVTDGETPMAAAAPTDGNLMFPVTPRGDEESEGEEGCDEEDELDNTKLIPAYAYS 387
           + DS  +    + +   A T+  + FP++ R D +    E  D    +  T   P+Y  S
Sbjct: 351 STDSTQIRASPSGVHLEATTNPPISFPIS-RSDSD---VESMDHYMRMPVTNQFPSY-MS 410

Query: 388 TISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGF 438
             S+ KRQA V Y + N  GITI+G+T+DR  ++TIF IELSLV ++LGKT+ F
Sbjct: 411 MSSYHKRQAFVLYLQMNPGGITIFGWTVDRHLINTIFFIELSLVTFVLGKTVVF 453

BLAST of CmoCh01G011620.1 vs. NCBI nr
Match: gi|449466580|ref|XP_004151004.1| (PREDICTED: uncharacterized protein LOC101212672 [Cucumis sativus])

HSP 1 Score: 756.5 bits (1952), Expect = 2.5e-215
Identity = 396/439 (90.21%), Positives = 413/439 (94.08%), Query Frame = 1

Query: 1   MGDTSREALISRKAKVFNRSASHAQDELHSFRSYLRWMCVDQSDIWSAGLSWSVFFLFAI 60
           MGD++REALISRK+ VF RS SHA DELHSFRSYLRWMCVDQSDIW+AGLSWS+FFLFAI
Sbjct: 1   MGDSNREALISRKSSVFKRSVSHAHDELHSFRSYLRWMCVDQSDIWTAGLSWSMFFLFAI 60

Query: 61  VVPATSHFHLACSSCDSHHARPFDRVVQLSLSSVATVSFLCLSNFIRRYGLRRFLFFDRL 120
           +VPATSHF LACSSCDS+HARPFDRVVQLSLSSVATVSFLCLS+FIRRYGLRRFLFFD+L
Sbjct: 61  IVPATSHFVLACSSCDSNHARPFDRVVQLSLSSVATVSFLCLSSFIRRYGLRRFLFFDKL 120

Query: 121 CDESETVRRGYTNKFNRSLRVLSAFVVPCFAAESAYKIWWYASGASQIPFLGNVIVSDAV 180
           CDESETVRRGYT KFNRSLRVLS FV+PCFAAESAYKIWWYASGASQIPFLGNVIVSDAV
Sbjct: 121 CDESETVRRGYTIKFNRSLRVLSTFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAV 180

Query: 181 ACSMELLSWLYRTTVIFLVCILFRLICDLQILRLQDFATVFQVDSDVGSVLSEHLRIRRH 240
           AC+MELLSWLYRTTVIFLVCILFRLICDLQILRLQDFATVFQVDSDV SVLSEHLRIRRH
Sbjct: 181 ACAMELLSWLYRTTVIFLVCILFRLICDLQILRLQDFATVFQVDSDVASVLSEHLRIRRH 240

Query: 241 LRIISHRYRAFILWSLILVTGSQFTSLLMTTKSSS-LNIYIAGELALCSMTLLTSLMILL 300
           LRIISHRYR FIL SL+LVTGSQFTSLL+TTKSSS LNIYIAGELALCSMTLLTSLMILL
Sbjct: 241 LRIISHRYRVFILGSLVLVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL 300

Query: 301 RSATKITHKAQSVTALAAKWHVCATLDSFDVTDGETPMAAAAPTDGNLMFPVTPRGDEES 360
           RSATKITHKAQSVTALAAKWHVCATLDSFDVTDGETPMA+      +  FP      EES
Sbjct: 301 RSATKITHKAQSVTALAAKWHVCATLDSFDVTDGETPMASTI----HQAFPPHHGVGEES 360

Query: 361 EGEEGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTI 420
           EG+EGCD ED+LDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTI
Sbjct: 361 EGDEGCD-EDDLDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTI 420

Query: 421 FGIELSLVLWLLGKTIGFS 439
           FGIELSLVLWLLGKTIGFS
Sbjct: 421 FGIELSLVLWLLGKTIGFS 434

BLAST of CmoCh01G011620.1 vs. NCBI nr
Match: gi|659093777|ref|XP_008447712.1| (PREDICTED: uncharacterized protein LOC103490125 [Cucumis melo])

HSP 1 Score: 751.5 bits (1939), Expect = 8.2e-214
Identity = 393/439 (89.52%), Positives = 410/439 (93.39%), Query Frame = 1

Query: 1   MGDTSREALISRKAKVFNRSASHAQDELHSFRSYLRWMCVDQSDIWSAGLSWSVFFLFAI 60
           MGD++REALI+RK+ VF RS SHA DELHSFRSYLRWMCVDQSDIW+AGLSWS+FFLFAI
Sbjct: 1   MGDSNREALINRKSSVFKRSVSHAHDELHSFRSYLRWMCVDQSDIWTAGLSWSMFFLFAI 60

Query: 61  VVPATSHFHLACSSCDSHHARPFDRVVQLSLSSVATVSFLCLSNFIRRYGLRRFLFFDRL 120
           +VPATSHF LACSSCDS+HARPFDRVVQLSLSSVATVSFLCLS+FIRRYGLRRFLFFD+L
Sbjct: 61  IVPATSHFLLACSSCDSNHARPFDRVVQLSLSSVATVSFLCLSSFIRRYGLRRFLFFDKL 120

Query: 121 CDESETVRRGYTNKFNRSLRVLSAFVVPCFAAESAYKIWWYASGASQIPFLGNVIVSDAV 180
           CDESETVRRGYT K NRSLRVLS FV+PCFAAESAYKIWWYASGASQIPFLGNVIVSD V
Sbjct: 121 CDESETVRRGYTIKLNRSLRVLSTFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDVV 180

Query: 181 ACSMELLSWLYRTTVIFLVCILFRLICDLQILRLQDFATVFQVDSDVGSVLSEHLRIRRH 240
           AC MELLSWLYRTTVIFLVCILFRLICDLQILRLQDFATVFQVDSDV SVLSEHLRIRRH
Sbjct: 181 ACCMELLSWLYRTTVIFLVCILFRLICDLQILRLQDFATVFQVDSDVASVLSEHLRIRRH 240

Query: 241 LRIISHRYRAFILWSLILVTGSQFTSLLMTTKSSS-LNIYIAGELALCSMTLLTSLMILL 300
           LRIISHRYR FIL SL+LVTGSQFTSLL+TTKSSS LNIYIAGELALCSMTLLTSLMILL
Sbjct: 241 LRIISHRYRVFILGSLVLVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL 300

Query: 301 RSATKITHKAQSVTALAAKWHVCATLDSFDVTDGETPMAAAAPTDGNLMFPVTPRGDEES 360
           RSATKITHKAQSVTALAAKWHVCATLDSFDVTDGETPMA+      +  FP      EES
Sbjct: 301 RSATKITHKAQSVTALAAKWHVCATLDSFDVTDGETPMASTI----HQAFPTHHGVGEES 360

Query: 361 EGEEGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTI 420
           EG+EGCD ED+LDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTI
Sbjct: 361 EGDEGCD-EDDLDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTI 420

Query: 421 FGIELSLVLWLLGKTIGFS 439
           FGIELSLVLWLLGKTIGFS
Sbjct: 421 FGIELSLVLWLLGKTIGFS 434

BLAST of CmoCh01G011620.1 vs. NCBI nr
Match: gi|590606354|ref|XP_007020713.1| (F11F12.5 protein [Theobroma cacao])

HSP 1 Score: 651.0 bits (1678), Expect = 1.5e-183
Identity = 337/439 (76.77%), Positives = 381/439 (86.79%), Query Frame = 1

Query: 1   MGDTSREALISRKAKVFNRSASHAQDELHSFRSYLRWMCVDQSDIWSAGLSWSVFFLFAI 60
           MGD +RE L+ R    F R+ SHA DEL SFR+YLRWMCVDQSD+W+A LSW VF L  +
Sbjct: 1   MGD-NREPLVDRTR--FMRTISHAHDELQSFRTYLRWMCVDQSDVWTACLSWFVFILLGL 60

Query: 61  VVPATSHFHLACSSCDSHHARPFDRVVQLSLSSVATVSFLCLSNFIRRYGLRRFLFFDRL 120
           VVPA SHF LACS+CD+ HARP+D VVQLSLSSVA++SF+CL+ F+++YGLRRFLFFD+L
Sbjct: 61  VVPAMSHFLLACSTCDARHARPYDWVVQLSLSSVASLSFVCLTRFVKKYGLRRFLFFDKL 120

Query: 121 CDESETVRRGYTNKFNRSLRVLSAFVVPCFAAESAYKIWWYASGASQIPFLGNVIVSDAV 180
           CDESETVR+GYT + NRSL+++S FV+PCF AESAYK+WWYASGASQIPFLG V +SDA+
Sbjct: 121 CDESETVRKGYTGQLNRSLKIVSIFVLPCFVAESAYKVWWYASGASQIPFLGIVWLSDAM 180

Query: 181 ACSMELLSWLYRTTVIFLVCILFRLICDLQILRLQDFATVFQVDSDVGSVLSEHLRIRRH 240
           AC MEL SWLYRTTV FLVC+LFRLIC+LQ+LR+QDFA VFQVDSDVGSVLSEHLRIRRH
Sbjct: 181 ACMMELCSWLYRTTVFFLVCVLFRLICNLQVLRIQDFAQVFQVDSDVGSVLSEHLRIRRH 240

Query: 241 LRIISHRYRAFILWSLILVTGSQFTSLLMTTKSSS-LNIYIAGELALCSMTLLTSLMILL 300
           LRIISHRYRAFILW LILVTGSQFTSLL+TTK+++ LNIY AGELA+CS+TLLT L ILL
Sbjct: 241 LRIISHRYRAFILWCLILVTGSQFTSLLITTKANAELNIYKAGELAMCSITLLTGLSILL 300

Query: 301 RSATKITHKAQSVTALAAKWHVCATLDSFDVTDGETPMAAAAPTDGNLMFPVTPRGDEES 360
           RSATKITHKAQSVT LAAKWHVCATLDSFD  DGETP   A    G   FP     D ES
Sbjct: 301 RSATKITHKAQSVTCLAAKWHVCATLDSFDANDGETPRIPAI--HGCQSFPYVGT-DGES 360

Query: 361 EGEEGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTI 420
           +GE+  DEED++DN+KLIPAYAYSTIS+QKRQALVTYFENNRAGITIYGFTLDR+TLHTI
Sbjct: 361 DGEDAGDEEDDIDNSKLIPAYAYSTISYQKRQALVTYFENNRAGITIYGFTLDRSTLHTI 420

Query: 421 FGIELSLVLWLLGKTIGFS 439
           FGIELSLVLWLLGKTIG S
Sbjct: 421 FGIELSLVLWLLGKTIGIS 433

BLAST of CmoCh01G011620.1 vs. NCBI nr
Match: gi|1000942927|ref|XP_015582109.1| (PREDICTED: uncharacterized protein LOC8269016 [Ricinus communis])

HSP 1 Score: 637.9 bits (1644), Expect = 1.3e-179
Identity = 324/433 (74.83%), Positives = 373/433 (86.14%), Query Frame = 1

Query: 7   EALISRKAKVFNRSASHAQDELHSFRSYLRWMCVDQSDIWSAGLSWSVFFLFAIVVPATS 66
           + L+ RK+  F RS SHA DEL SFRSYLRWMCVDQS+IW   LSW +F LFA+VVPA S
Sbjct: 10  DPLLGRKSSKFTRSVSHAYDELQSFRSYLRWMCVDQSNIWVTCLSWFMFILFAVVVPAIS 69

Query: 67  HFHLACSSCDSHHARPFDRVVQLSLSSVATVSFLCLSNFIRRYGLRRFLFFDRLCDESET 126
           HF LACS+CDS H RP+D VVQLSLSSVAT+SF+CLS FIR+YGLRRFLF D+LCDESET
Sbjct: 70  HFVLACSTCDSKHKRPYDSVVQLSLSSVATLSFVCLSKFIRKYGLRRFLFLDKLCDESET 129

Query: 127 VRRGYTNKFNRSLRVLSAFVVPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACSMEL 186
           VR+GYT++ N SL++LS FV+PCF AESAYKIWWYASGASQIPFLGNVI+SD VAC MEL
Sbjct: 130 VRKGYTDQLNWSLKLLSIFVLPCFVAESAYKIWWYASGASQIPFLGNVILSDTVACIMEL 189

Query: 187 LSWLYRTTVIFLVCILFRLICDLQILRLQDFATVFQVDSDVGSVLSEHLRIRRHLRIISH 246
            SWLYRTT+IFLVC+LF LIC LQILRLQDFA VFQVDSDV SVLSEHLRI+RHLRIISH
Sbjct: 190 CSWLYRTTIIFLVCVLFHLICHLQILRLQDFAQVFQVDSDVESVLSEHLRIKRHLRIISH 249

Query: 247 RYRAFILWSLILVTGSQFTSLLMTTKSSS-LNIYIAGELALCSMTLLTSLMILLRSATKI 306
           RYRAF+LW+LIL TGSQF SLL+TTKS + +++Y AGELALCS+TL+T L+I+LRSATKI
Sbjct: 250 RYRAFVLWALILATGSQFASLLITTKSGAVVDVYRAGELALCSITLVTGLLIILRSATKI 309

Query: 307 THKAQSVTALAAKWHVCATLDSFDVTDGETPMAAAAPTDGNLMFPVTPRGDEESEGEEGC 366
           THKAQSVT+LAAKWH+CATLD+FD T+GETP     PT   +         ++ EG++  
Sbjct: 310 THKAQSVTSLAAKWHICATLDTFDSTEGETP---RTPTANGI--------TDDEEGDDAG 369

Query: 367 DEEDELDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTIFGIELS 426
           DEEDELDN+KLIPAYAYSTISFQKRQALV YFENNRAGIT+YGFTLDR+TLH+IFG+EL+
Sbjct: 370 DEEDELDNSKLIPAYAYSTISFQKRQALVNYFENNRAGITVYGFTLDRSTLHSIFGVELA 429

Query: 427 LVLWLLGKTIGFS 439
           LVLWLLGKT+G S
Sbjct: 430 LVLWLLGKTVGIS 431

BLAST of CmoCh01G011620.1 vs. NCBI nr
Match: gi|297834964|ref|XP_002885364.1| (extracellular ligand-gated ion channel [Arabidopsis lyrata subsp. lyrata])

HSP 1 Score: 636.7 bits (1641), Expect = 2.9e-179
Identity = 331/438 (75.57%), Positives = 374/438 (85.39%), Query Frame = 1

Query: 2   GDTSREALISRKAKVFNRSASHAQDELHSFRSYLRWMCVDQSDIWSAGLSWSVFFLFAIV 61
           G  +RE LI+R+ K F RS SHAQDEL SFR YLRWMCVDQS  W+A LSWS+F +F +V
Sbjct: 17  GRGTRERLINRETK-FTRSVSHAQDELQSFRKYLRWMCVDQSSPWTAVLSWSMFVVFTLV 76

Query: 62  VPATSHFHLACSSCDSHHARPFDRVVQLSLSSVATVSFLCLSNFIRRYGLRRFLFFDRLC 121
           VPATSHF LAC+ CDSHH+RP+D VVQLSLSS A +SFLCLS F+ +YGLRRFLFFD+L 
Sbjct: 77  VPATSHFMLACADCDSHHSRPYDSVVQLSLSSFAALSFLCLSRFVSKYGLRRFLFFDKLW 136

Query: 122 DESETVRRGYTNKFNRSLRVLSAFVVPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVA 181
           DESETVRRGYTN+ NRSL++LS FV PCF A S+YKIWWYASGASQIPFLGNVI+SD VA
Sbjct: 137 DESETVRRGYTNQLNRSLKILSYFVTPCFLAMSSYKIWWYASGASQIPFLGNVILSDTVA 196

Query: 182 CSMELLSWLYRTTVIFLVCILFRLICDLQILRLQDFATVFQVDSDVGSVLSEHLRIRRHL 241
           C MEL SWLYRTTVIFLVC+LFRLIC LQILRLQDFA VFQ+DSDVGS+LSEHLRIRRHL
Sbjct: 197 CLMELCSWLYRTTVIFLVCVLFRLICHLQILRLQDFAQVFQMDSDVGSILSEHLRIRRHL 256

Query: 242 RIISHRYRAFILWSLILVTGSQFTSLLMTTKS-SSLNIYIAGELALCSMTLLTSLMILLR 301
           RIISHRYR FIL SLILVTGSQF SLL+TTK+ + LNIY AGELALCSMTL+T+L+ILLR
Sbjct: 257 RIISHRYRTFILLSLILVTGSQFYSLLITTKAYAELNIYRAGELALCSMTLVTALLILLR 316

Query: 302 SATKITHKAQSVTALAAKWHVCATLDSFDVTDGETPMAAAAPTDGNLMFPVTPRGDEESE 361
           SA+KITHKAQ+VT LAAKWHVCAT++SF+  DGETP        G+  +P T   + ES+
Sbjct: 317 SASKITHKAQAVTCLAAKWHVCATIESFETVDGETPRLVDR-ASGHGYYP-TDDDNGESD 376

Query: 362 GEEGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTIF 421
            E+  DEED+ DN  LIPAYAYSTISFQKRQALV YFENNRAGIT++GFTLDR+TLHTIF
Sbjct: 377 SEDYGDEEDDFDNNNLIPAYAYSTISFQKRQALVNYFENNRAGITVFGFTLDRSTLHTIF 436

Query: 422 GIELSLVLWLLGKTIGFS 439
           GIE+SLVLWLLGKTIG S
Sbjct: 437 GIEMSLVLWLLGKTIGIS 451
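
The Identity and Positives figures reported for each HSP can be checked against the alignment itself. The sketch below assumes the usual BLAST match-line convention (a letter marks an identical residue, "+" marks a conservative substitution, a space marks a mismatch or gap); the function names midline_stats and percent are illustrative and are not part of any BLAST tool.

```python
def midline_stats(query, midline):
    """Return (identities, positives, length) for one alignment block.

    Assumes the BLAST match-line convention: letter = identical residue,
    '+' = conservative substitution, space = mismatch or gap.
    """
    length = len(query)
    # Trailing spaces of the match line are often stripped; pad them back.
    midline = midline.ljust(length)
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, length


def percent(count, total):
    """Percentage rounded to two decimals, as shown in the HSP header."""
    return round(100.0 * count / total, 2)


# Last block of the Arabidopsis lyrata alignment above.
print(midline_stats("GIELSLVLWLLGKTIGFS", "GIE+SLVLWLLGKTIG S"))

# Whole-HSP counts taken from the header line: 331/438 and 374/438.
print(percent(331, 438), percent(374, 438))
```

Summing the per-block counts over every block of an HSP should reproduce the totals in its header line, provided the match lines are copied with their spacing intact.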

The following BLAST results are available for this feature:
Match Name         E-value    Identity (%)  Description
A0A0A0K352_CUCSA   1.8e-215   90.21         Uncharacterized protein OS=Cucumis sativus GN=Csa_7G179620 PE=4 SV=1
A0A061F5U3_THECC   1.0e-183   76.77         F11F12.5 protein OS=Theobroma cacao GN=TCM_030801 PE=4 SV=1
D7LAY2_ARALL       2.0e-179   75.57         Extracellular ligand-gated ion channel OS=Arabidopsis lyrata subsp. lyrata GN=AR...
B9GJ40_POPTR       2.7e-179   75.06         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s36620g PE=4 SV=1
A0A0D2QY93_GOSRA   6.0e-179   74.03         Uncharacterized protein OS=Gossypium raimondii GN=B456_009G443500 PE=4 SV=1

Match Name    E-value    Identity (%)  Description
AT3G20300.1   3.9e-182   75.86         Protein of unknown function (DUF3537)
AT1G50630.1   2.0e-173   72.45         Protein of unknown function (DUF3537)
AT4G22270.1   7.3e-120   55.24         Protein of unknown function (DUF3537)
AT4G03820.2   3.5e-106   51.70         Protein of unknown function (DUF3537)
AT1G67570.1   3.2e-59    34.06         Protein of unknown function (DUF3537)

Match Name                          E-value    Identity (%)  Description
gi|449466580|ref|XP_004151004.1|    2.5e-215   90.21         PREDICTED: uncharacterized protein LOC101212672 [Cucumis sativus]
gi|659093777|ref|XP_008447712.1|    8.2e-214   89.52         PREDICTED: uncharacterized protein LOC103490125 [Cucumis melo]
gi|590606354|ref|XP_007020713.1|    1.5e-183   76.77         F11F12.5 protein [Theobroma cacao]
gi|1000942927|ref|XP_015582109.1|   1.3e-179   74.83         PREDICTED: uncharacterized protein LOC8269016 [Ricinus communis]
gi|297834964|ref|XP_002885364.1|    2.9e-179   75.57         extracellular ligand-gated ion channel [Arabidopsis lyrata subsp. lyrata]

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term        Definition
IPR021924   DUF3537
GO Assignments
This mRNA is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0042538       hyperosmotic salinity response
biological_process   GO:0009737       response to abscisic acid
biological_process   GO:0009409       response to cold
biological_process   GO:0009414       response to water deprivation
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0003674       molecular_function

This mRNA is a part of the following gene feature(s):

Feature Name     Unique Name      Type
CmoCh01G011620   CmoCh01G011620   gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name       Unique Name                Type
CmoCh01G011620.1   CmoCh01G011620.1-protein   polypeptide


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name                         Unique Name                          Type
CmoCh01G011620.1.three_prime_UTR.1   CmoCh01G011620.1.three_prime_UTR.1   three_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name             Unique Name              Type
CmoCh01G011620.1.CDS.4   CmoCh01G011620.1.CDS.4   CDS
CmoCh01G011620.1.CDS.3   CmoCh01G011620.1.CDS.3   CDS
CmoCh01G011620.1.CDS.2   CmoCh01G011620.1.CDS.2   CDS
CmoCh01G011620.1.CDS.1   CmoCh01G011620.1.CDS.1   CDS


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name                        Unique Name                         Type
CmoCh01G011620.1.five_prime_UTR.1   CmoCh01G011620.1.five_prime_UTR.1   five_prime_UTR


The following exon feature(s) are a part of this mRNA:

Feature Name              Unique Name               Type
CmoCh01G011620.1.exon.4   CmoCh01G011620.1.exon.4   exon
CmoCh01G011620.1.exon.3   CmoCh01G011620.1.exon.3   exon
CmoCh01G011620.1.exon.2   CmoCh01G011620.1.exon.2   exon
CmoCh01G011620.1.exon.1   CmoCh01G011620.1.exon.1   exon


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term    IPR Description                       Source    Source Term     Source Description   Alignment
IPR021924   Protein of unknown function DUF3537   PFAM      PF12056         DUF3537              coord: 26..422, score: 6.5E
None        No IPR available                      PANTHER   PTHR31963       FAMILY NOT NAMED     coord: 3..438, score: 6.0E
None        No IPR available                      PANTHER   PTHR31963:SF4   F11F12.5 PROTEIN     coord: 3..438, score: 6.0E