Cp4.1LG18g07060 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG18g07060
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionProtein of unknown function (DUF789)
LocationCp4.1LG18 : 6914663 .. 6918668 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TATTATTATAATGTCATCTTCTTCTTTTAGAACACACTCTCTGGTTCAAGCTTCTTCCTCTCTCTTTCTCTCTTTCTTTCTGCGTTTTCCTTCTCGCGTGGTTTCTTAATTCACACAAAACGCATTTTGGTCATTCATTGCGTGCATTCTTCAAATCTTTGCTGCGAAGTTTTTGAATTGATTTTCAGTTCTCTTCTTCATTTAATCGATTTGGAAACCCTCTGTGGTATTGATTTCTCTCTGATCTGATATTGTTTTGCTAGCATCCTCCATTCAATTCAGTGTTCTTGAGAGATATTTCGAGTAATTTTTTGGAAGAAGAAAGCTGAGTTTAAAGAATCTGCATACTATTTCTTTTAATCGTCATTTCGTCGGCTTTATGCTACTTGGGAGCTTAAAGAAATCGATATTTCCTTTGATTCTGGTTGGTTGAGTTTTTTTTTTTTTCTTCTCAGTTTAGTGAACAAGGTAGAGAGCTTTGGGGTTACCGATTATAATCTGTGTAATTTCCTCCGTTTTCCGTTGATACTCAACGGAGGATTCTTCTTTATTGTCGTTCTTTTGGTTTAGATATAGAAATGTTGGGAACTGCGTTGCAGTTTGGGGGAATCAAAGGTGAGGATCGGTTTTATATTCCGGTGAGGGCACGGAAGAGTTATAATCAGCAAAAGCCATCGAGGAGACCTACCAAGACCGATGAAACTGAGACTCCATCGAGTAAAGTTGTGGCTTCTACTACAACGCCTTCTAAGCCACTAACTCCTCAGTCTAAGAGCAACTTAGAGAGATTCTTGGACGCCACAAAGCCTTCAGTTCCAGCGCAGTACTTCTCTAAGGTAAATGTTAGAATCGAAGAAAGTTCATCACATATAAGCTTTTGATTGATTGCTGATGAAATTTGAGTTAATAACCAAGTTAGGATCATAATCTTAGTTTGATTTCTTGTAGCCCTTTAATGGTTTTCATTGCTATGAATTTGGGCGATATGAACTTGTAATCTTGTATTCTTTCCAGACAACTATGAGGGGTTGGAGGACTTGTGATATTGAATTTCAACCTTACTTCGTTCTGAATGATCTGTGGGAGTCTTTCAAGGAGTGGAGTGCATACGGCGCTGGAGTTCCTTTAGTACTCAATGGAGGTGACTCTGTTGTTCAATATTACGTTCCATATTTGTCTGGTATTCAAATATATGGCGAATCTGCTGCATTGAGATCAGATTCTAAGTCCAGGTACTTGATCTGTGTTTGGAATATGTTAATTCATATTAGTACTTGTTTCTAATCTTTTATATGTGATCATGAAGTTTTAGAACATTGATTTTACTCTGAAGGTGAAAATTTCCTTCTAATTAATGTTTCTTTATAATGGTAGGCTGGCTAATGAGGACAGTGATCTTGACTCTTCTAGAGATACAAGCAGCGAGGGAAGCATTGACTATGAATTTGGTAAAAGCTGTAACTTATCCAGAGAACAGTGGGTTCATCACCATTTAGCTTGTGAAAGCGCTATTACAATGAGAAAGACGTCTTTAAGAGATGAACATAGCACAAGACAAGAAGGTTTTTCGAGTGACGATGGGGATGCAGAATATCCTCGGAGTGGTTTGCTCTTTCAGTTTCTGGAGCAAGATCTTCCTTATCAACGTGTACCGTTGGCTGATAAAGTTAGTGGTTTCTTATTAGTTCTTTTAGTAGTTATATTCTTACACGTTTTGCGATTTCGAGAACCGTATTCGTTTACTCGTACGGTTTATGTTATGACAGATATTTGATCTTGCTTACCAATTTCCTGGTTTGAAAACTTTAAGAAGTTGTGATATCCTGCCAGCCAGTTGGATCTCTGTAGCATGGTAAATATTTGCAGATTCAGTGTAATTTCTAGTTTTGTAGAACATGATATTCATTCATCCACTGGAATTGTAGGTACCCAATATACCGTATACCCACTGGTCCCACATTGAAGGATTTGGATGCTTGCTTCCTAACATATCATTCCCTTTCCACACCCATAAGAGGTACTATGGCATTCTTAGTTTTCTTCTGGTTTGTGCCTGTAGTCTTTGGTAATCTTCAAGATTTACGTTCTCTAGGCCCCATTTACTCGTTTTGTCAACTGAATAAACCCGTTAATTTCGTTTTGTCACGAGCAATCGTAGCCAGAGTTCTTCCGACATTTCAGTCTCATACAAACATTGTTATTTACATGTCATTATCTGACAGATTAGTAGAGATAGTATTGCTCGTGTATCTTTNGGGGGGGGGGGGGGGGATCAATGAAGTTTTATTTCTTCTCAAGTTCGAACTGACCGATAAATTCATTGGCGTTTACATATATGGAGATTTATAGTGTTAAATCATGGATTGTGTACGATATCTCCATTAAGGCATTTGTGTTCATCCTTCATCTGTGTCCGCAGGTAAAAGTTTATTCAAGAATTCAAGTAATTTTTCCTATTTTGTGTTCATTTTGGAGCTTATTCATTGCATTTGTGCTCGTCTCTAGCCACTTATTTCTACTATGAGTCTTGACTCAACATTCTTAATTCGGACATACATTTGTTTCAAAATGTAAGAAAATACTATCTATTTTCCTTCTTCATCCAGAGCACTTACGTTCTTGAGCCTGTCTTTGTCACTAGCCACCATTGTCTACAACAATCTACTTTTGTCAATGTTGTTCAAAGATCGTTGGTTTGGCTTGTTTTCTGCAAGTGTTGATCGTTTTCTTTGCTTCACAGGTAATGGACATGGTCAGGCACCAGCAATGATATATCCAAATGACAACGATGGTATCCCAAAGGTCTCCTTGCCTGTCTTTGGATTGGCTTCTTATAAGCTGAAAGGCTCAATATGGGCGCAAAATTGTGTCAAAGAGCATCAAATGGCAAATTCCCTCATGCAGGCGGCAGAGAAGTGGCTGAGGCGCCTTCAGGTCAATCAACCTGATTTTCAGTTCTTTGCATCGAATATGACATACTGGAGATGATAAGAAATGACTGAAAGCATGTCTCTACTTCCATTCTGCCCACTCGAACTACCAAAATAACAAACTCGTGCTCTTATCGACTATTCGACCTGCACTCAATACCGGTAAGGATACATTTTGGTTTTGGGTGGTTTGTTCTTTTGGGGAGTACACTGCATGGAGGCAGTTAAGAAAAATATTCACACCAACTGGCCAAAAATCATGTGGAAAATGGGGGAGGGAGTAATAAAACTATGACATGGGAGGGCAAAAGCAATATGGAAAGTAATGGCAATCAAGATGAATGTTGCCTATAGCATTATACACTGCCCTTGGAAACTTGTGAGAGTCATCGTTTGTCGTCGTTGCATCGATATGGTCTTTAGTATGGTCTTTTGTGAGCTTGAAGGAAACTTGTGAGAGTCGTCGTCGTTTGTCATTGTCGTTGCATCGATACGGTCTTCAGTGTGGTCTTTTGTGAACTTGAAGGAGTACAAATTTGTATGTATATTACGTAAGATTTGATGTAAGGTATCTGTTTTAGATTCACACAAAGTATAGATTAAAGATTATAGTTCTAGCATTTTATGTTTCCGAATATGGGCCATTCTGGATTGAATAAGGAACCATATTCTTCCCCATCCAACAATGGAGAATCAGTGTTCTTTTCCATTGGCAAACCAGATGCTTATTCTGAAAGGTTACTTTGTTTTTGACATTCTCACTAGTTTAGACCCAACAGTTTCTTGCCCAACATGTATAGTCATATGGACACAGAAGAAAGCTGAAGTGATATCGTTGTTGAGCACGTGGGGAAAGGAGTTCTAGGGTCGTGGTGACTCGCTCACTACAAGAGAAGTAAGACTCGGATTACTTTGTTGCTCAAATATCTCAAGACAAGAATATTTGTTTGAGATTTGAATCACTGTACAAACAATCACTGTACTTCATTTTTCATGGCTTTATATATCCTCAAAATTAAAATCCTTAACTTTCCACGAGACATTTCTCAATTATGACATATTAAGTTAGGTTGCTA

mRNA sequence

TATTATTATAATGTCATCTTCTTCTTTTAGAACACACTCTCTGGTTCAAGCTTCTTCCTCTCTCTTTCTCTCTTTCTTTCTGCGTTTTCCTTCTCGCGTGGTTTCTTAATTCACACAAAACGCATTTTGGTCATTCATTGCGTGCATTCTTCAAATCTTTGCTGCGAAGTTTTTGAATTGATTTTCAGTTCTCTTCTTCATTTAATCGATTTGGAAACCCTCTGTGGTATTGATTTCTCTCTGATCTGATATTGTTTTGCTAGCATCCTCCATTCAATTCAGTGTTCTTGAGAGATATTTCGAGTAATTTTTTGGAAGAAGAAAGCTGAGTTTAAAGAATCTGCATACTATTTCTTTTAATCGTCATTTCGTCGGCTTTATGCTACTTGGGAGCTTAAAGAAATCGATATTTCCTTTGATTCTGGTTGGTTGAGTTTTTTTTTTTTTCTTCTCAGTTTAGTGAACAAGGTAGAGAGCTTTGGGGTTACCGATTATAATCTGTGTAATTTCCTCCGTTTTCCGTTGATACTCAACGGAGGATTCTTCTTTATTGTCGTTCTTTTGGTTTAGATATAGAAATGTTGGGAACTGCGTTGCAGTTTGGGGGAATCAAAGGTGAGGATCGGTTTTATATTCCGGTGAGGGCACGGAAGAGTTATAATCAGCAAAAGCCATCGAGGAGACCTACCAAGACCGATGAAACTGAGACTCCATCGAGTAAAGTTGTGGCTTCTACTACAACGCCTTCTAAGCCACTAACTCCTCAGTCTAAGAGCAACTTAGAGAGATTCTTGGACGCCACAAAGCCTTCAGTTCCAGCGCAGTACTTCTCTAAGACAACTATGAGGGGTTGGAGGACTTGTGATATTGAATTTCAACCTTACTTCGTTCTGAATGATCTGTGGGAGTCTTTCAAGGAGTGGAGTGCATACGGCGCTGGAGTTCCTTTAGTACTCAATGGAGGTGACTCTGTTGTTCAATATTACGTTCCATATTTGTCTGGTATTCAAATATATGGCGAATCTGCTGCATTGAGATCAGATTCTAAGTCCAGGCTGGCTAATGAGGACAGTGATCTTGACTCTTCTAGAGATACAAGCAGCGAGGGAAGCATTGACTATGAATTTGGTAAAAGCTGTAACTTATCCAGAGAACAGTGGGTTCATCACCATTTAGCTTGTGAAAGCGCTATTACAATGAGAAAGACGTCTTTAAGAGATGAACATAGCACAAGACAAGAAGGTTTTTCGAGTGACGATGGGGATGCAGAATATCCTCGGAGTGGTTTGCTCTTTCAGTTTCTGGAGCAAGATCTTCCTTATCAACGTGTACCGTTGGCTGATAAAATATTTGATCTTGCTTACCAATTTCCTGGTTTGAAAACTTTAAGAAGTTGTGATATCCTGCCAGCCAGTTGGATCTCTGTAGCATGGTACCCAATATACCGTATACCCACTGGTCCCACATTGAAGGATTTGGATGCTTGCTTCCTAACATATCATTCCCTTTCCACACCCATAAGAGGTAATGGACATGGTCAGGCACCAGCAATGATATATCCAAATGACAACGATGGTATCCCAAAGGTCTCCTTGCCTGTCTTTGGATTGGCTTCTTATAAGCTGAAAGGCTCAATATGGGCGCAAAATTGTGTCAAAGAGCATCAAATGGCAAATTCCCTCATGCAGGCGGCAGAGAAGTGGCTGAGGCGCCTTCAGGTCAATCAACCTGATTTTCAGTTCTTTGCATCGAATATGACATACTGGAGATGATAAGAAATGACTGAAAGCATGTCTCTACTTCCATTCTGCCCACTCGAACTACCAAAATAACAAACTCGTGCTCTTATCGACTATTCGACCTGCACTCAATACCGGTAAGGATACATTTTGGTTTTGGGTGGTTTGTTCTTTTGGGGAGTACACTGCATGGAGGCAGTTAAGAAAAATATTCACACCAACTGGCCAAAAATCATGTGGAAAATGGGGGAGGGAGTAATAAAACTATGACATGGGAGGGCAAAAGCAATATGGAAAGTAATGGCAATCAAGATGAATGTTGCCTATAGCATTATACACTGCCCTTGGAAACTTGTGAGAGTCATCGTTTGTCGTCGTTGCATCGATATGGTCTTTAGTATGGTCTTTTGTGAGCTTGAAGGAAACTTGTGAGAGTCGTCGTCGTTTGTCATTGTCGTTGCATCGATACGGTCTTCAGTGTGGTCTTTTGTGAACTTGAAGGAGTACAAATTTGTATGTATATTACGTAAGATTTGATGTAAGGTATCTGTTTTAGATTCACACAAAGTATAGATTAAAGATTATAGTTCTAGCATTTTATGTTTCCGAATATGGGCCATTCTGGATTGAATAAGGAACCATATTCTTCCCCATCCAACAATGGAGAATCAGTGTTCTTTTCCATTGGCAAACCAGATGCTTATTCTGAAAGGTTACTTTGTTTTTGACATTCTCACTAGTTTAGACCCAACAGTTTCTTGCCCAACATGTATAGTCATATGGACACAGAAGAAAGCTGAAGTGATATCGTTGTTGAGCACGTGGGGAAAGGAGTTCTAGGGTCGTGGTGACTCGCTCACTACAAGAGAAGTAAGACTCGGATTACTTTGTTGCTCAAATATCTCAAGACAAGAATATTTGTTTGAGATTTGAATCACTGTACAAACAATCACTGTACTTCATTTTTCATGGCTTTATATATCCTCAAAATTAAAATCCTTAACTTTCCACGAGACATTTCTCAATTATGACATATTAAGTTAGGTTGCTA

Coding sequence (CDS)

ATGTTGGGAACTGCGTTGCAGTTTGGGGGAATCAAAGGTGAGGATCGGTTTTATATTCCGGTGAGGGCACGGAAGAGTTATAATCAGCAAAAGCCATCGAGGAGACCTACCAAGACCGATGAAACTGAGACTCCATCGAGTAAAGTTGTGGCTTCTACTACAACGCCTTCTAAGCCACTAACTCCTCAGTCTAAGAGCAACTTAGAGAGATTCTTGGACGCCACAAAGCCTTCAGTTCCAGCGCAGTACTTCTCTAAGACAACTATGAGGGGTTGGAGGACTTGTGATATTGAATTTCAACCTTACTTCGTTCTGAATGATCTGTGGGAGTCTTTCAAGGAGTGGAGTGCATACGGCGCTGGAGTTCCTTTAGTACTCAATGGAGGTGACTCTGTTGTTCAATATTACGTTCCATATTTGTCTGGTATTCAAATATATGGCGAATCTGCTGCATTGAGATCAGATTCTAAGTCCAGGCTGGCTAATGAGGACAGTGATCTTGACTCTTCTAGAGATACAAGCAGCGAGGGAAGCATTGACTATGAATTTGGTAAAAGCTGTAACTTATCCAGAGAACAGTGGGTTCATCACCATTTAGCTTGTGAAAGCGCTATTACAATGAGAAAGACGTCTTTAAGAGATGAACATAGCACAAGACAAGAAGGTTTTTCGAGTGACGATGGGGATGCAGAATATCCTCGGAGTGGTTTGCTCTTTCAGTTTCTGGAGCAAGATCTTCCTTATCAACGTGTACCGTTGGCTGATAAAATATTTGATCTTGCTTACCAATTTCCTGGTTTGAAAACTTTAAGAAGTTGTGATATCCTGCCAGCCAGTTGGATCTCTGTAGCATGGTACCCAATATACCGTATACCCACTGGTCCCACATTGAAGGATTTGGATGCTTGCTTCCTAACATATCATTCCCTTTCCACACCCATAAGAGGTAATGGACATGGTCAGGCACCAGCAATGATATATCCAAATGACAACGATGGTATCCCAAAGGTCTCCTTGCCTGTCTTTGGATTGGCTTCTTATAAGCTGAAAGGCTCAATATGGGCGCAAAATTGTGTCAAAGAGCATCAAATGGCAAATTCCCTCATGCAGGCGGCAGAGAAGTGGCTGAGGCGCCTTCAGGTCAATCAACCTGATTTTCAGTTCTTTGCATCGAATATGACATACTGGAGATGA

Protein sequence

MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSKVVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQPYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSRDTSSEGSIDYEFGKSCNLSREQWVHHHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASNMTYWR
BLAST of Cp4.1LG18g07060 vs. TrEMBL
Match: A0A0A0LS49_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G024240 PE=4 SV=1)

HSP 1 Score: 673.7 bits (1737), Expect = 1.4e-190
Identity = 327/397 (82.37%), Postives = 354/397 (89.17%), Query Frame = 1

Query: 1   MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSKVVASTTTPSKPL 60
           MLGT LQFGGIKGEDRFY+PVRARK+YNQQKPSR PTKTDETE+ SSKVV  TT P + L
Sbjct: 1   MLGTTLQFGGIKGEDRFYVPVRARKNYNQQKPSRNPTKTDETESLSSKVVGCTTKPCEEL 60

Query: 61  TPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQPYFVLNDLWESFKEWSAYGA 120
           TPQSKSNLERFL+AT+PSVPAQYFSKTTMR WRTCDIEFQPYF+LNDLWESFKEWSAYGA
Sbjct: 61  TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA 120

Query: 121 GVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSRDTSSEGSID 180
           GVPLVL+GGDSVVQYYVPYLSGIQIYGE+AALRSDS  RLA EDSDLDSSRDTSS+GSID
Sbjct: 121 GVPLVLDGGDSVVQYYVPYLSGIQIYGEAAALRSDSHVRLACEDSDLDSSRDTSSDGSID 180

Query: 181 YEFGKSCNLSREQWVHHHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQ 240
           ++ GKS N SREQW H HLACE+ + MRKTSL DEH   QEGF SDDGDA YPRS LLFQ
Sbjct: 181 HDLGKSFNFSREQWDHPHLACENMLKMRKTSLTDEHKMVQEGFLSDDGDAGYPRSSLLFQ 240

Query: 241 FLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL 300
           FLEQDLPYQRVPLADKIF+LAYQFPGLKTL SCDILPASW+SVAWYPIYRIPTGPTLKDL
Sbjct: 241 FLEQDLPYQRVPLADKIFELAYQFPGLKTLSSCDILPASWVSVAWYPIYRIPTGPTLKDL 300

Query: 301 DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVK 360
           DACFLTYHSLSTP +GN H   P M+YP D D I K+SLPVFG+ASYK+KGSIW QN + 
Sbjct: 301 DACFLTYHSLSTPKKGNRHSLPPIMVYPKDIDDITKISLPVFGMASYKVKGSIWGQNGIS 360

Query: 361 EHQMANSLMQAAEKWLRRLQVNQPDFQFFASNMTYWR 398
           +HQ ANSLMQAA+KWLR LQV+QPDFQFF+S+ TYWR
Sbjct: 361 DHQKANSLMQAADKWLRSLQVSQPDFQFFSSHGTYWR 397

BLAST of Cp4.1LG18g07060 vs. TrEMBL
Match: A0A0D2S4J5_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G005300 PE=4 SV=1)

HSP 1 Score: 503.8 bits (1296), Expect = 1.9e-139
Identity = 259/422 (61.37%), Postives = 302/422 (71.56%), Query Frame = 1

Query: 1   MLGTALQFGGIKGEDRFYIPVRARKSYNQ---QKPSRRPTKTDETETPSS---------- 60
           MLG  LQFG ++GEDRFYIPV+AR++ NQ   QKP +   K D  ++ S           
Sbjct: 1   MLGAGLQFGKVRGEDRFYIPVKARRNQNQKQQQKPKQEAVKEDNEKSNSKSSASLTKSKD 60

Query: 61  --------------KVVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWR 120
                         K +AS+T PS   +  S+SNLERFL++T PSVPAQYFSKTT+RGWR
Sbjct: 61  LASGNNNNKNINPKKTLASSTIPSSEESRVSRSNLERFLESTTPSVPAQYFSKTTVRGWR 120

Query: 121 TCDIEFQPYFVLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALR 180
           TCD+EFQPYFVL DLWESFKEWSAYGAGVPLVL+G D VVQYYVPYLSGIQ+YGESA   
Sbjct: 121 TCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLDGNDGVVQYYVPYLSGIQLYGESA--- 180

Query: 181 SDSKSRLANEDSDLDSSRDTSSEGSIDYEFGKSCNLSREQWVHHHLACESAITMRKTSLR 240
              K RLA E+S+ D  RD+SS+GS DYE GK    SREQ+    L  E    +R  S+ 
Sbjct: 181 ---KQRLAGEESENDYYRDSSSDGSSDYEIGKGIKFSREQFSRFSLTNEIPFRVRSLSIS 240

Query: 241 DEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSC 300
           DE+S  QEGFSSDD +    R  LLF+F E   PY R  LADKIFDL+ ++PGL TLRSC
Sbjct: 241 DENSMLQEGFSSDDCETRNSRDHLLFEFFEHKTPYSRESLADKIFDLSCKYPGLNTLRSC 300

Query: 301 DILPASWISVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDG 360
           D+LP SW+SVAWYPIYRIPTG TLKDLDACFLTYHSL TP+ GNG+GQ P ++YP+D +G
Sbjct: 301 DLLPISWMSVAWYPIYRIPTGSTLKDLDACFLTYHSLCTPMEGNGNGQTPFLVYPDDANG 360

Query: 361 IPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASNM 396
           IPK+SLPVFG+  YKLKGSIW QN V E Q ANSLMQA E WL+ LQV  PDFQFFAS+ 
Sbjct: 361 IPKISLPVFGMGCYKLKGSIWTQNGVSECQHANSLMQATENWLKLLQVYHPDFQFFASHG 416

BLAST of Cp4.1LG18g07060 vs. TrEMBL
Match: W9SER1_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_013258 PE=4 SV=1)

HSP 1 Score: 499.2 bits (1284), Expect = 4.6e-138
Identity = 259/410 (63.17%), Postives = 312/410 (76.10%), Query Frame = 1

Query: 1   MLGTALQFGGIKGEDRFYIPVRARKSYNQ----QKPSRR---PTKTDETETPSSKVVAS- 60
           MLGT LQFG ++GEDRFYIPV+ARK+ NQ    QK  RR     K +E+   S+K +AS 
Sbjct: 1   MLGTGLQFGTVRGEDRFYIPVKARKNNNQNNNQQKQIRRLKSDNKNNESPDASTKSMASD 60

Query: 61  -----TTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQPYFVLND 120
                +  PS   +    SNLERFL++T P VPAQYFSKTTMRGWRTCD+EFQ YF LND
Sbjct: 61  CRNPISEEPSSQPSITPSSNLERFLESTTPFVPAQYFSKTTMRGWRTCDVEFQHYFPLND 120

Query: 121 LWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDL 180
           LWESFKEWSAYGAGVPLVL+G DSV+QYYVPYLSGIQ+YGES+  RS++KSR  +EDSD 
Sbjct: 121 LWESFKEWSAYGAGVPLVLDGSDSVIQYYVPYLSGIQLYGESSG-RSNTKSRQTSEDSDG 180

Query: 181 DSSRDTSSEGSIDYEFGKSCNLSREQWVHHHLACESAITMRKTSLRDEHSTRQEGFSSDD 240
           D  +D+SS+GS DYE  K    S EQ   H L  +++  M + SL DE S  QEGFSSDD
Sbjct: 181 DYYKDSSSDGSSDYEIVKGMKFSGEQRNLHQLTNQTSFRMGRLSLHDEQSASQEGFSSDD 240

Query: 241 GDAEYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYP 300
           G+A+  +  LL+++LE+D PY R PLADK  +LA ++PGLKTLRSCD+LPASW+SVAWYP
Sbjct: 241 GEAQNSQGVLLYEYLERDPPYSREPLADKA-NLASRYPGLKTLRSCDLLPASWLSVAWYP 300

Query: 301 IYRIPTGPTLKDLDACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASY 360
           IYRIPTGPTLKDLDACFLTYHSLSTP+ G+ + QAP +++P + DG+PK SL VFG+ASY
Sbjct: 301 IYRIPTGPTLKDLDACFLTYHSLSTPMTGSENTQAPIVVFPREIDGVPKFSLTVFGMASY 360

Query: 361 KLKGSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASNMTYWR 398
           K KGS+W QN + E  +ANSLMQAA+ WLRRLQV  PDFQFFAS+  Y R
Sbjct: 361 KFKGSMWIQNGITECHLANSLMQAADNWLRRLQVTHPDFQFFASHGIYCR 408

BLAST of Cp4.1LG18g07060 vs. TrEMBL
Match: A0A067E421_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g015519mg PE=4 SV=1)

HSP 1 Score: 495.7 bits (1275), Expect = 5.1e-137
Identity = 253/407 (62.16%), Postives = 299/407 (73.46%), Query Frame = 1

Query: 1   MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRP----------TKTDETETPSSKVV 60
           ML T LQFG ++GEDRFY+PV+ARK+ NQ++  ++P           K DE E+  S   
Sbjct: 1   MLPTNLQFGRVRGEDRFYVPVKARKNQNQKQQQQQPQQQKQKQAEAAKIDENESIISPES 60

Query: 61  ASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQPYFVLNDLWE 120
            +    +   +  S SNL+RFL +T PSV AQYFSK TMRGWRTCD+EFQPYF+L DLWE
Sbjct: 61  EALKKAASAESVVSVSNLDRFLKSTTPSVLAQYFSKKTMRGWRTCDVEFQPYFMLGDLWE 120

Query: 121 SFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSS 180
           SFKEWS YGAGVPLVL+G D VVQYYVPYLSGIQ+YGES    S +KSR + EDSD +  
Sbjct: 121 SFKEWSVYGAGVPLVLDGSDCVVQYYVPYLSGIQLYGEST--ESAAKSRQSAEDSDGEYY 180

Query: 181 RDTSSEGSIDYEFGKSCNLSREQWVHHHLACESAITMRKTSLRDEHSTRQEGFSSDDGDA 240
           RD+SS+GS DYE  K    SR +   +HL  +    +   S+ DE+ST QEGFSSDD + 
Sbjct: 181 RDSSSDGSSDYEVEKGSKFSRVRQGRYHLTNDDPCRIGSLSISDENSTMQEGFSSDDSEV 240

Query: 241 EYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYR 300
              R  LLFQ+LEQD PY R PLADKI DLA ++PGL TLRSCD+LP SW+SVAWYPIYR
Sbjct: 241 GSSRGHLLFQYLEQDTPYSREPLADKIADLAGRYPGLTTLRSCDLLPISWMSVAWYPIYR 300

Query: 301 IPTGPTLKDLDACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLK 360
           IPTGPTLKDLDACFLTYHSLSTP++G G GQAP ++YP++ DG+PKVSLP FG+ASYK K
Sbjct: 301 IPTGPTLKDLDACFLTYHSLSTPLKGIGSGQAPIVVYPSEIDGVPKVSLPTFGMASYKFK 360

Query: 361 GSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASNMTYWR 398
           G +W QN V EHQ ANSL QAAE WLR L VN PDFQFFAS+  Y R
Sbjct: 361 GPMWIQNGVSEHQQANSLAQAAENWLRLLHVNHPDFQFFASHGMYHR 405

BLAST of Cp4.1LG18g07060 vs. TrEMBL
Match: A0A067E3M8_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g015519mg PE=4 SV=1)

HSP 1 Score: 493.8 bits (1270), Expect = 1.9e-136
Identity = 252/407 (61.92%), Postives = 298/407 (73.22%), Query Frame = 1

Query: 1   MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRP----------TKTDETETPSSKVV 60
           ML T LQFG ++GEDRFY+PV+ARK+ NQ++  ++P           K DE E+  S   
Sbjct: 1   MLPTNLQFGRVRGEDRFYVPVKARKNQNQKQQQQQPQQQKQKQAEAAKIDENESIISPES 60

Query: 61  ASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQPYFVLNDLWE 120
            +    +   +  S SNL+RFL +T PSV AQYFSK TMRGWRTCD+EFQPYF+L DLWE
Sbjct: 61  EALKKAASAESVVSVSNLDRFLKSTTPSVLAQYFSKKTMRGWRTCDVEFQPYFMLGDLWE 120

Query: 121 SFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSS 180
           SFKEWS YGAGVPLVL+G D VVQYYVPYLSGIQ+YGES    S +KSR + EDSD +  
Sbjct: 121 SFKEWSVYGAGVPLVLDGSDCVVQYYVPYLSGIQLYGEST--ESAAKSRQSAEDSDGEYY 180

Query: 181 RDTSSEGSIDYEFGKSCNLSREQWVHHHLACESAITMRKTSLRDEHSTRQEGFSSDDGDA 240
           RD+SS+GS DYE  K    SR +   +HL  +    +   S+ DE+ST QEGFSSDD + 
Sbjct: 181 RDSSSDGSSDYEVEKGSKFSRVRQGRYHLTNDDPCRIGSLSISDENSTMQEGFSSDDSEV 240

Query: 241 EYPRSGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYR 300
              R  LLFQ+LEQD PY R PLADK  DLA ++PGL TLRSCD+LP SW+SVAWYPIYR
Sbjct: 241 GSSRGHLLFQYLEQDTPYSREPLADKATDLAGRYPGLTTLRSCDLLPISWMSVAWYPIYR 300

Query: 301 IPTGPTLKDLDACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLK 360
           IPTGPTLKDLDACFLTYHSLSTP++G G GQAP ++YP++ DG+PKVSLP FG+ASYK K
Sbjct: 301 IPTGPTLKDLDACFLTYHSLSTPLKGIGSGQAPIVVYPSEIDGVPKVSLPTFGMASYKFK 360

Query: 361 GSIWAQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASNMTYWR 398
           G +W QN V EHQ ANSL QAAE WLR L VN PDFQFFAS+  Y R
Sbjct: 361 GPMWIQNGVSEHQQANSLAQAAENWLRLLHVNHPDFQFFASHGMYHR 405

BLAST of Cp4.1LG18g07060 vs. TAIR10
Match: AT1G15030.1 (AT1G15030.1 Protein of unknown function (DUF789))

HSP 1 Score: 335.1 bits (858), Expect = 5.8e-92
Identity = 185/328 (56.40%), Postives = 221/328 (67.38%), Query Frame = 1

Query: 64  SKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQ-PYFVLNDLWESFKEWSAYGAGV 123
           S SN+ERFLD+  PSVPA Y SKT +R     D+E Q PYF+L D+WESF EWSAYG GV
Sbjct: 45  SSSNVERFLDSVTPSVPAHYLSKTIVRERGGSDVESQVPYFLLGDVWESFAEWSAYGIGV 104

Query: 124 PLVLNGG-DSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSRDTSSEGSIDY 183
           PL LN   D V QYYVP LSGIQ+Y +  AL S  ++R   E+S+ D  RD+SSEGS   
Sbjct: 105 PLTLNNNKDRVFQYYVPSLSGIQVYADVDALTSSLQARRQGEESESDF-RDSSSEGSSS- 164

Query: 184 EFGKSCNLSREQWVHHHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQF 243
           E  +    S+EQ          +  M K SLR EH   QE  SSDDG+    +  L+F++
Sbjct: 165 ESERGLCYSKEQ---------ISARMDKLSLRKEH---QEDSSSDDGEPLSSQGRLIFEY 224

Query: 244 LEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDLD 303
           LE+DLPY R P ADK+ DLA +FP LKTLRSCD+LP+SW SVAWYPIY+IPTGPTLKDLD
Sbjct: 225 LERDLPYVREPFADKMSDLASRFPELKTLRSCDLLPSSWFSVAWYPIYKIPTGPTLKDLD 284

Query: 304 ACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKE 363
           ACFLTYHSL TP +G G     +M      + + K+ LPVFGLASYKL+GS+W       
Sbjct: 285 ACFLTYHSLHTPFQGPG-VTTGSMHVVQPRESVEKMELPVFGLASYKLRGSVWTSFGGSG 344

Query: 364 HQMANSLMQAAEKWLRRLQVNQPDFQFF 390
           HQ+ANSL QAA+ WLR  QVN PDF FF
Sbjct: 345 HQLANSLFQAADNWLRLRQVNHPDFIFF 357

BLAST of Cp4.1LG18g07060 vs. TAIR10
Match: AT2G01260.1 (AT2G01260.1 Protein of unknown function (DUF789))

HSP 1 Score: 335.1 bits (858), Expect = 5.8e-92
Identity = 200/395 (50.63%), Postives = 247/395 (62.53%), Query Frame = 1

Query: 1   MLGTALQFG-GIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSKVVASTTTPSKP 60
           MLG   Q   G  G+D FY   + R++ NQ+    R  ++D +  PSS    + +   + 
Sbjct: 1   MLGAGFQLTRGRHGDDPFYTSAKTRRA-NQRIDQLRRAQSDVSNVPSS----APSPHKQQ 60

Query: 61  LTPQ--SKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDI--EFQPYFVLNDLWESFKEW 120
           L P   S SNL+RFL++  PSVPAQ+ SKT +R  R  D   +  PYFVL D+W+SF EW
Sbjct: 61  LEPSDLSSSNLDRFLESVTPSVPAQFLSKTLLRERRADDDYNKLVPYFVLGDIWDSFAEW 120

Query: 121 SAYGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSRDTS 180
           SAYG GVPLVLN   D V+QYYVP LS IQIY  S AL S  KSR   + SD D  RD+S
Sbjct: 121 SAYGTGVPLVLNNNKDRVIQYYVPSLSAIQIYAHSHALDSSLKSRRPGDSSDSDF-RDSS 180

Query: 181 SEGSIDYEFGKSCNLSREQWVHHHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPR 240
           S+ S D         S  + V   + C         SLRD+H   QE  SSDDG+    +
Sbjct: 181 SDVSSD---------SDSERVSARVDC--------ISLRDQH---QEDSSSDDGEPLGSQ 240

Query: 241 SGLLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTG 300
             L+F++LE+DLPY R P ADK+ DLA QFP L TLRSCD+L +SW SVAWYPIYRIPTG
Sbjct: 241 GRLMFEYLERDLPYIREPFADKVLDLAAQFPELMTLRSCDLLRSSWFSVAWYPIYRIPTG 300

Query: 301 PTLKDLDACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIW 360
           PTLKDLDACFLTYHSL T   G G  Q+ ++  P +++   K+SLPVFGLASYK +GS+W
Sbjct: 301 PTLKDLDACFLTYHSLHTSFGGEGSEQSMSLTQPRESE---KMSLPVFGLASYKFRGSLW 360

Query: 361 AQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFF 390
                 EHQ+ NSL QAA+KWL    V+ PDF FF
Sbjct: 361 TPIGGSEHQLVNSLFQAADKWLHSCHVSHPDFLFF 366

BLAST of Cp4.1LG18g07060 vs. TAIR10
Match: AT4G16100.1 (AT4G16100.1 Protein of unknown function (DUF789))

HSP 1 Score: 300.8 bits (769), Expect = 1.2e-81
Identity = 185/417 (44.36%), Postives = 250/417 (59.95%), Query Frame = 1

Query: 11  IKGEDRFYIPVRARKSYNQQKPSR------------------RPTKTDETE-------TP 70
           I+GE+RFY P   RK   +++  R                  R  K +E E       + 
Sbjct: 7   IRGENRFYNPPPMRKLQQEREKKRLEAEEIEKEKKKAKEILDRKIKVEEKEIKQPEECST 66

Query: 71  SSKVVASTTTPSKPLTPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQPYFVL 130
           S   V S  + +   T  + SNL RFLD T P V  Q+   T+ +GWRT + E++PYF+L
Sbjct: 67  SDCSVPSRVSSTTTTTGTTSSNLGRFLDCTTPIVSTQHLPLTSSKGWRTREPEYRPYFLL 126

Query: 131 NDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDS 190
           NDLW+SF+EWSAYG GVPL+LNG DSVVQYYVPYLSGIQ+Y + +  R+ +  R   E+S
Sbjct: 127 NDLWDSFEEWSAYGVGVPLLLNGIDSVVQYYVPYLSGIQLYEDPS--RACTTRRRVGEES 186

Query: 191 DLDSSRDTSSEGSIDYEFGKSCNLSREQWVHHHLACESAITMRKTSLRDEHSTRQEGFSS 250
           D DS RD SS+GS D      C              E +  + + SL ++      G SS
Sbjct: 187 DGDSPRDMSSDGSND------CR-------------ELSQNLYRASLEEKPCI---GSSS 246

Query: 251 DDGDAEYPRSG-LLFQFLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVA 310
           D+ +A     G L+F++LE  +P+ R PL DKI +L+ QFP L+T RSCD+ P+SW+SVA
Sbjct: 247 DESEASSNSPGELVFEYLEGAMPFGREPLTDKISNLSSQFPALRTYRSCDLSPSSWVSVA 306

Query: 311 WYPIYRIPTGPTLKDLDACFLTYHSLSTPIRG--NGHGQAPAMIYPNDNDGIPKVSLPVF 370
           WYPIYRIP G +L++LDACFLT+HSLSTP RG  N  GQ+ +    +      K+ LP F
Sbjct: 307 WYPIYRIPLGQSLQNLDACFLTFHSLSTPCRGTSNEEGQSSSKSVAS-----AKLPLPTF 366

Query: 371 GLASYKLKGSIWA-QNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFASNM-TYWR 398
           GLASYK K S W+ ++ V E+Q   +L++ AE+WLRRL+V  PDF+ F S+  + WR
Sbjct: 367 GLASYKFKLSEWSPESDVDENQRVGTLLRTAEEWLRRLKVILPDFRHFISHSGSAWR 394

BLAST of Cp4.1LG18g07060 vs. TAIR10
Match: AT5G49220.1 (AT5G49220.1 Protein of unknown function (DUF789))

HSP 1 Score: 271.2 bits (692), Expect = 1.0e-72
Identity = 176/441 (39.91%), Postives = 234/441 (53.06%), Query Frame = 1

Query: 3   GTALQFGGIKGEDRFYIPVRARKSYN----QQKPSRRPTKTDETET------PSSKVVAS 62
           G ++    I+GE+RFY P   R+       QQ+   +  + DE E         +  VA 
Sbjct: 6   GVSIARTAIRGENRFYNPPPMRRMQQEAQLQQQIREKQRRDDEDEVLMDKERRKAATVAP 65

Query: 63  TTTPSKPLTPQSKS----------------------------NLERFLDATKPSVPAQYF 122
            TT       +SKS                            NL+RFL+ T P VPA+ F
Sbjct: 66  RTTRKGLGVSESKSRVVVSGSEVCAGSSDSSSGSGRVLSDGSNLDRFLEHTTPVVPARLF 125

Query: 123 SKTTMRGWRTCDIEFQPYFVLNDLWESFKEWSAYGAGVPLVLN-----GGDSVVQYYVPY 182
              +    +T + +   YFVL DLWESF EWSAYGAGVPL ++     G DS VQYYVPY
Sbjct: 126 PMRSRWELKTRESDCHTYFVLEDLWESFAEWSAYGAGVPLEMHPLEMHGNDSTVQYYVPY 185

Query: 183 LSGIQIYGESAALRSDSKSRLANEDSDLDSSRDTSSEGS---IDYEFGKSCNLSREQWVH 242
           LSGIQ+Y        D   +  N   D + S + SS      +D   G+           
Sbjct: 186 LSGIQLY-------VDPLKKPRNPVGDNEGSSEGSSNSRTLPVDLSVGE----------- 245

Query: 243 HHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQFLEQDLPYQRVPLADK 302
                     + + SL+D+  T     SS + +   P+  LLF++LE + P+ R PLA+K
Sbjct: 246 ----------LNRISLKDQSITGS--LSSGEAEISNPQGRLLFEYLEYEPPFGREPLANK 305

Query: 303 IFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRG 362
           I DLA + P L T RSCD+LP+SW+SV+WYPIYRIP GPTL++LDACFLT+HSLST    
Sbjct: 306 ISDLASRVPELMTYRSCDLLPSSWVSVSWYPIYRIPVGPTLQNLDACFLTFHSLSTAPPQ 365

Query: 363 NGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVKEHQMANSLMQAAEKWL 398
           +  G        +D+    K+ LP FGLASYKLK S+W QN ++E Q   SL+QAA+KWL
Sbjct: 366 SAMG-------CSDSQPSTKLPLPTFGLASYKLKVSVWNQNRIQESQKMTSLLQAADKWL 409

BLAST of Cp4.1LG18g07060 vs. TAIR10
Match: AT4G03420.1 (AT4G03420.1 Protein of unknown function (DUF789))

HSP 1 Score: 218.8 bits (556), Expect = 6.1e-57
Identity = 136/340 (40.00%), Postives = 190/340 (55.88%), Query Frame = 1

Query: 66  SNLERFLDATKPSVPAQYFSKTTMRG----WRTCDIEFQPYFVLNDLWESFKEWSAYGAG 125
           SNL+RFL  T P VP Q  SK  +R     W   + +   +F L+DLW+ + EWSAYGAG
Sbjct: 7   SNLDRFLHCTTPVVPPQSLSKAEIRSLNRIWHPWERQKVEFFRLSDLWDCYDEWSAYGAG 66

Query: 126 VPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSRDTSSEGSIDY 185
           VP+ L+ G+S+VQYYVPYLS IQI+   ++L      RL  +DS+   SRD+ S+   D 
Sbjct: 67  VPIRLSNGESLVQYYVPYLSAIQIFTSRSSL-----IRL-RDDSEDGESRDSFSDSYSDE 126

Query: 186 EFGKSCNLSREQWVHHHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLF-Q 245
              +S  LSR         C S                 EG   D       R G L+ Q
Sbjct: 127 --SESDKLSR---------CAS----------------DEGLEHDALLHPNDRLGYLYLQ 186

Query: 246 FLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL 305
           + E+  PY RVPL DKI +LA ++PGL +LRS D+ PASW++VAWYPIY IP G T+KDL
Sbjct: 187 YFERSAPYARVPLMDKINELAQRYPGLMSLRSVDLSPASWMAVAWYPIYHIPMGRTIKDL 246

Query: 306 DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPK--------VSLPVFGLASYKLKGS 365
             CFLTYH+LS+  +          + P +N G  +        V+L  FGLA+YK++G+
Sbjct: 247 STCFLTYHTLSSSFQD---------MEPEENGGEKERIRKEGEGVTLLPFGLATYKMQGN 304

Query: 366 IW--AQNCVKEHQMANSLMQAAEKWLRRLQVNQPDFQFFA 391
           +W    +  ++ +   SL+  A+ WL++L+V   DF +F+
Sbjct: 307 VWLSEDDQGQDQERVLSLLSVADSWLKQLRVQHHDFNYFS 304

BLAST of Cp4.1LG18g07060 vs. NCBI nr
Match: gi|659106752|ref|XP_008453426.1| (PREDICTED: uncharacterized protein LOC103494138 isoform X1 [Cucumis melo])

HSP 1 Score: 682.6 bits (1760), Expect = 4.2e-193
Identity = 334/397 (84.13%), Postives = 357/397 (89.92%), Query Frame = 1

Query: 1   MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSKVVASTTTPSKPL 60
           MLGTALQFGGIKGEDRFYIPVRARK+YNQQKPSRRPTKTDETE+ SSKVV  TT P + L
Sbjct: 1   MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPTKTDETESLSSKVVGCTTKPCEEL 60

Query: 61  TPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQPYFVLNDLWESFKEWSAYGA 120
           TPQSKSNLERFL+AT+PSVPAQYFSKTTMR WRTCDIEFQPYF+LNDLWESFKEWSAYGA
Sbjct: 61  TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA 120

Query: 121 GVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSRDTSSEGSID 180
           GVPLVL+GGDSVVQYYVPYLSGIQIYGE+AALRSDS  RLA EDSDLDSSRDTSS+GSID
Sbjct: 121 GVPLVLDGGDSVVQYYVPYLSGIQIYGEAAALRSDSNVRLACEDSDLDSSRDTSSDGSID 180

Query: 181 YEFGKSCNLSREQWVHHHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQ 240
           Y+ GKS NLSREQW H HLACE+   MRKTSL DE    QEGF SDDGDA YPRSGLLFQ
Sbjct: 181 YDLGKSFNLSREQWDHPHLACENMPKMRKTSLTDERKMVQEGFLSDDGDAGYPRSGLLFQ 240

Query: 241 FLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL 300
           FLEQDLPYQRVPLADKIF+LAYQFPGLKTLRSCDILPASW+SVAWYPIYRIPTGPTLKDL
Sbjct: 241 FLEQDLPYQRVPLADKIFELAYQFPGLKTLRSCDILPASWVSVAWYPIYRIPTGPTLKDL 300

Query: 301 DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVK 360
           DACFLTYHSLSTP +GN H   P M+YP D D I K+SLPVFG+ASYKLKGSIW QN + 
Sbjct: 301 DACFLTYHSLSTPKKGNRHSLPPVMVYPKDIDDITKISLPVFGMASYKLKGSIWGQNGIN 360

Query: 361 EHQMANSLMQAAEKWLRRLQVNQPDFQFFASNMTYWR 398
           +HQ ANSLMQAA+KWLR LQV+QPDFQFF+S+ TYWR
Sbjct: 361 DHQKANSLMQAADKWLRSLQVSQPDFQFFSSHGTYWR 397

BLAST of Cp4.1LG18g07060 vs. NCBI nr
Match: gi|449439091|ref|XP_004137321.1| (PREDICTED: uncharacterized protein LOC101215266 [Cucumis sativus])

HSP 1 Score: 673.7 bits (1737), Expect = 2.0e-190
Identity = 327/397 (82.37%), Postives = 354/397 (89.17%), Query Frame = 1

Query: 1   MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSKVVASTTTPSKPL 60
           MLGT LQFGGIKGEDRFY+PVRARK+YNQQKPSR PTKTDETE+ SSKVV  TT P + L
Sbjct: 1   MLGTTLQFGGIKGEDRFYVPVRARKNYNQQKPSRNPTKTDETESLSSKVVGCTTKPCEEL 60

Query: 61  TPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQPYFVLNDLWESFKEWSAYGA 120
           TPQSKSNLERFL+AT+PSVPAQYFSKTTMR WRTCDIEFQPYF+LNDLWESFKEWSAYGA
Sbjct: 61  TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA 120

Query: 121 GVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSRDTSSEGSID 180
           GVPLVL+GGDSVVQYYVPYLSGIQIYGE+AALRSDS  RLA EDSDLDSSRDTSS+GSID
Sbjct: 121 GVPLVLDGGDSVVQYYVPYLSGIQIYGEAAALRSDSHVRLACEDSDLDSSRDTSSDGSID 180

Query: 181 YEFGKSCNLSREQWVHHHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQ 240
           ++ GKS N SREQW H HLACE+ + MRKTSL DEH   QEGF SDDGDA YPRS LLFQ
Sbjct: 181 HDLGKSFNFSREQWDHPHLACENMLKMRKTSLTDEHKMVQEGFLSDDGDAGYPRSSLLFQ 240

Query: 241 FLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL 300
           FLEQDLPYQRVPLADKIF+LAYQFPGLKTL SCDILPASW+SVAWYPIYRIPTGPTLKDL
Sbjct: 241 FLEQDLPYQRVPLADKIFELAYQFPGLKTLSSCDILPASWVSVAWYPIYRIPTGPTLKDL 300

Query: 301 DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVK 360
           DACFLTYHSLSTP +GN H   P M+YP D D I K+SLPVFG+ASYK+KGSIW QN + 
Sbjct: 301 DACFLTYHSLSTPKKGNRHSLPPIMVYPKDIDDITKISLPVFGMASYKVKGSIWGQNGIS 360

Query: 361 EHQMANSLMQAAEKWLRRLQVNQPDFQFFASNMTYWR 398
           +HQ ANSLMQAA+KWLR LQV+QPDFQFF+S+ TYWR
Sbjct: 361 DHQKANSLMQAADKWLRSLQVSQPDFQFFSSHGTYWR 397

BLAST of Cp4.1LG18g07060 vs. NCBI nr
Match: gi|659106758|ref|XP_008453429.1| (PREDICTED: uncharacterized protein LOC103494138 isoform X4 [Cucumis melo])

HSP 1 Score: 654.8 bits (1688), Expect = 9.4e-185
Identity = 322/381 (84.51%), Postives = 342/381 (89.76%), Query Frame = 1

Query: 1   MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSKVVASTTTPSKPL 60
           MLGTALQFGGIKGEDRFYIPVRARK+YNQQKPSRRPTKTDETE+ SSKVV  TT P + L
Sbjct: 1   MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPTKTDETESLSSKVVGCTTKPCEEL 60

Query: 61  TPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQPYFVLNDLWESFKEWSAYGA 120
           TPQSKSNLERFL+AT+PSVPAQYFSKTTMR WRTCDIEFQPYF+LNDLWESFKEWSAYGA
Sbjct: 61  TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA 120

Query: 121 GVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSRDTSSEGSID 180
           GVPLVL+GGDSVVQYYVPYLSGIQIYGE+AALRSDS  RLA EDSDLDSSRDTSS+GSID
Sbjct: 121 GVPLVLDGGDSVVQYYVPYLSGIQIYGEAAALRSDSNVRLACEDSDLDSSRDTSSDGSID 180

Query: 181 YEFGKSCNLSREQWVHHHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQ 240
           Y+ GKS NLSREQW H HLACE+   MRKTSL DE    QEGF SDDGDA YPRSGLLFQ
Sbjct: 181 YDLGKSFNLSREQWDHPHLACENMPKMRKTSLTDERKMVQEGFLSDDGDAGYPRSGLLFQ 240

Query: 241 FLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL 300
           FLEQDLPYQRVPLADKIF+LAYQFPGLKTLRSCDILPASW+SVAWYPIYRIPTGPTLKDL
Sbjct: 241 FLEQDLPYQRVPLADKIFELAYQFPGLKTLRSCDILPASWVSVAWYPIYRIPTGPTLKDL 300

Query: 301 DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVK 360
           DACFLTYHSLSTP +GN H   P M+YP D D I K+SLPVFG+ASYKLKGSIW QN + 
Sbjct: 301 DACFLTYHSLSTPKKGNRHSLPPVMVYPKDIDDITKISLPVFGMASYKLKGSIWGQNGIN 360

Query: 361 EHQMANSLMQAAEKWLRRLQV 382
           +HQ ANSLMQAA+KWLR LQV
Sbjct: 361 DHQKANSLMQAADKWLRSLQV 381

BLAST of Cp4.1LG18g07060 vs. NCBI nr
Match: gi|659106756|ref|XP_008453428.1| (PREDICTED: uncharacterized protein LOC103494138 isoform X3 [Cucumis melo])

HSP 1 Score: 652.9 bits (1683), Expect = 3.6e-184
Identity = 321/380 (84.47%), Postives = 341/380 (89.74%), Query Frame = 1

Query: 1   MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSKVVASTTTPSKPL 60
           MLGTALQFGGIKGEDRFYIPVRARK+YNQQKPSRRPTKTDETE+ SSKVV  TT P + L
Sbjct: 1   MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPTKTDETESLSSKVVGCTTKPCEEL 60

Query: 61  TPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQPYFVLNDLWESFKEWSAYGA 120
           TPQSKSNLERFL+AT+PSVPAQYFSKTTMR WRTCDIEFQPYF+LNDLWESFKEWSAYGA
Sbjct: 61  TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA 120

Query: 121 GVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSRDTSSEGSID 180
           GVPLVL+GGDSVVQYYVPYLSGIQIYGE+AALRSDS  RLA EDSDLDSSRDTSS+GSID
Sbjct: 121 GVPLVLDGGDSVVQYYVPYLSGIQIYGEAAALRSDSNVRLACEDSDLDSSRDTSSDGSID 180

Query: 181 YEFGKSCNLSREQWVHHHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQ 240
           Y+ GKS NLSREQW H HLACE+   MRKTSL DE    QEGF SDDGDA YPRSGLLFQ
Sbjct: 181 YDLGKSFNLSREQWDHPHLACENMPKMRKTSLTDERKMVQEGFLSDDGDAGYPRSGLLFQ 240

Query: 241 FLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL 300
           FLEQDLPYQRVPLADKIF+LAYQFPGLKTLRSCDILPASW+SVAWYPIYRIPTGPTLKDL
Sbjct: 241 FLEQDLPYQRVPLADKIFELAYQFPGLKTLRSCDILPASWVSVAWYPIYRIPTGPTLKDL 300

Query: 301 DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVK 360
           DACFLTYHSLSTP +GN H   P M+YP D D I K+SLPVFG+ASYKLKGSIW QN + 
Sbjct: 301 DACFLTYHSLSTPKKGNRHSLPPVMVYPKDIDDITKISLPVFGMASYKLKGSIWGQNGIN 360

Query: 361 EHQMANSLMQAAEKWLRRLQ 381
           +HQ ANSLMQAA+KWLR LQ
Sbjct: 361 DHQKANSLMQAADKWLRSLQ 380

BLAST of Cp4.1LG18g07060 vs. NCBI nr
Match: gi|659106754|ref|XP_008453427.1| (PREDICTED: uncharacterized protein LOC103494138 isoform X2 [Cucumis melo])

HSP 1 Score: 652.9 bits (1683), Expect = 3.6e-184
Identity = 321/380 (84.47%), Postives = 341/380 (89.74%), Query Frame = 1

Query: 1   MLGTALQFGGIKGEDRFYIPVRARKSYNQQKPSRRPTKTDETETPSSKVVASTTTPSKPL 60
           MLGTALQFGGIKGEDRFYIPVRARK+YNQQKPSRRPTKTDETE+ SSKVV  TT P + L
Sbjct: 1   MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPTKTDETESLSSKVVGCTTKPCEEL 60

Query: 61  TPQSKSNLERFLDATKPSVPAQYFSKTTMRGWRTCDIEFQPYFVLNDLWESFKEWSAYGA 120
           TPQSKSNLERFL+AT+PSVPAQYFSKTTMR WRTCDIEFQPYF+LNDLWESFKEWSAYGA
Sbjct: 61  TPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGA 120

Query: 121 GVPLVLNGGDSVVQYYVPYLSGIQIYGESAALRSDSKSRLANEDSDLDSSRDTSSEGSID 180
           GVPLVL+GGDSVVQYYVPYLSGIQIYGE+AALRSDS  RLA EDSDLDSSRDTSS+GSID
Sbjct: 121 GVPLVLDGGDSVVQYYVPYLSGIQIYGEAAALRSDSNVRLACEDSDLDSSRDTSSDGSID 180

Query: 181 YEFGKSCNLSREQWVHHHLACESAITMRKTSLRDEHSTRQEGFSSDDGDAEYPRSGLLFQ 240
           Y+ GKS NLSREQW H HLACE+   MRKTSL DE    QEGF SDDGDA YPRSGLLFQ
Sbjct: 181 YDLGKSFNLSREQWDHPHLACENMPKMRKTSLTDERKMVQEGFLSDDGDAGYPRSGLLFQ 240

Query: 241 FLEQDLPYQRVPLADKIFDLAYQFPGLKTLRSCDILPASWISVAWYPIYRIPTGPTLKDL 300
           FLEQDLPYQRVPLADKIF+LAYQFPGLKTLRSCDILPASW+SVAWYPIYRIPTGPTLKDL
Sbjct: 241 FLEQDLPYQRVPLADKIFELAYQFPGLKTLRSCDILPASWVSVAWYPIYRIPTGPTLKDL 300

Query: 301 DACFLTYHSLSTPIRGNGHGQAPAMIYPNDNDGIPKVSLPVFGLASYKLKGSIWAQNCVK 360
           DACFLTYHSLSTP +GN H   P M+YP D D I K+SLPVFG+ASYKLKGSIW QN + 
Sbjct: 301 DACFLTYHSLSTPKKGNRHSLPPVMVYPKDIDDITKISLPVFGMASYKLKGSIWGQNGIN 360

Query: 361 EHQMANSLMQAAEKWLRRLQ 381
           +HQ ANSLMQAA+KWLR LQ
Sbjct: 361 DHQKANSLMQAADKWLRSLQ 380

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0LS49_CUCSA1.4e-19082.37Uncharacterized protein OS=Cucumis sativus GN=Csa_1G024240 PE=4 SV=1[more]
A0A0D2S4J5_GOSRA1.9e-13961.37Uncharacterized protein OS=Gossypium raimondii GN=B456_007G005300 PE=4 SV=1[more]
W9SER1_9ROSA4.6e-13863.17Uncharacterized protein OS=Morus notabilis GN=L484_013258 PE=4 SV=1[more]
A0A067E421_CITSI5.1e-13762.16Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g015519mg PE=4 SV=1[more]
A0A067E3M8_CITSI1.9e-13661.92Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g015519mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G15030.15.8e-9256.40 Protein of unknown function (DUF789)[more]
AT2G01260.15.8e-9250.63 Protein of unknown function (DUF789)[more]
AT4G16100.11.2e-8144.36 Protein of unknown function (DUF789)[more]
AT5G49220.11.0e-7239.91 Protein of unknown function (DUF789)[more]
AT4G03420.16.1e-5740.00 Protein of unknown function (DUF789)[more]
Match NameE-valueIdentityDescription
gi|659106752|ref|XP_008453426.1|4.2e-19384.13PREDICTED: uncharacterized protein LOC103494138 isoform X1 [Cucumis melo][more]
gi|449439091|ref|XP_004137321.1|2.0e-19082.37PREDICTED: uncharacterized protein LOC101215266 [Cucumis sativus][more]
gi|659106758|ref|XP_008453429.1|9.4e-18584.51PREDICTED: uncharacterized protein LOC103494138 isoform X4 [Cucumis melo][more]
gi|659106756|ref|XP_008453428.1|3.6e-18484.47PREDICTED: uncharacterized protein LOC103494138 isoform X3 [Cucumis melo][more]
gi|659106754|ref|XP_008453427.1|3.6e-18484.47PREDICTED: uncharacterized protein LOC103494138 isoform X2 [Cucumis melo][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR008507DUF789
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG18g07060.1Cp4.1LG18g07060.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR008507Protein of unknown function DUF789PFAMPF05623DUF789coord: 66..389
score: 1.5E
NoneNo IPR availablePANTHERPTHR31343FAMILY NOT NAMEDcoord: 199..397
score: 8.6E-204coord: 2..180
score: 8.6E
NoneNo IPR availablePANTHERPTHR31343:SF5SUBFAMILY NOT NAMEDcoord: 199..397
score: 8.6E-204coord: 2..180
score: 8.6E

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Cp4.1LG18g07060Cucsa.111200Cucumber (Gy14) v1cgycpeB0253
Cp4.1LG18g07060Cucsa.283340Cucumber (Gy14) v1cgycpeB0748
Cp4.1LG18g07060CmaCh11G002870Cucurbita maxima (Rimu)cmacpeB132
Cp4.1LG18g07060CmoCh10G003300Cucurbita moschata (Rifu)cmocpeB055
Cp4.1LG18g07060CmoCh11G002900Cucurbita moschata (Rifu)cmocpeB108
Cp4.1LG18g07060Cla018958Watermelon (97103) v1cpewmB372
Cp4.1LG18g07060Csa5G610390Cucumber (Chinese Long) v2cpecuB376
Cp4.1LG18g07060Csa1G024240Cucumber (Chinese Long) v2cpecuB363
Cp4.1LG18g07060MELO3C012272Melon (DHL92) v3.5.1cpemeB319
Cp4.1LG18g07060MELO3C017323Melon (DHL92) v3.5.1cpemeB330
Cp4.1LG18g07060ClCG06G014620Watermelon (Charleston Gray)cpewcgB335
Cp4.1LG18g07060CSPI01G03980Wild cucumber (PI 183967)cpecpiB363
Cp4.1LG18g07060Lsi04G002290Bottle gourd (USVL1VR-Ls)cpelsiB296
Cp4.1LG18g07060Lsi06G013160Bottle gourd (USVL1VR-Ls)cpelsiB301
Cp4.1LG18g07060MELO3C017323.2Melon (DHL92) v3.6.1cpemedB388
Cp4.1LG18g07060MELO3C012272.2Melon (DHL92) v3.6.1cpemedB377
Cp4.1LG18g07060CsaV3_1G003990Cucumber (Chinese Long) v3cpecucB0444
Cp4.1LG18g07060CsaV3_5G034560Cucumber (Chinese Long) v3cpecucB0465
Cp4.1LG18g07060Bhi07G001075Wax gourdcpewgoB0450
Cp4.1LG18g07060Bhi01G002685Wax gourdcpewgoB0439
Cp4.1LG18g07060CsGy5G024970Cucumber (Gy14) v2cgybcpeB623
Cp4.1LG18g07060CsGy1G003900Cucumber (Gy14) v2cgybcpeB055
Cp4.1LG18g07060Carg09128Silver-seed gourdcarcpeB0843
Cp4.1LG18g07060Carg15160Silver-seed gourdcarcpeB1103
The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
Cp4.1LG18g07060Cp4.1LG09g03210Cucurbita pepo (Zucchini)cpecpeB030
Cp4.1LG18g07060Cp4.1LG01g13840Cucurbita pepo (Zucchini)cpecpeB345
Cp4.1LG18g07060Cp4.1LG04g13890Cucurbita pepo (Zucchini)cpecpeB360