Cp4.1LG04g05880 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG04g05880
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Protein of unknown function, DUF642
Location: Cp4.1LG04 : 5514476 .. 5517145 (-)
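
The location string can be split into its parts to work out the length of the genomic span. The sketch below assumes the coordinates are 1-based and inclusive, as is common for genome annotations; the record itself does not state this. The helper name `parse_location` is hypothetical.

```python
import re

# Pattern for locations written as "<sequence> : <start> .. <end> (<strand>)".
LOCATION_RE = re.compile(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")

def parse_location(text: str) -> tuple[str, int, int, str]:
    """Return (sequence id, start, end, strand) from a location string."""
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seq_id, start, end, strand = match.groups()
    return seq_id, int(start), int(end), strand

seq_id, start, end, strand = parse_location("Cp4.1LG04 : 5514476 .. 5517145 (-)")
# Assumption: inclusive coordinates, so the span covers end - start + 1 bases.
span_length = end - start + 1
print(seq_id, strand, span_length)
```

The `(-)` suffix indicates that the gene is annotated on the reverse strand of Cp4.1LG04.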
The following sequences are available for this feature:

Gene sequence (with intron)

ATCATCCATTTGAATTCCCATCTACCGACATTTGGATGGCCGAGGCCAAGTGGAAAGGCACGTGATTGGTCGTACTTCCACATTCATTAAAAGCTCCATCTCCTCACAATACCCACTCGAAATAGCTGTCTGCCTTTGGAAACTTTCAAGTCAACGCCTTCACTTTTTCTCCACACTGCAATGGCTTTGCAGTTCCCTTCTCCTTCCTCCTCCAACCAACCATGGCGGCTGCCGGAAACCCTCCTCCTCCTCCTCCTCCTTTCCGCCGGAGTAACCTCCAGAGGTAAAACCATCAGCCATTTGTTCATATTCCCATAAGAAAATTTAGATTTCAAGTGATTGGAAGATTTGAAGATAATCTGAGTATATATATATATGTCTTACCCAGTTCAGCTAAAGACAGCTATGTTGTCTCCTCTGACATTGTCTCTGTTCTTTTTCTTTTTTCAGAACTTTTGAAGAATTCAGATTTTGAATCTCCGCCGTCAAACCTGCCGGAGAATTCAAACAAAACATCAGTGAAACTGAACGAAAACAGCACAATTCCAGGATGGACATTTCAGGGGACGGGTGAGTACATAACAGTAAGCGAAAACATTTCATTGCCAGAAAAAGGACATGCTATTTTATTGGGTGAAGATGGAAGAATCAACCAGACCTTCATAGCCGATGTAGACTTTTTGAATTACCTGCTGACCTTCGCACTGGCTCCTGGCGGTCAGAATTGTTCATCTACAGCTCCATTGCTAGTCTCGGCACCAGATAGTGATGCCATGTTTACCTTTAGCCAGCATTATGGGAAGGAGCCATGGGAGGTCCATGGCGTTTTTCTGGGTAGCTGGGGAGATGAAGAGCCTGTAAACCTGCAAATTGGAAGCCAAGCTAATGACTCTACTCCAGCCTGTTGGCCTGTAATTGACTCACTTCATATCAAGACAATGGGTGTAGTGATGCCAGATAGTGGTAAGTAGATTTCTAAGATCCTCAAAATGTGAGTTAAACACTGGTCTGTTAGGAATCACGTCTCTCCACGATAGTATAATATTATTCACTTTGAGCATAAGCTCTCATGGCTTTGCTTTTGGCTTCTCCAAAAGGCCTCATACCAATGGAGATGTATTTCTTGCTTATAAACTCATGATCATTCCATAAATTAGCCAATGTGTGACTCCCTCCCAACAATCCTTAACAATCCTCCCTTCAAACAAAGTACACCCTGGTCTATGGAGCCCTCGAACAACCTCCCCTTAATCGAGGCTCGACTCCTTCTCTAGAGTCCTCGAACAAAGTATATCCTTTGTTTGACACTTGAGTCACTTTTGACTACACCTTCGAGGCTCACAACTTCTTTGTTCGATATTTGAGGATTCTATTGACATGACTAAGTTAAAAGCATGATTTTAATACCATGTTAGGAATCACGACTCTCCACAATGGTATGATATTGTCCACTTTGAGTATAAGCTTTCATGGTTTTACTTTTGGCTTCCCCAAAAGGTTTCATACCAATGAAAATGTATTCCTTACTTATAAACCCAGGGGCGGATTAGTGTCTGTAATTAAACTTCTTACTGCTACTTTTGATTGATGCAGGAAACTTAGTAGTCAATGGAGGATTCGAGTTCGGCCCAGATTTTCTCGAAAGCGTGGAAGGCGGAGTTCTACTGGATTCAGTTCCTACTCCTCTCTTTTCACCACTAGCACAGTGGGCTATACTGGGGACAGTCAGATACATAAATTCAAAACACTTCTTTGTTCCACAAGGTAATGCTGCAGTAGAGCTGATATCTGGAGCTTCATCAGGAGTTCAAGCAGCAGTGAAACTCCAGGCAGGTTCATATACCTTGAATTTCACGCTTGGCGACGCTAACGATTCATGCGAAGCTAAATTTCTTGTCGGAGTACAAGCTGGATCGGGGTCACAGAATTTCACGTTGGAGAGCAACGGAACAGGTTCAGCTGTAAAATTCTCCATGCCATTCAATGCAGCTCCTGATGAT
AATACAATCACTTTCCTTAGCTACACTACATCCAAAACAAAAGATGGAGACTTCTGTGGTCCAGTCATAGATGATGTATTTTTGCGTGCTTCTCATGGGCTCAGAATTTTAATGCCCTGGAAGACCTTGATTCCTCTGTGTTTGATTACAATTCTCTTTTTGCTCTGAGGAAAAAGTCCAGGAATTCTACATTTGACAAGTTAGGAGGCCTGTATTATTGCTGTACCTGTCTTCTTTGAGGAAAGTTCAGAATAAAAACCAACAAACACAGTAATTTGTCTCAAAAGAAACATACTTCATTAATATGTGGAGATAGAATAGAATGACAGTAATTGTGACAGTATAGTACATTTAGATTGATATTACTTCATATCTAAGACTATGTATAATTTGCTAACGAAGTGGAAGTAATGAAGATTTATATATATACCACTTCTGAAATGACTGAAATCTGCTGTATATCTGGCTTGGATATCTAAGAAGAAGCCGTTTATTTGGGTAATTCTGAACATGATAGCCAATGGTGGAGAGCTTTTCGTCAATTGTTCATTGTATTGACATTTTGATGACCTGAAAATCAAGCAAAATAATGAAACACAATAAAGTGGATAGTGTACAACTGCGAGGACGTGGAAGTAAATAGTAATGATTCCATTGAGGAAATTTGAAG

mRNA sequence

ATCATCCATTTGAATTCCCATCTACCGACATTTGGATGGCCGAGGCCAAGTGGAAAGGCACGTGATTGGTCGTACTTCCACATTCATTAAAAGCTCCATCTCCTCACAATACCCACTCGAAATAGCTGTCTGCCTTTGGAAACTTTCAAGTCAACGCCTTCACTTTTTCTCCACACTGCAATGGCTTTGCAGTTCCCTTCTCCTTCCTCCTCCAACCAACCATGGCGGCTGCCGGAAACCCTCCTCCTCCTCCTCCTCCTTTCCGCCGGAGTAACCTCCAGAGAACTTTTGAAGAATTCAGATTTTGAATCTCCGCCGTCAAACCTGCCGGAGAATTCAAACAAAACATCAGTGAAACTGAACGAAAACAGCACAATTCCAGGATGGACATTTCAGGGGACGGGTGAGTACATAACAGTAAGCGAAAACATTTCATTGCCAGAAAAAGGACATGCTATTTTATTGGGTGAAGATGGAAGAATCAACCAGACCTTCATAGCCGATGTAGACTTTTTGAATTACCTGCTGACCTTCGCACTGGCTCCTGGCGGTCAGAATTGTTCATCTACAGCTCCATTGCTAGTCTCGGCACCAGATAGTGATGCCATGTTTACCTTTAGCCAGCATTATGGGAAGGAGCCATGGGAGGTCCATGGCGTTTTTCTGGGTAGCTGGGGAGATGAAGAGCCTGTAAACCTGCAAATTGGAAGCCAAGCTAATGACTCTACTCCAGCCTGTTGGCCTGTAATTGACTCACTTCATATCAAGACAATGGGAAACTTAGTAGTCAATGGAGGATTCGAGTTCGGCCCAGATTTTCTCGAAAGCGTGGAAGGCGGAGTTCTACTGGATTCAGTTCCTACTCCTCTCTTTTCACCACTAGCACAGTGGGCTATACTGGGGACAGTCAGATACATAAATTCAAAACACTTCTTTGTTCCACAAGGTAATGCTGCAGTAGAGCTGATATCTGGAGCTTCATCAGGAGTTCAAGCAGCAGTGAAACTCCAGGCAGGTTCATATACCTTGAATTTCACGCTTGGCGACGCTAACGATTCATGCGAAGCTAAATTTCTTGTCGGAGTACAAGCTGGATCGGGGTCACAGAATTTCACGTTGGAGAGCAACGGAACAGGTTCAGCTGTAAAATTCTCCATGCCATTCAATGCAGCTCCTGATGATAATACAATCACTTTCCTTAGCTACACTACATCCAAAACAAAAGATGGAGACTTCTGTGGTCCAGTCATAGATGATGTATTTTTGCGTGCTTCTCATGGGCTCAGAATTTTAATGCCCTGGAAGACCTTGATTCCTCTGTGTTTGATTACAATTCTCTTTTTGCTCTGAGGAAAAAGTCCAGGAATTCTACATTTGACAAGTTAGGAGGCCTGTATTATTGCTGTACCTGTCTTCTTTGAGGAAAGTTCAGAATAAAAACCAACAAACACAGTAATTTGTCTCAAAAGAAACATACTTCATTAATATGTGGAGATAGAATAGAATGACAGTAATTGTGACAGTATAGTACATTTAGATTGATATTACTTCATATCTAAGACTATGTATAATTTGCTAACGAAGTGGAAGTAATGAAGATTTATATATATACCACTTCTGAAATGACTGAAATCTGCTGTATATCTGGCTTGGATATCTAAGAAGAAGCCGTTTATTTGGGTAATTCTGAACATGATAGCCAATGGTGGAGAGCTTTTCGTCAATTGTTCATTGTATTGACATTTTGATGACCTGAAAATCAAGCAAAATAATGAAACACAATAAAGTGGATAGTGTACAACTGCGAGGACGTGGAAGTAAATAGTAATGATTCCATTGAGGAAATTTGAAG

Coding sequence (CDS)

ATGGCTTTGCAGTTCCCTTCTCCTTCCTCCTCCAACCAACCATGGCGGCTGCCGGAAACCCTCCTCCTCCTCCTCCTCCTTTCCGCCGGAGTAACCTCCAGAGAACTTTTGAAGAATTCAGATTTTGAATCTCCGCCGTCAAACCTGCCGGAGAATTCAAACAAAACATCAGTGAAACTGAACGAAAACAGCACAATTCCAGGATGGACATTTCAGGGGACGGGTGAGTACATAACAGTAAGCGAAAACATTTCATTGCCAGAAAAAGGACATGCTATTTTATTGGGTGAAGATGGAAGAATCAACCAGACCTTCATAGCCGATGTAGACTTTTTGAATTACCTGCTGACCTTCGCACTGGCTCCTGGCGGTCAGAATTGTTCATCTACAGCTCCATTGCTAGTCTCGGCACCAGATAGTGATGCCATGTTTACCTTTAGCCAGCATTATGGGAAGGAGCCATGGGAGGTCCATGGCGTTTTTCTGGGTAGCTGGGGAGATGAAGAGCCTGTAAACCTGCAAATTGGAAGCCAAGCTAATGACTCTACTCCAGCCTGTTGGCCTGTAATTGACTCACTTCATATCAAGACAATGGGAAACTTAGTAGTCAATGGAGGATTCGAGTTCGGCCCAGATTTTCTCGAAAGCGTGGAAGGCGGAGTTCTACTGGATTCAGTTCCTACTCCTCTCTTTTCACCACTAGCACAGTGGGCTATACTGGGGACAGTCAGATACATAAATTCAAAACACTTCTTTGTTCCACAAGGTAATGCTGCAGTAGAGCTGATATCTGGAGCTTCATCAGGAGTTCAAGCAGCAGTGAAACTCCAGGCAGGTTCATATACCTTGAATTTCACGCTTGGCGACGCTAACGATTCATGCGAAGCTAAATTTCTTGTCGGAGTACAAGCTGGATCGGGGTCACAGAATTTCACGTTGGAGAGCAACGGAACAGGTTCAGCTGTAAAATTCTCCATGCCATTCAATGCAGCTCCTGATGATAATACAATCACTTTCCTTAGCTACACTACATCCAAAACAAAAGATGGAGACTTCTGTGGTCCAGTCATAGATGATGTATTTTTGCGTGCTTCTCATGGGCTCAGAATTTTAATGCCCTGGAAGACCTTGATTCCTCTGTGTTTGATTACAATTCTCTTTTTGCTCTGA

Protein sequence

MALQFPSPSSSNQPWRLPETLLLLLLLSAGVTSRELLKNSDFESPPSNLPENSNKTSVKLNENSTIPGWTFQGTGEYITVSENISLPEKGHAILLGEDGRINQTFIADVDFLNYLLTFALAPGGQNCSSTAPLLVSAPDSDAMFTFSQHYGKEPWEVHGVFLGSWGDEEPVNLQIGSQANDSTPACWPVIDSLHIKTMGNLVVNGGFEFGPDFLESVEGGVLLDSVPTPLFSPLAQWAILGTVRYINSKHFFVPQGNAAVELISGASSGVQAAVKLQAGSYTLNFTLGDANDSCEAKFLVGVQAGSGSQNFTLESNGTGSAVKFSMPFNAAPDDNTITFLSYTTSKTKDGDFCGPVIDDVFLRASHGLRILMPWKTLIPLCLITILFLL
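
The protein sequence corresponds to a translation of the coding sequence. As a minimal sketch, assuming the standard genetic code (the record does not name the translation table), the first and last codons of the CDS can be translated as follows. The function `translate` is illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks unrecognised codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First six codons and last six codons of the CDS shown above.
print(translate("ATGGCTTTGCAGTTCCCT"))
print(translate("ATTCTCTTTTTGCTCTGA"))
```

The two fragments give the first six residues (MALQFP) and the last five residues (ILFLL) of the protein sequence listed above; the final codon TGA is a stop codon and produces no residue.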
BLAST of Cp4.1LG04g05880 vs. TrEMBL
Match: A0A0A0LDL0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G585880 PE=4 SV=1)

HSP 1 Score: 607.8 bits (1566), Expect = 9.0e-171
Identity = 307/397 (77.33%), Positives = 332/397 (83.63%), Query Frame = 1

Query: 1   MALQFPSPSSS-NQPWRLPETLLLLLLLSAGVTSRELLKNSDFESPPSNLPENSNKTSVK 60
           MA   PS SSS N  W LPE +LL LL+S GVTSRE LKN+DFESPPSN PENSNKTSV 
Sbjct: 30  MAFWLPSSSSSSNHSWLLPE-ILLFLLVSTGVTSREFLKNADFESPPSNFPENSNKTSVA 89

Query: 61  LNENSTIPGWTFQGTGEYITVSE--NISLPEKGHAILLGEDGRINQTFIADVDFLNYLLT 120
           L EN+T PGWTFQG  EYITV +  NISLP+KGHAILLGEDG+INQTF AD D L YLLT
Sbjct: 90  LKENNTFPGWTFQGAVEYITVDQIKNISLPDKGHAILLGEDGKINQTFTADADILTYLLT 149

Query: 121 FALAPGGQNCSSTAPLLVSAPDSDAMFTFSQHYGKEPWEVHGVFLGSWGDEEPVNLQIGS 180
           FALAPGG NCS TAPL +SAPDSDA+F+FSQHYGK+PWEVHGV+LGSWGD E VNL+I S
Sbjct: 150 FALAPGGHNCSLTAPLQISAPDSDALFSFSQHYGKQPWEVHGVYLGSWGDRESVNLEIMS 209

Query: 181 QANDSTPACWPVIDSLHIKTMG-------NLVVNGGFEFGPDFLESVEGGVLLDSVPTPL 240
           Q+NDSTP CWP +DSLHIKTMG       NLVVNGGFE+GPDFLES EGGVLLDSVPT  
Sbjct: 210 QSNDSTPTCWPAVDSLHIKTMGIVMPDGDNLVVNGGFEYGPDFLESSEGGVLLDSVPTTF 269

Query: 241 FSPLAQWAILGTVRYINSKHFFVPQGNAAVELISGASSGVQAAVKLQAG-SYTLNFTLGD 300
           FSPL QWAILG VRYINSKHFFVPQGN AVEL+SG SSG+QA  KLQAG SYTL+FTLGD
Sbjct: 270 FSPLIQWAILGKVRYINSKHFFVPQGNTAVELVSGVSSGLQAVPKLQAGSSYTLSFTLGD 329

Query: 301 ANDSCEAKFLVGVQAGSGSQNFTLESNGTGSAVKFSMPFNAAPDDNTITFLSYTTSKTKD 360
           ANDSC+A FLVG QAG  S+NFTLESNGTGSA KFSM F A PD NTIT LSYTTS+TKD
Sbjct: 330 ANDSCKATFLVGAQAGLTSRNFTLESNGTGSAAKFSMTFTAGPDVNTITLLSYTTSQTKD 389

Query: 361 GDFCGPVIDDVFLRASHGLRILMPWKTLIPLCLITIL 387
           GDFCGPVIDDV LR S GLRI +PWK+LI LCLITI+
Sbjct: 390 GDFCGPVIDDVILRVSRGLRISVPWKSLISLCLITIV 425
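
The percentages in an HSP summary line are the reported counts divided by the alignment length. The sketch below parses such a line and recomputes them; the function name `hsp_percentages` is hypothetical, and the regular expression only assumes the `name = count/length` layout seen in this report.

```python
import re

def hsp_percentages(summary: str) -> dict[str, float]:
    """Map each 'name = count/length' field to a percentage (2 decimals)."""
    fields = re.findall(r"(\w+) = (\d+)/(\d+)", summary)
    return {
        name: round(100 * int(count) / int(length), 2)
        for name, count, length in fields
    }

line = "Identity = 307/397 (77.33%), Positives = 332/397 (83.63%), Query Frame = 1"
print(hsp_percentages(line))
```

Fields without a `count/length` value, such as `Query Frame = 1`, are skipped by the pattern.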

BLAST of Cp4.1LG04g05880 vs. TrEMBL
Match: B9RBI5_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1677190 PE=4 SV=1)

HSP 1 Score: 433.7 bits (1114), Expect = 2.3e-118
Identity = 213/381 (55.91%), Positives = 281/381 (73.75%), Query Frame = 1

Query: 18  PETLLLLLLLSAGVTSRELLKNSDFESPPSNLPENSNKTSVKLNENSTIPGWTFQGTGEY 77
           P  L+ LL + +   + +LL+N DFE+PP ++P NS      LNENSTIPGWTF+GT  Y
Sbjct: 3   PVLLVSLLFIGSAFAAADLLQNPDFETPPLHVPRNSTSPFQLLNENSTIPGWTFEGTVVY 62

Query: 78  ITVSENISLPEKGHAILLGEDGRINQTFIADVDFLNYLLTFALAPGGQNCSSTAPLLVSA 137
           +T S+ ++LP  GHAI L +DG+INQTF  +  + +YLLTF LAPGGQNCS++  + VS 
Sbjct: 63  VTASQTVALPGDGHAIQLIQDGKINQTFHPNASYSHYLLTFVLAPGGQNCSNSGSIGVSV 122

Query: 138 PDSDAMFTFSQHYGKEPWEVHGVFLGSWGDEEPVNLQIGSQA--NDSTPACWPVIDSLHI 197
           PD+ A+F+F QHYGKE WE +GV+LGSW ++EP+NL I SQA  +D+   CWPVID L I
Sbjct: 123 PDNHAVFSFKQHYGKEGWETYGVYLGSWEEQEPINLIIESQATESDANSTCWPVIDKLLI 182

Query: 198 KTM-------GNLVVNGGFEFGPDFLESVEGGVLLDSVPTPLFSPLAQWAILGTVRYINS 257
           KT         NL++NGGFEFGP+FL +   G+LLD  P+P+ S L QW+I GTV+YI+S
Sbjct: 183 KTTETLAPGNDNLLLNGGFEFGPEFLFNSTEGILLDPAPSPVLSALRQWSITGTVKYIDS 242

Query: 258 KHFFVPQGNAAVELISGASSGVQAAVKLQAG-SYTLNFTLGDANDSCEAKFLVGVQAGSG 317
           KH+FVP+GNAAVE++SG S+G+Q A+ +  G SY+L FTLGDANDSC   F+VG QAG  
Sbjct: 243 KHYFVPEGNAAVEMVSGVSAGIQTAMTVTEGSSYSLEFTLGDANDSCVGSFIVGAQAGPA 302

Query: 318 SQNFTLESNGTGSAVKFSMPFNAAPDDNTITFLSYTTSKTKDGDFCGPVIDDVFLRASHG 377
           +QNFTL+SNGTGSA K S+ F A     +I+F+SYTT++TKDG FCGPV+D+V LRASH 
Sbjct: 303 AQNFTLQSNGTGSAKKLSLAFKADSMTTSISFVSYTTTQTKDGLFCGPVVDNVVLRASHA 362

Query: 378 LRILMPWKTLIPLC-LITILF 388
           ++ +M W+ LIPL  L+ IL+
Sbjct: 363 IKSVMKWEGLIPLLFLVAILW 383

BLAST of Cp4.1LG04g05880 vs. TrEMBL
Match: A0A061ECZ6_THECC (Emb:CAB87702.1, putative OS=Theobroma cacao GN=TCM_017188 PE=4 SV=1)

HSP 1 Score: 432.2 bits (1110), Expect = 6.8e-118
Identity = 214/380 (56.32%), Positives = 273/380 (71.84%), Query Frame = 1

Query: 18  PETLLLLLLLSAGVTSRELLKNSDFESPPSNLPENSNKTSVKLNENSTIPGWTFQGTGEY 77
           P+    L L   G  S   L+N DFESPP +L EN+    V LNEN+TIPGWTFQGT +Y
Sbjct: 3   PQIFSFLFLFFIGFASAGYLQNPDFESPPKSLTENTGSPFVTLNENNTIPGWTFQGTVQY 62

Query: 78  ITVSENISLPEKGHAILLGEDGRINQTFIADVDFLNYLLTFALAPGGQNCSSTAPLLVSA 137
           +T  + I+LP+ GHA+ LG+DG+INQTF AD D+ NY+LTF LAPGGQNCS+ A +LVS 
Sbjct: 63  VTAGQTIALPDNGHAVQLGQDGKINQTFTADADYTNYILTFTLAPGGQNCSANADVLVSG 122

Query: 138 PDSDAMFTFSQHYGKEPWEVHGVFLGSWGDEEPVNLQIGSQA--NDSTPACWPVIDSLHI 197
           PDS  +F+F QHYGKE W+ +G  LG  G +EP+NL I SQ   +D    CWPVIDSL I
Sbjct: 123 PDSQGIFSFKQHYGKEAWQSYGQHLGLGGQKEPINLVIESQGVESDDNSTCWPVIDSLLI 182

Query: 198 KTMG-------NLVVNGGFEFGPDFLESVEGGVLLDSVPTPLFSPLAQWAILGTVRYINS 257
           KT+G       NL++NGGFEFGP+FL +   G+LLDS  +P+ SPL QWA++GT++YI+S
Sbjct: 183 KTIGTLVQGKDNLLLNGGFEFGPEFLSNSTEGILLDSALSPVLSPLRQWAVVGTIKYIDS 242

Query: 258 KHFFVPQGNAAVELISGASSGVQAAVKLQAGS-YTLNFTLGDANDSCEAKFLVGVQAGSG 317
           KHFFVP GNAAVE++SG S+G+Q  V L AGS Y+L FTLGDAN++C+  F+V V+A S 
Sbjct: 243 KHFFVPHGNAAVEIVSGVSAGIQTDVTLTAGSAYSLEFTLGDANNACKGDFIVEVRAESV 302

Query: 318 SQNFTLESNGTGSAVKFSMPFNAAPDDNTITFLSYTTSKTKDGDFCGPVIDDVFLRASHG 377
            QNFT++SNGTGSA K SM F A      I+F SYTTS+TKDG FCGPV+DDV L +S+ 
Sbjct: 303 VQNFTVQSNGTGSAQKSSMKFEAGSRATRISFFSYTTSQTKDGIFCGPVVDDVLLLSSNC 362

Query: 378 LRILMPWKTLIPLCLITILF 388
           LR+ +    LI L  + ++F
Sbjct: 363 LRLAIKPNILISLLFLILIF 382

BLAST of Cp4.1LG04g05880 vs. TrEMBL
Match: V4U344_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10015578mg PE=4 SV=1)

HSP 1 Score: 407.9 bits (1047), Expect = 1.4e-110
Identity = 206/379 (54.35%), Positives = 275/379 (72.56%), Query Frame = 1

Query: 21  LLLLLLLSAGVTSR-ELLKNSDFESPPSNLPENSNKTSVKLNENSTIPGWTFQGTGEYIT 80
           +LL+ +L AG+ S  ++L+N DFESPP+NL  N +   V LN N+TIPGWTF+GT +Y+T
Sbjct: 7   ILLVQVLFAGLASAADILQNPDFESPPTNLTPNRSTPFVLLNGNNTIPGWTFEGTVQYVT 66

Query: 81  VSENISLPEKGHAILLGEDGRINQTFIADVDFLNYLLTFALAPGGQNCSSTAPLLVSAPD 140
            S+ I LP+ GHAI L +DGRINQTF A+ D L Y+LT  LAPGGQNCS+ A L+ SAPD
Sbjct: 67  ASQTIRLPDNGHAIQLAQDGRINQTFAANGDDLIYILTLTLAPGGQNCSANANLVASAPD 126

Query: 141 SDAMFTFSQHYGKEPWEVHGVFLGSWGDEEPVNLQIGSQA--NDSTPACWPVIDSLHIKT 200
           S  +++  QHYGKE WE +G +LGSWG +EP+NL I SQ+  +D    CWPVID L +K+
Sbjct: 127 SHGVYSLKQHYGKETWESYGHYLGSWGQDEPINLVIQSQSTESDDNSTCWPVIDMLLLKS 186

Query: 201 M-------GNLVVNGGFEFGPDFLESVEGGVLLDSVPTPLFSPLAQWAILGTVRYINSKH 260
                    NL++NGGFEFGPDFL +   GVLL+S P+P+ S L QW+++GTV+YI+SKH
Sbjct: 187 SKTLVQGNDNLLLNGGFEFGPDFLSNSTEGVLLESAPSPIQSALQQWSVIGTVKYIDSKH 246

Query: 261 FFVPQGNAAVELISGASSGVQAAVKL--QAGSYTLNFTLGDANDSCEAKFLVGVQAGSGS 320
           F+VP+GNAA+E++S  S+G+Q A  +  +  +Y L+FTLGDA D+CE  F+V VQAGS  
Sbjct: 247 FYVPKGNAAIEIVS-VSAGIQTATTMLTEGSAYNLDFTLGDAKDACEGMFVVRVQAGSLV 306

Query: 321 QNFTLESNGTGSAVKFSMPFNAAPDDNTITFLSYTTSKTKDGDFCGPVIDDVFLRASHGL 380
           QNFT++S GTGSA+K S+ F A      I+F+SY  ++TKDG FCGP+IDDV LRASHG 
Sbjct: 307 QNFTVQSLGTGSAIKHSVTFKAGSGSTPISFISYNINQTKDGVFCGPLIDDVVLRASHGF 366

Query: 381 RILMPWKTLI-PLCLITIL 387
           ++ +  + LI  L L+ IL
Sbjct: 367 KLQLRLEILIYALVLVAIL 384

BLAST of Cp4.1LG04g05880 vs. TrEMBL
Match: A0A067FCQ4_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g003457mg PE=4 SV=1)

HSP 1 Score: 407.9 bits (1047), Expect = 1.4e-110
Identity = 206/379 (54.35%), Positives = 274/379 (72.30%), Query Frame = 1

Query: 21  LLLLLLLSAGVTSR-ELLKNSDFESPPSNLPENSNKTSVKLNENSTIPGWTFQGTGEYIT 80
           +LL+ +L AG+ S  ++L+N DFESPP+NL  N +   V LN N+TIPGWTF+GT +Y+T
Sbjct: 441 ILLVQVLFAGLASAADILQNPDFESPPTNLTPNRSTPFVLLNGNNTIPGWTFEGTVQYVT 500

Query: 81  VSENISLPEKGHAILLGEDGRINQTFIADVDFLNYLLTFALAPGGQNCSSTAPLLVSAPD 140
            S+ I LP+ GHAI L +DGRINQTF AD D L Y+LT  LAPGGQNCS+ A L+VSAPD
Sbjct: 501 ASQTIRLPDNGHAIQLAQDGRINQTFAADGDDLIYILTLTLAPGGQNCSANANLVVSAPD 560

Query: 141 SDAMFTFSQHYGKEPWEVHGVFLGSWGDEEPVNLQIGSQA--NDSTPACWPVIDSLHIKT 200
           S  +++  QHYGKE W+ +G +LG WG +EP+NL I SQ+  +D    CWPVID L +KT
Sbjct: 561 SHGVYSLKQHYGKETWKSYGHYLGRWGQDEPINLVIRSQSTESDDNSTCWPVIDMLLLKT 620

Query: 201 M-------GNLVVNGGFEFGPDFLESVEGGVLLDSVPTPLFSPLAQWAILGTVRYINSKH 260
                    NL++NGGFEFGPDFL +   GVLL+S P+P+ S L QW+++GTV+YI+SKH
Sbjct: 621 SKTLVQGNDNLLLNGGFEFGPDFLSNSTEGVLLESAPSPIQSALQQWSVIGTVKYIDSKH 680

Query: 261 FFVPQGNAAVELISGASSGVQAAVKL--QAGSYTLNFTLGDANDSCEAKFLVGVQAGSGS 320
           F+VP+GNAA+E++S  S+G+Q A  +  +  +Y L+FTLGDA D+CE  F+V VQAGS  
Sbjct: 681 FYVPKGNAAIEIVS-VSAGIQTATTMLTEGSAYNLDFTLGDAKDACEGMFVVRVQAGSLV 740

Query: 321 QNFTLESNGTGSAVKFSMPFNAAPDDNTITFLSYTTSKTKDGDFCGPVIDDVFLRASHGL 380
           QNFT++S GTGS +K S+ F A      I+F+SY  ++TKDG FCGP+IDDV LRASHG 
Sbjct: 741 QNFTVQSLGTGSVIKHSVTFKAGSGSTPISFISYNINQTKDGVFCGPLIDDVVLRASHGF 800

Query: 381 RILMPWKTLI-PLCLITIL 387
           ++ +  + LI  L L+ IL
Sbjct: 801 KLQLRLEILIYALVLVAIL 818

BLAST of Cp4.1LG04g05880 vs. TAIR10
Match: AT5G14150.1 (AT5G14150.1 Protein of unknown function, DUF642)

HSP 1 Score: 379.0 bits (972), Expect = 3.5e-105
Identity = 198/382 (51.83%), Positives = 260/382 (68.06%), Query Frame = 1

Query: 22  LLLLLLSAGVTSRELLKNSDFESPPSNLPENSNKTSVKLNENSTIPGWTFQGTGEYITVS 81
           + LLLL +   S + L+N DFESPP NLP NSN +SV L++NST+PGWTFQGT  Y+   
Sbjct: 8   IFLLLLVSCCASSDFLENPDFESPPLNLPTNSNASSVSLDQNSTLPGWTFQGTVLYV--- 67

Query: 82  ENISLPEKGHAILLGEDGRINQTFIADVDFLNYLLTFALAPGGQNCSSTAPLLVSAPDSD 141
               LP+ GHA+ LGEDG+INQTFIA  D LNY+LTFAL   GQNC+S+A L VS PDS+
Sbjct: 68  ---ELPDTGHAVQLGEDGKINQTFIAKGDELNYILTFALIHAGQNCTSSAGLSVSGPDSN 127

Query: 142 AMFTFSQHYGKEPWEVHGVFLGSWGDEEPVNLQIGSQA----NDSTPACWPVIDSLHIKT 201
           A+F++ Q+Y K  W+ +   LGSWG+ EP+NL + SQA    +D+   CWP+ID+L IKT
Sbjct: 128 AVFSYRQNYSKVSWQSYSHNLGSWGNGEPINLVLESQAIDSDSDTNSTCWPIIDTLLIKT 187

Query: 202 M--------GNLVVNGGFEFGPDFLESVEGGVLLDSVPTPLFSPLAQWAILGTVRYINSK 261
           +        GNL++NGGFE GP FL +   GVL+D+VP+ + SPL QW+++GTVRYI+S+
Sbjct: 188 VGVTLVQDSGNLLINGGFESGPGFLPNSTDGVLIDAVPSLIQSPLRQWSVIGTVRYIDSE 247

Query: 262 HFFVPQGNAAVELISG-ASSGVQAAVK--LQAGSYTLNFTLGDANDSCEAKFLVGVQAGS 321
           HF VP+G AA+E++S  A SG+Q A K   +   Y L FTLGDAND+C   F+VG QAGS
Sbjct: 248 HFHVPEGKAAIEILSNTAPSGIQTATKGTSEGSRYNLTFTLGDANDACRGHFVVGAQAGS 307

Query: 322 GSQNFTLESNGTGSAVKFSMPFNAAPDDNTITFLSYTTSKTKDGDFCGPVIDDVFLRASH 381
            +QNFTLESNGTGS  KF + F A  D   I+F SY+ + TK+   CGPVID+V +    
Sbjct: 308 VTQNFTLESNGTGSGEKFGLVFEADKDAAQISFTSYSVTMTKENVVCGPVIDEVMVHPLG 367

Query: 382 GLRILMPWKTLIPLCLITILFL 389
           G   + P   L+   L+ +  L
Sbjct: 368 GTASVKPTWLLLIFALLYVAVL 383

BLAST of Cp4.1LG04g05880 vs. TAIR10
Match: AT2G34510.1 (AT2G34510.1 Protein of unknown function, DUF642)

HSP 1 Score: 159.8 bits (403), Expect = 3.3e-39
Identity = 132/406 (32.51%), Positives = 187/406 (46.06%), Query Frame = 1

Query: 11  SNQPWRLPETLLLLLLLS-------AGVTSRE---LLKNSDFESPPSN-LPENSNKTSVK 70
           SN  WR    L+LLL LS       AG TS     L+ N DFE+PPSN  P+++      
Sbjct: 5   SNNSWRSNSILILLLGLSIVAAADSAGKTSPVEDGLVVNGDFETPPSNGFPDDAI----- 64

Query: 71  LNENSTIPGWTFQGTGEYITVSEN-----ISLPEKGHAILLGEDGRINQTFIADVDFLNY 130
           + + S IP W   GT E I   +      + +PE  HA+ LG D  I+Q    +   + Y
Sbjct: 65  IEDTSEIPSWRSDGTVELIKSGQKQGGMILIVPEGRHAVRLGNDAEISQELTVEKGSI-Y 124

Query: 131 LLTFALAPGGQNCSSTAPLLVSAPDSDAMFTFSQHYGKEPWEVHGVFLGSWGDE---EPV 190
            +TF+ A   + C+    L VS   SD          +  + V G    +W  E   + V
Sbjct: 125 SVTFSAA---RTCAQLESLNVSVASSDEPIASQTIDLQTVYSVQGWDPYAWAFEAVVDRV 184

Query: 191 NLQIGSQANDSTPACWPVIDSLHIKTM-------GNLVVNGGFEFGPDFLESVEGGVLLD 250
            L   +   +  P C P+ID + +K +       GN V+NG FE GP    +   GVLL 
Sbjct: 185 RLVFKNPGMEDDPTCGPIIDDIAVKKLFTPDKPKGNAVINGDFEEGPWMFRNTTLGVLLP 244

Query: 251 SVPTPLFSPLAQWAILGT--VRYINSKHFFVPQGNAAVELISGASSGVQAAVKLQAG-SY 310
           +      S L  W +     VR+I+S HF VP+G  A+EL+SG    +   V+ +A   Y
Sbjct: 245 TNLDEEISSLPGWTVESNRAVRFIDSDHFSVPEGKRALELLSGKEGIISQMVETKANIPY 304

Query: 311 TLNFTLGDANDSCEAKFLVGVQAGSGSQNFTLESNGTGSAVKFSMPFNAAPDDNTITFLS 370
            ++F+LG A D C+    V   AG  +QNF   +    S  +  + F A  +   I F S
Sbjct: 305 KMSFSLGHAGDKCKEPLAVMAFAGDQAQNFHYMAQANSSFERSELNFTAKAERTRIAFYS 364

Query: 371 -YTTSKTKD-GDFCGPVIDDVFLRASHGLRILMPWKTLIPLCLITI 386
            Y  ++T D    CGPVIDDV +  S   RI   +   I L L+ I
Sbjct: 365 IYYNTRTDDMTSLCGPVIDDVKVWFSGSSRIGFSFPLFILLSLVFI 401

BLAST of Cp4.1LG04g05880 vs. TAIR10
Match: AT1G80240.1 (AT1G80240.1 Protein of unknown function, DUF642)

HSP 1 Score: 152.9 bits (385), Expect = 4.0e-37
Identity = 114/371 (30.73%), Positives = 167/371 (45.01%), Query Frame = 1

Query: 21  LLLLLLLSAGVTSRE-----LLKNSDFESPPSNLPENSNKTSVKLNENSTIPGWTFQGTG 80
           LL LL +S+ V         LL N +FE  P   P     + VK  E + +P W   G  
Sbjct: 9   LLALLFISSNVVLSAPVRDGLLPNGNFELGPK--PSQMKGSVVK--ERTAVPNWNIIGFV 68

Query: 81  EYITVSEN-----ISLPEKGHAILLGEDGRINQTFIADVDFLNYLLTFALAPGGQNCSST 140
           E+I   +      + +P+   A+ LG +  I+Q  I+ +    Y +TF+ A   + C+  
Sbjct: 69  EFIKSGQKQDDMVLVVPQGSSAVRLGNEASISQK-ISVLPGRLYSITFSAA---RTCAQD 128

Query: 141 APLLVSAPDSDAMFTFSQHYGKEPWEVHGVFLGSWGDEEPVNLQIGSQANDSTPACWPVI 200
             L +S      +      YG + W+ +     + G E  + ++  +   +  PAC P+I
Sbjct: 129 ERLNISVTHESGVIPIQTMYGSDGWDSYSWAFKAGGPE--IEIRFHNPGVEEHPACGPLI 188

Query: 201 DSLHIKTMG-------NLVVNGGFEFGPDFLESVEGGVLLDSVPTPLFSPLAQWAI--LG 260
           D++ IK +        NL+ NG FE GP    + + GVL+        SPL  W I  L 
Sbjct: 189 DAVAIKALFPPRFSGYNLIKNGNFEEGPYVFPTAKWGVLIPPFIEDDNSPLPGWMIESLK 248

Query: 261 TVRYINSKHFFVPQGNAAVELISGASSGVQAAVKLQAGS-YTLNFTLGDANDSCEAKFLV 320
            V+Y++  HF VP+G+ A+EL+ G  S +   V+      Y L F +GDA D CE   +V
Sbjct: 249 AVKYVDKAHFAVPEGHRAIELVGGKESAISQIVRTSLNKFYALTFNVGDARDGCEGPMIV 308

Query: 321 GVQAGSGSQNFTLESNGTGSAVKFSMPFNAAPDDNTITFLS--YTTSKTKDGDFCGPVID 370
              AG G       S G G   +  + F A      +TFLS  Y       G  CGPVID
Sbjct: 309 EAFAGQGKVMVDYASKGKGGFRRGRLVFKAVSARTRVTFLSTFYHMKSDHSGSLCGPVID 368

BLAST of Cp4.1LG04g05880 vs. TAIR10
Match: AT5G11420.1 (AT5G11420.1 Protein of unknown function, DUF642)

HSP 1 Score: 147.1 bits (370), Expect = 2.2e-35
Identity = 112/362 (30.94%), Positives = 164/362 (45.30%), Query Frame = 1

Query: 21  LLLLLLLSAGVTSRELLKNSDFESPPSNLPENSNKTSVKLNENSTIPGWTFQGTGEYITV 80
           LL+  + S    S  +L N DFE  P   P +   T V +N+ + IP W   G  EYI  
Sbjct: 12  LLIATITSVICFSDGMLPNGDFELGPK--PSDMKGTQV-INKKA-IPSWELSGFVEYIKS 71

Query: 81  SEN-----ISLPEKGHAILLGEDGRINQTFIADVDFLNYLLTFALAPGGQNCSSTAPLLV 140
            +      + +P    AI LG +  I Q        + Y LTF+ A   + C+    L +
Sbjct: 72  GQKQGDMLLVVPAGKFAIRLGNEASIKQRLNVTKG-MYYSLTFSAA---RTCAQDERLNI 131

Query: 141 SAPDSDAMFTFSQHYGKEPWEVHGVFLGSWGDEEPVNLQ---IGSQANDSTPACWPVIDS 200
           S      +      Y    W+++     +W  +   N+    I +   +  PAC P+ID 
Sbjct: 132 SVAPDSGVIPIQTVYSSSGWDLY-----AWAFQAESNVAEIVIHNPGEEEDPACGPLIDG 191

Query: 201 LHIK-------TMGNLVVNGGFEFGPDFLESVEGGVLLDSVPTPLFSPLAQWAI--LGTV 260
           + IK       T  N++ NGGFE GP  L +   GVL+        SPL  W +  L  +
Sbjct: 192 VAIKALYPPRPTNKNILKNGGFEEGPYVLPNATTGVLVPPFIEDDHSPLPAWMVESLKAI 251

Query: 261 RYINSKHFFVPQGNAAVELISGASSGVQAAVKLQAG-SYTLNFTLGDANDSCEAKFLVGV 320
           +Y++ +HF VPQG  AVEL++G  S +    +   G +Y L+F +GDAN++C+   +V  
Sbjct: 252 KYVDVEHFSVPQGRRAVELVAGKESAIAQVARTVVGKTYVLSFAVGDANNACQGSMVVEA 311

Query: 321 QAGSGSQNFTLESNGTGSAVKFSMPFNAAPDDNTITFLSYTTSKTKD--GDFCGPVIDDV 363
            AG  +     ES G G   + S+ F A      + F S   S   D     CGPVIDDV
Sbjct: 312 FAGKDTLKVPYESRGKGGFKRASLRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDV 360

BLAST of Cp4.1LG04g05880 vs. TAIR10
Match: AT4G32460.1 (AT4G32460.1 Protein of unknown function, DUF642)

HSP 1 Score: 146.0 bits (367), Expect = 4.9e-35
Identity = 108/347 (31.12%), Positives = 159/347 (45.82%), Query Frame = 1

Query: 36  LLKNSDFESPPSNLPENSNKTSVKLNENSTIPGWTFQGTGEYITVSEN-----ISLPEKG 95
           LL N DFE  P     +S+    ++   + IP W   G  EYI          + +P+  
Sbjct: 26  LLPNGDFELGP----RHSDMKGTQVINITAIPNWELSGFVEYIPSGHKQGDMILVVPKGA 85

Query: 96  HAILLGEDGRINQTFIADVDFLNYLLTFALAPGGQNCSSTAPLLVSAPDSDAMFTFSQHY 155
            A+ LG +  I Q  I+      Y +TF+ A   + C+    L VS     A+      Y
Sbjct: 86  FAVRLGNEASIKQK-ISVKKGSYYSITFSAA---RTCAQDERLNVSVAPHHAVMPIQTVY 145

Query: 156 GKEPWEVHGVFLGSWG---DEEPVNLQIGSQANDSTPACWPVIDSLHIK-------TMGN 215
               W+++     SW      +  ++ I +   +  PAC P+ID + ++       T  N
Sbjct: 146 SSSGWDLY-----SWAFKAQSDYADIVIHNPGVEEDPACGPLIDGVAMRALFPPRPTNKN 205

Query: 216 LVVNGGFEFGPDFLESVEGGVLLDSVPTPLFSPLAQWAI--LGTVRYINSKHFFVPQGNA 275
           ++ NGGFE GP  L ++  GVL+        SPL  W +  L  V+YI+S HF VPQG  
Sbjct: 206 ILKNGGFEEGPWVLPNISSGVLIPPNSIDDHSPLPGWMVESLKAVKYIDSDHFSVPQGRR 265

Query: 276 AVELISGASSGVQAAVKLQAG-SYTLNFTLGDANDSCEAKFLVGVQAGSGSQNFTLESNG 335
           AVEL++G  S V   V+   G +Y L+F++GDA+++C    +V   AG  +     ES G
Sbjct: 266 AVELVAGKESAVAQVVRTIPGKTYVLSFSVGDASNACAGSMIVEAFAGKDTIKVPYESKG 325

Query: 336 TGSAVKFSMPFNAAPDDNTITFLS--YTTSKTKDGDFCGPVIDDVFL 363
            G   + S+ F A      + F S  Y          CGPVIDDV L
Sbjct: 326 KGGFKRSSLRFVAVSSRTRVMFYSTFYAMRNDDFSSLCGPVIDDVKL 359

BLAST of Cp4.1LG04g05880 vs. NCBI nr
Match: gi|659098361|ref|XP_008450101.1| (PREDICTED: uncharacterized protein LOC103491788 [Cucumis melo])

HSP 1 Score: 620.9 bits (1600), Expect = 1.5e-174
Identity = 306/396 (77.27%), Positives = 338/396 (85.35%), Query Frame = 1

Query: 1   MALQFPSPSSSNQPWRLPETLLLLLLLSAGVTSRELLKNSDFESPPSNLPENSNKTSVKL 60
           + + F  PSSSN  WRLPE LL LLL+S GVTSRE LKN+DFESPPSNLPENSNKTSV L
Sbjct: 28  VTMAFWLPSSSNHSWRLPEILLFLLLVSTGVTSREFLKNADFESPPSNLPENSNKTSVAL 87

Query: 61  NENSTIPGWTFQGTGEYITV--SENISLPEKGHAILLGEDGRINQTFIADVDFLNYLLTF 120
           N+N+TIPGWTFQG  EYIT   ++NISLP+KGHAILLGEDG+INQTF AD D L YLLTF
Sbjct: 88  NKNNTIPGWTFQGAVEYITADQTKNISLPDKGHAILLGEDGKINQTFTADADILTYLLTF 147

Query: 121 ALAPGGQNCSSTAPLLVSAPDSDAMFTFSQHYGKEPWEVHGVFLGSWGDEEPVNLQIGSQ 180
           AL PGG NCS TAPL +SAPD+DA+F+FSQHYGKEPWEVHGV+LGSWGD E VNL+I SQ
Sbjct: 148 ALVPGGHNCSLTAPLQISAPDTDALFSFSQHYGKEPWEVHGVYLGSWGDREFVNLEILSQ 207

Query: 181 ANDSTPACWPVIDSLHIKTMG-------NLVVNGGFEFGPDFLESVEGGVLLDSVPTPLF 240
           +NDSTP CWP +DSLHIKTMG       +LVVNGGFE+GPDFLES E G+LLDS PTP F
Sbjct: 208 SNDSTPTCWPAVDSLHIKTMGIVMPDGDSLVVNGGFEYGPDFLESSEEGILLDSAPTPFF 267

Query: 241 SPLAQWAILGTVRYINSKHFFVPQGNAAVELISGASSGVQAAVKLQAG-SYTLNFTLGDA 300
           SPL QWAILG VRYINSKHFFVPQGNAAVEL+SG SSGVQAA KLQAG SYTL+FTLGDA
Sbjct: 268 SPLIQWAILGKVRYINSKHFFVPQGNAAVELVSGVSSGVQAATKLQAGSSYTLSFTLGDA 327

Query: 301 NDSCEAKFLVGVQAGSGSQNFTLESNGTGSAVKFSMPFNAAPDDNTITFLSYTTSKTKDG 360
           NDSC+A FLVG QAG  S+NFTLESNGTGSA KF+M F A PD NTIT LSYTTS+TKDG
Sbjct: 328 NDSCKATFLVGAQAGLTSRNFTLESNGTGSAAKFNMTFTAGPDVNTITLLSYTTSQTKDG 387

Query: 361 DFCGPVIDDVFLRASHGLRILMPWKTLIPLCLITIL 387
           DFCGPVIDDV LR SHGLR+ +PWK+LIP+CLITI+
Sbjct: 388 DFCGPVIDDVILRVSHGLRVSVPWKSLIPVCLITIV 423

BLAST of Cp4.1LG04g05880 vs. NCBI nr
Match: gi|449455950|ref|XP_004145713.1| (PREDICTED: uncharacterized protein LOC101207350 [Cucumis sativus])

HSP 1 Score: 607.8 bits (1566), Expect = 1.3e-170
Identity = 307/397 (77.33%), Positives = 332/397 (83.63%), Query Frame = 1

Query: 1   MALQFPSPSSS-NQPWRLPETLLLLLLLSAGVTSRELLKNSDFESPPSNLPENSNKTSVK 60
           MA   PS SSS N  W LPE +LL LL+S GVTSRE LKN+DFESPPSN PENSNKTSV 
Sbjct: 1   MAFWLPSSSSSSNHSWLLPE-ILLFLLVSTGVTSREFLKNADFESPPSNFPENSNKTSVA 60

Query: 61  LNENSTIPGWTFQGTGEYITVSE--NISLPEKGHAILLGEDGRINQTFIADVDFLNYLLT 120
           L EN+T PGWTFQG  EYITV +  NISLP+KGHAILLGEDG+INQTF AD D L YLLT
Sbjct: 61  LKENNTFPGWTFQGAVEYITVDQIKNISLPDKGHAILLGEDGKINQTFTADADILTYLLT 120

Query: 121 FALAPGGQNCSSTAPLLVSAPDSDAMFTFSQHYGKEPWEVHGVFLGSWGDEEPVNLQIGS 180
           FALAPGG NCS TAPL +SAPDSDA+F+FSQHYGK+PWEVHGV+LGSWGD E VNL+I S
Sbjct: 121 FALAPGGHNCSLTAPLQISAPDSDALFSFSQHYGKQPWEVHGVYLGSWGDRESVNLEIMS 180

Query: 181 QANDSTPACWPVIDSLHIKTMG-------NLVVNGGFEFGPDFLESVEGGVLLDSVPTPL 240
           Q+NDSTP CWP +DSLHIKTMG       NLVVNGGFE+GPDFLES EGGVLLDSVPT  
Sbjct: 181 QSNDSTPTCWPAVDSLHIKTMGIVMPDGDNLVVNGGFEYGPDFLESSEGGVLLDSVPTTF 240

Query: 241 FSPLAQWAILGTVRYINSKHFFVPQGNAAVELISGASSGVQAAVKLQAG-SYTLNFTLGD 300
           FSPL QWAILG VRYINSKHFFVPQGN AVEL+SG SSG+QA  KLQAG SYTL+FTLGD
Sbjct: 241 FSPLIQWAILGKVRYINSKHFFVPQGNTAVELVSGVSSGLQAVPKLQAGSSYTLSFTLGD 300

Query: 301 ANDSCEAKFLVGVQAGSGSQNFTLESNGTGSAVKFSMPFNAAPDDNTITFLSYTTSKTKD 360
           ANDSC+A FLVG QAG  S+NFTLESNGTGSA KFSM F A PD NTIT LSYTTS+TKD
Sbjct: 301 ANDSCKATFLVGAQAGLTSRNFTLESNGTGSAAKFSMTFTAGPDVNTITLLSYTTSQTKD 360

Query: 361 GDFCGPVIDDVFLRASHGLRILMPWKTLIPLCLITIL 387
           GDFCGPVIDDV LR S GLRI +PWK+LI LCLITI+
Sbjct: 361 GDFCGPVIDDVILRVSRGLRISVPWKSLISLCLITIV 396

BLAST of Cp4.1LG04g05880 vs. NCBI nr
Match: gi|700203043|gb|KGN58176.1| (hypothetical protein Csa_3G585880 [Cucumis sativus])

HSP 1 Score: 607.8 bits (1566), Expect = 1.3e-170
Identity = 307/397 (77.33%), Positives = 332/397 (83.63%), Query Frame = 1

Query: 1   MALQFPSPSSS-NQPWRLPETLLLLLLLSAGVTSRELLKNSDFESPPSNLPENSNKTSVK 60
           MA   PS SSS N  W LPE +LL LL+S GVTSRE LKN+DFESPPSN PENSNKTSV 
Sbjct: 30  MAFWLPSSSSSSNHSWLLPE-ILLFLLVSTGVTSREFLKNADFESPPSNFPENSNKTSVA 89

Query: 61  LNENSTIPGWTFQGTGEYITVSE--NISLPEKGHAILLGEDGRINQTFIADVDFLNYLLT 120
           L EN+T PGWTFQG  EYITV +  NISLP+KGHAILLGEDG+INQTF AD D L YLLT
Sbjct: 90  LKENNTFPGWTFQGAVEYITVDQIKNISLPDKGHAILLGEDGKINQTFTADADILTYLLT 149

Query: 121 FALAPGGQNCSSTAPLLVSAPDSDAMFTFSQHYGKEPWEVHGVFLGSWGDEEPVNLQIGS 180
           FALAPGG NCS TAPL +SAPDSDA+F+FSQHYGK+PWEVHGV+LGSWGD E VNL+I S
Sbjct: 150 FALAPGGHNCSLTAPLQISAPDSDALFSFSQHYGKQPWEVHGVYLGSWGDRESVNLEIMS 209

Query: 181 QANDSTPACWPVIDSLHIKTMG-------NLVVNGGFEFGPDFLESVEGGVLLDSVPTPL 240
           Q+NDSTP CWP +DSLHIKTMG       NLVVNGGFE+GPDFLES EGGVLLDSVPT  
Sbjct: 210 QSNDSTPTCWPAVDSLHIKTMGIVMPDGDNLVVNGGFEYGPDFLESSEGGVLLDSVPTTF 269

Query: 241 FSPLAQWAILGTVRYINSKHFFVPQGNAAVELISGASSGVQAAVKLQAG-SYTLNFTLGD 300
           FSPL QWAILG VRYINSKHFFVPQGN AVEL+SG SSG+QA  KLQAG SYTL+FTLGD
Sbjct: 270 FSPLIQWAILGKVRYINSKHFFVPQGNTAVELVSGVSSGLQAVPKLQAGSSYTLSFTLGD 329

Query: 301 ANDSCEAKFLVGVQAGSGSQNFTLESNGTGSAVKFSMPFNAAPDDNTITFLSYTTSKTKD 360
           ANDSC+A FLVG QAG  S+NFTLESNGTGSA KFSM F A PD NTIT LSYTTS+TKD
Sbjct: 330 ANDSCKATFLVGAQAGLTSRNFTLESNGTGSAAKFSMTFTAGPDVNTITLLSYTTSQTKD 389

Query: 361 GDFCGPVIDDVFLRASHGLRILMPWKTLIPLCLITIL 387
           GDFCGPVIDDV LR S GLRI +PWK+LI LCLITI+
Sbjct: 390 GDFCGPVIDDVILRVSRGLRISVPWKSLISLCLITIV 425

BLAST of Cp4.1LG04g05880 vs. NCBI nr
Match: gi|255536905|ref|XP_002509519.1| (PREDICTED: uncharacterized protein LOC8271454 [Ricinus communis])

HSP 1 Score: 433.7 bits (1114), Expect = 3.4e-118
Identity = 213/381 (55.91%), Positives = 281/381 (73.75%), Query Frame = 1

Query: 18  PETLLLLLLLSAGVTSRELLKNSDFESPPSNLPENSNKTSVKLNENSTIPGWTFQGTGEY 77
           P  L+ LL + +   + +LL+N DFE+PP ++P NS      LNENSTIPGWTF+GT  Y
Sbjct: 3   PVLLVSLLFIGSAFAAADLLQNPDFETPPLHVPRNSTSPFQLLNENSTIPGWTFEGTVVY 62

Query: 78  ITVSENISLPEKGHAILLGEDGRINQTFIADVDFLNYLLTFALAPGGQNCSSTAPLLVSA 137
           +T S+ ++LP  GHAI L +DG+INQTF  +  + +YLLTF LAPGGQNCS++  + VS 
Sbjct: 63  VTASQTVALPGDGHAIQLIQDGKINQTFHPNASYSHYLLTFVLAPGGQNCSNSGSIGVSV 122

Query: 138 PDSDAMFTFSQHYGKEPWEVHGVFLGSWGDEEPVNLQIGSQA--NDSTPACWPVIDSLHI 197
           PD+ A+F+F QHYGKE WE +GV+LGSW ++EP+NL I SQA  +D+   CWPVID L I
Sbjct: 123 PDNHAVFSFKQHYGKEGWETYGVYLGSWEEQEPINLIIESQATESDANSTCWPVIDKLLI 182

Query: 198 KTM-------GNLVVNGGFEFGPDFLESVEGGVLLDSVPTPLFSPLAQWAILGTVRYINS 257
           KT         NL++NGGFEFGP+FL +   G+LLD  P+P+ S L QW+I GTV+YI+S
Sbjct: 183 KTTETLAPGNDNLLLNGGFEFGPEFLFNSTEGILLDPAPSPVLSALRQWSITGTVKYIDS 242

Query: 258 KHFFVPQGNAAVELISGASSGVQAAVKLQAG-SYTLNFTLGDANDSCEAKFLVGVQAGSG 317
           KH+FVP+GNAAVE++SG S+G+Q A+ +  G SY+L FTLGDANDSC   F+VG QAG  
Sbjct: 243 KHYFVPEGNAAVEMVSGVSAGIQTAMTVTEGSSYSLEFTLGDANDSCVGSFIVGAQAGPA 302

Query: 318 SQNFTLESNGTGSAVKFSMPFNAAPDDNTITFLSYTTSKTKDGDFCGPVIDDVFLRASHG 377
           +QNFTL+SNGTGSA K S+ F A     +I+F+SYTT++TKDG FCGPV+D+V LRASH 
Sbjct: 303 AQNFTLQSNGTGSAKKLSLAFKADSMTTSISFVSYTTTQTKDGLFCGPVVDNVVLRASHA 362

Query: 378 LRILMPWKTLIPLC-LITILF 388
           ++ +M W+ LIPL  L+ IL+
Sbjct: 363 IKSVMKWEGLIPLLFLVAILW 383

BLAST of Cp4.1LG04g05880 vs. NCBI nr
Match: gi|590647310|ref|XP_007031865.1| (Emb:CAB87702.1, putative [Theobroma cacao])

HSP 1 Score: 432.2 bits (1110), Expect = 9.8e-118
Identity = 214/380 (56.32%), Positives = 273/380 (71.84%), Query Frame = 1

Query: 18  PETLLLLLLLSAGVTSRELLKNSDFESPPSNLPENSNKTSVKLNENSTIPGWTFQGTGEY 77
           P+    L L   G  S   L+N DFESPP +L EN+    V LNEN+TIPGWTFQGT +Y
Sbjct: 3   PQIFSFLFLFFIGFASAGYLQNPDFESPPKSLTENTGSPFVTLNENNTIPGWTFQGTVQY 62

Query: 78  ITVSENISLPEKGHAILLGEDGRINQTFIADVDFLNYLLTFALAPGGQNCSSTAPLLVSA 137
           +T  + I+LP+ GHA+ LG+DG+INQTF AD D+ NY+LTF LAPGGQNCS+ A +LVS 
Sbjct: 63  VTAGQTIALPDNGHAVQLGQDGKINQTFTADADYTNYILTFTLAPGGQNCSANADVLVSG 122

Query: 138 PDSDAMFTFSQHYGKEPWEVHGVFLGSWGDEEPVNLQIGSQA--NDSTPACWPVIDSLHI 197
           PDS  +F+F QHYGKE W+ +G  LG  G +EP+NL I SQ   +D    CWPVIDSL I
Sbjct: 123 PDSQGIFSFKQHYGKEAWQSYGQHLGLGGQKEPINLVIESQGVESDDNSTCWPVIDSLLI 182

Query: 198 KTMG-------NLVVNGGFEFGPDFLESVEGGVLLDSVPTPLFSPLAQWAILGTVRYINS 257
           KT+G       NL++NGGFEFGP+FL +   G+LLDS  +P+ SPL QWA++GT++YI+S
Sbjct: 183 KTIGTLVQGKDNLLLNGGFEFGPEFLSNSTEGILLDSALSPVLSPLRQWAVVGTIKYIDS 242

Query: 258 KHFFVPQGNAAVELISGASSGVQAAVKLQAGS-YTLNFTLGDANDSCEAKFLVGVQAGSG 317
           KHFFVP GNAAVE++SG S+G+Q  V L AGS Y+L FTLGDAN++C+  F+V V+A S 
Sbjct: 243 KHFFVPHGNAAVEIVSGVSAGIQTDVTLTAGSAYSLEFTLGDANNACKGDFIVEVRAESV 302

Query: 318 SQNFTLESNGTGSAVKFSMPFNAAPDDNTITFLSYTTSKTKDGDFCGPVIDDVFLRASHG 377
            QNFT++SNGTGSA K SM F A      I+F SYTTS+TKDG FCGPV+DDV L +S+ 
Sbjct: 303 VQNFTVQSNGTGSAQKSSMKFEAGSRATRISFFSYTTSQTKDGIFCGPVVDDVLLLSSNC 362

Query: 378 LRILMPWKTLIPLCLITILF 388
           LR+ +    LI L  + ++F
Sbjct: 363 LRLAIKPNILISLLFLILIF 382

The following BLAST results are available for this feature:

BLAST vs. TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0LDL0_CUCSA  9.0e-171  77.33         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G585880 PE=4 SV=1
B9RBI5_RICCO      2.3e-118  55.91         Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1677190 PE=4 SV=1
A0A061ECZ6_THECC  6.8e-118  56.32         Emb:CAB87702.1, putative OS=Theobroma cacao GN=TCM_017188 PE=4 SV=1
V4U344_9ROSI      1.4e-110  54.35         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10015578mg PE=4 SV=1
A0A067FCQ4_CITSI  1.4e-110  54.35         Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g003457mg PE=4 SV=1

BLAST vs. TAIR10
Match Name   E-value   Identity (%)  Description
AT5G14150.1  3.5e-105  51.83         Protein of unknown function, DUF642
AT2G34510.1  3.3e-39   32.51         Protein of unknown function, DUF642
AT1G80240.1  4.0e-37   30.73         Protein of unknown function, DUF642
AT5G11420.1  2.2e-35   30.94         Protein of unknown function, DUF642
AT4G32460.1  4.9e-35   31.12         Protein of unknown function, DUF642

BLAST vs. NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|659098361|ref|XP_008450101.1|  1.5e-174  77.27         PREDICTED: uncharacterized protein LOC103491788 [Cucumis melo]
gi|449455950|ref|XP_004145713.1|  1.3e-170  77.33         PREDICTED: uncharacterized protein LOC101207350 [Cucumis sativus]
gi|700203043|gb|KGN58176.1|       1.3e-170  77.33         hypothetical protein Csa_3G585880 [Cucumis sativus]
gi|255536905|ref|XP_002509519.1|  3.4e-118  55.91         PREDICTED: uncharacterized protein LOC8271454 [Ricinus communis]
gi|590647310|ref|XP_007031865.1|  9.8e-118  56.32         Emb:CAB87702.1, putative [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR008979  Galactose-bd-like_sf
IPR006946  DUF642
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0016020 membrane
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG04g05880.1  Cp4.1LG04g05880.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                    Source   Source Term         Source Description  Alignment
IPR006946  Domain of unknown function DUF642  PFAM     PF04862             DUF642              coord: 36..196, score: 1.3E-33; coord: 201..362, score: 5.9
IPR008979  Galactose-binding domain-like      GENE3D   G3DSA:2.60.120.260  -                   coord: 198..363, score: 3.
None       No IPR available                   PANTHER  PTHR31265           FAMILY NOT NAMED    coord: 6..374, score: 7.9E
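
The alignment coordinates locate each domain match on the protein. The sketch below assumes they are 1-based and inclusive, which the record does not state explicitly, and converts them to a Python slice. The helper `domain_slice` is hypothetical, and only the first 60 residues of the protein are used to keep the example short.

```python
def domain_slice(protein: str, start: int, end: int) -> str:
    """Return residues start..end, treating both bounds as 1-based inclusive."""
    if not (1 <= start <= end <= len(protein)):
        raise ValueError(f"coordinates {start}..{end} fall outside the sequence")
    # A 1-based inclusive range maps to the 0-based half-open slice [start-1, end).
    return protein[start - 1:end]

# N-terminal 60 residues of the protein sequence from this record.
n_term = "MALQFPSPSSSNQPWRLPETLLLLLLLSAGVTSRELLKNSDFESPPSNLPENSNKTSVKL"

# The first five residues of the PFAM match that starts at position 36.
print(domain_slice(n_term, 36, 40))
```

With the full protein sequence, `domain_slice(protein, 36, 196)` would return the whole first PF04862 match under the same assumption.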

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG04g05880  Cp4.1LG03g07000  Cucurbita pepo (Zucchini)  cpecpeB477