LsiUNG001790 (gene) Bottle gourd (USVL1VR-Ls)

Name: LsiUNG001790
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Protein CHUP1, chloroplastic-like protein
Location: chr00 : 3958481 .. 3962239 (+)
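
As an illustration of how the location field can be read programmatically, the sketch below parses the coordinate string and computes the span length. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(text):
    """Parse a string like 'chr00 : 3958481 .. 3962239 (+)'.

    Returns (seqid, start, end, strand). Hypothetical helper, not part
    of any database API.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


seqid, start, end, strand = parse_location("chr00 : 3958481 .. 3962239 (+)")
print(seqid, start, end, strand, span_length(start, end))
# prints: chr00 3958481 3962239 + 3759 (under the inclusive assumption)
```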
The following sequences are available for this feature:

Gene sequence (with intron)

CTCGGACTTGGAGCTACACACCTTTCTTCTCCGCCAAAATCTCTGTCCCCTTCGGCTCCACCTCACTGTCCGAACCTTCTTTCCAAACAAAACAAACCTCGCTCCTTCTTCCATTTTTTCATGTTCGTTTCTTCGACAATGGAACTGGAGAACTTGTCGTGAACTTCTACACTTGGCTGTTTCTCCAACGGAACCATTCTTGTCCTTCCTTCCTTCCTTCCTTTGTTTTTTTAAACTTTATACAGAATGAAGGAAGATAACCCATCAGAAAACGGAGGGAAACCATCTAGGTTTGCTGATCAAAATCAAAATCCCAAGTGTCTAAATCAGAATAATGCCAAAGGAACTACTGGGAATAGTTCGAAATTGAGGGCTGCTTCTTCCTGGGGTTCTCACATTGTCAAAGGTTTCTCCACAGACAAGAGAACTAAAGCTCACAGTAATCTTCAACCCAAGAAAGCACCACCACTTGGGAATTCGGATTTAGCTAATCAGAAGGAGAAGTTTGTTCCTTCCCATTCTCGCATCAAGCGTTCCATCATTGGGGATTTAGCTTGTTCCGCCAATCCTGCTCAAGTTCATCCACAGTCTTATCAGACCCACCGCAGACAATCGTCTCGGGATTTGTTCGTCGAGCTCGATCAACTCAGAAGTTTGCTAAACGAATCTAAGCAGAGGGAATTCGAACTTCAGAACGAACTTGCAGAATTTAAGCGGAATACTAGAAATTATGAACTCGAAAGGGAACTTGAGGAAAAGAAAGCCGAATTAGACGGCCTTACCCAGAAAGTTAGTGTATTGGAGGAAGATAGAAGAGTACTGTCCGAGCAATTAGTGGCTCTATCATCAATTCCTGAGAAGCAAGAAGAGCCACAGACTGCGCCTGTAAACGTAGAGGTGGAAGTTGTTGAGTTGAGACGCTTGAATAAGGAACTTCAGCTTCAGAAGAGGAACCTCGCTTGCAGGCTTTCTTCGGTGGAGTCTGAGCTGGCTTGTCTAGCAAAGAATTCTGAGGTAACATTCAGTCATTTACTAGTGTCTTTGTGTTGAATGATCAAATTTTTTAAATGTTTATGAAACATTCCACAACACCACGAGCTAAGGTTTACCTTTGTTTTACTACCAGAGTGAAGCTATAGCAAAGATCAAAGCAGAGGCATCTTTGCTGAGACACACAAATGAAGATCTGTGCAAGCAAGTGGAAGGTCTGCAGATGAGCAGACTGAATGAGGTTGAGGAACTTGCATACCTTAGGTGGGTCAATTCCTGTTTAAGGAGTGAGCTTCGAAATTCTTGTCCCTCAGCGAATTCTGGTAGCCCATCCAGCCCTCAGCCAATTGAGAGGAGTAGTGAATCAGTTGGTTCATTATCCAGCCAAAAGGAGAACATGGAGTACAGTAGTGCAAAGAGAATAAATTTAATTAAGAAGTTGAAGAAATGGCCTATTACTGATGAGGACTTATCTAATTTAGATCGTTCTGATAATAGTCTTTTAGACAAAAATTGGGTTGATACAGAGGAAGGAAGAAGCCCCAGAAGAAGACACTCCATTAGTGGAGCCAAATGCTGGCCTGAAGAATTGGAGCCAAACAAGAGGAGGCAATCTGATGGCTTTATATGTGCAAAAGAGATGGAAAAGGAAGCAGATCCTCTTTCCTCTCAGAAATATGATTTGGGTGTGATTCAAAGGCCTCATGTTTTGGGAAATTGCCATGACACTAACAGGAGTTTTGCCTCTTTGGATGTGGAGAAACGAGCATTGCGTATACCAAATCCCCCTCCGAGGCCTTCTTGCTCGATTTCTAGTGAACCTAAAGAAGAAAACACAGCTCATGTCCCGCCACCTCTGCCGCCGCCGCCTCCGCCCCCTCCTCTTCCCAAGTTTGCTGTGAGGAGCGCCACAGGAATGGTACAGCGAGCTCCACAAGTTGTTGAATTCTACCATTCACTAATGAAGAGAGATTCTAGAAAAGAATCTTCTAATGGAGCCATATGCAATG
TTCCAGACGTTTCAAATGTCCGGAGCAGCATGATTGGAGAAATTGAGAATCGATCATCTCATTTGCTTGCTGTAAGCTCTCAGTCATATTGTCTTCCATTCTGATTGTAGTTCTCTCATCAGAAGACATAATAATATTTCATACTTTTGCAGATAAAGGCAGATATTGAGACACAGGGAGAGTTTGTAAATTCACTGATAAGAGAGGTCAACAATGCAGTTTATCTGAAGATCGAAGATATTGTGGAATTTGTGAAGTGGCTTGACGATGAACTTTGCTTTCTGGTATTTCTTTAATTCTCTTAACCTTGGTGCATTTGATTGAAACGAATAGGAAAAAAATTGAACATAAGAGAGAAGCAAGTAAATGAATGCATGCTGTAGAATGTAGAGTTGCTATTTTTATGACAATCATATGAAGTTGTTACCCTGCATAATGAATAGATCATTTTGTTTAAGAGTTGAGTATTTTGCTGATGATTAATATTCTTAAAGGTTGATGAAAGGGCAGTTCTGAAGCACTTTGATTGGCCAGAGAGAAAGGCTGACACCTTGCGAGAAGCAGCCTTTGGGTACAGAGATCTAAAGAAATTGGAGTGTGAAATCTCAGCCTACAAAGATGATCCCAGATTGCCTTGTGACATTGCTCTCAAAAAAATGGTTGCTTTATCAGAGAAGTAAGGATCTAAAGTTTCTCACATCTGAATTTGTCTGTATTCTTTGAAGTCAATGATTATAAACAATGACTTTGTGAAATCAGGATGGAGCGTAGTAGTTATAACCTTCTCCGGATGAGAGAATCATTGATGCGAAATTGCAAAGAGTTCCAAATTCCCACAGATTGGATGCTTGACTATGGAATTATAAGCAAGGTAAAACTTCTTAATATATCATAAGCCTCTGGAAAAGCAGTTTGAAGAGTCGACAAGACGATGTCTTGTTATTGCTGGATAATTTCTTAATAACAACCATAGTCTATGATGTATTTGAACAAAACCAAAATTGAACTCTATAATGTAGATAAACACAGATGTCTTGTTATTGCTGGATAATTTCTCAGTAGCAACCAAAATCTAGATCTATATGAACAAATCAAAATTGAACTCTATGTTGTAGATAAAGTTGGGTTCGGTGAAGTTGGCAAAAATGTACATGAAGAGAATAGCAATGGAACTTCAATCAAAGGCTTCATCAGAGAAAGATCCCGCAATGGATTACATGCTTCTTCAAGGAGTGAGATTTGCCTTTAGAATTCATCAGGTAGCTATTAGTTAAGCCTTCTTTCACTGCATTTTTCTTCAGGAAAACACAAACAAAGGCTGTTGCTGATTGTGTTTTTGCCTCTCCCGGCTACAGTTTGCAGGAGGGTTCGATGCCGAAACAATGCATGCATTTGAGGATCTGCGAAACTTGGCCAACCTTCTGAACAAAAAGTGAAAGTTATACATTACAGGACACAAAGCAAATGGAAATCAGTTGTTTTAGGTTTACTAACTCGGTAAGCAGCTAGCTACCATGCTGGCAATACATTCATCAGGTCAGGGAACTATATTTTTCTCTTCAATGGGAAGGTTTTGGATTTGGAACTTAGAGCTGAGCGAGCTGCTGGGGAAGCCAGATCTTTAGCGGACTATATTTTTGTCCTCTGCCGTCCTTCCTCCTCTCTTCATGAAGCAAGCTCCTCCAAAGCACAGGCAGGTCCTACGGAAAGAGAGGTGGCAAACTGCCAGAGGGAGGAGGGTCGTTTTAATATATATTAA

mRNA sequence

CTCGGACTTGGAGCTACACACCTTTCTTCTCCGCCAAAATCTCTGTCCCCTTCGGCTCCACCTCACTGTCCGAACCTTCTTTCCAAACAAAACAAACCTCGCTCCTTCTTCCATTTTTTCATGTTCGTTTCTTCGACAATGGAACTGGAGAACTTGTCGTGAACTTCTACACTTGGCTGTTTCTCCAACGGAACCATTCTTGTCCTTCCTTCCTTCCTTCCTTTGTTTTTTTAAACTTTATACAGAATGAAGGAAGATAACCCATCAGAAAACGGAGGGAAACCATCTAGGTTTGCTGATCAAAATCAAAATCCCAAGTGTCTAAATCAGAATAATGCCAAAGGAACTACTGGGAATAGTTCGAAATTGAGGGCTGCTTCTTCCTGGGGTTCTCACATTGTCAAAGGTTTCTCCACAGACAAGAGAACTAAAGCTCACAGTAATCTTCAACCCAAGAAAGCACCACCACTTGGGAATTCGGATTTAGCTAATCAGAAGGAGAAGTTTGTTCCTTCCCATTCTCGCATCAAGCGTTCCATCATTGGGGATTTAGCTTGTTCCGCCAATCCTGCTCAAGTTCATCCACAGTCTTATCAGACCCACCGCAGACAATCGTCTCGGGATTTGTTCGTCGAGCTCGATCAACTCAGAAGTTTGCTAAACGAATCTAAGCAGAGGGAATTCGAACTTCAGAACGAACTTGCAGAATTTAAGCGGAATACTAGAAATTATGAACTCGAAAGGGAACTTGAGGAAAAGAAAGCCGAATTAGACGGCCTTACCCAGAAAGTTAGTGTATTGGAGGAAGATAGAAGAGTACTGTCCGAGCAATTAGTGGCTCTATCATCAATTCCTGAGAAGCAAGAAGAGCCACAGACTGCGCCTGTAAACGTAGAGGTGGAAGTTGTTGAGTTGAGACGCTTGAATAAGGAACTTCAGCTTCAGAAGAGGAACCTCGCTTGCAGGCTTTCTTCGGTGGAGTCTGAGCTGGCTTGTCTAGCAAAGAATTCTGAGAGTGAAGCTATAGCAAAGATCAAAGCAGAGGCATCTTTGCTGAGACACACAAATGAAGATCTGTGCAAGCAAGTGGAAGGTCTGCAGATGAGCAGACTGAATGAGGTTGAGGAACTTGCATACCTTAGGTGGGTCAATTCCTGTTTAAGGAGTGAGCTTCGAAATTCTTGTCCCTCAGCGAATTCTGGTAGCCCATCCAGCCCTCAGCCAATTGAGAGGAGTAGTGAATCAGTTGGTTCATTATCCAGCCAAAAGGAGAACATGGAGTACAGTAGTGCAAAGAGAATAAATTTAATTAAGAAGTTGAAGAAATGGCCTATTACTGATGAGGACTTATCTAATTTAGATCGTTCTGATAATAGTCTTTTAGACAAAAATTGGGTTGATACAGAGGAAGGAAGAAGCCCCAGAAGAAGACACTCCATTAGTGGAGCCAAATGCTGGCCTGAAGAATTGGAGCCAAACAAGAGGAGGCAATCTGATGGCTTTATATGTGCAAAAGAGATGGAAAAGGAAGCAGATCCTCTTTCCTCTCAGAAATATGATTTGGGTGTGATTCAAAGGCCTCATGTTTTGGGAAATTGCCATGACACTAACAGGAGTTTTGCCTCTTTGGATGTGGAGAAACGAGCATTGCGTATACCAAATCCCCCTCCGAGGCCTTCTTGCTCGATTTCTAGTGAACCTAAAGAAGAAAACACAGCTCATGTCCCGCCACCTCTGCCGCCGCCGCCTCCGCCCCCTCCTCTTCCCAAGTTTGCTGTGAGGAGCGCCACAGGAATGGTACAGCGAGCTCCACAAGTTGTTGAATTCTACCATTCACTAATGAAGAGAGATTCTAGAAAAGAATCTTCTAATGGAGCCATATGCAATGTTCCAGACGTTTCAAATGTCCGGAGCAGCATGATTGGAGAAATTGAGAATCGATCATCTCATTTGCTTGCTATAAAGGCAGATATTGAGACACAGGGAGAGTTTGTAAATTC
ACTGATAAGAGAGGTCAACAATGCAGTTTATCTGAAGATCGAAGATATTGTGGAATTTGTGAAGTGGCTTGACGATGAACTTTGCTTTCTGGTTGATGAAAGGGCAGTTCTGAAGCACTTTGATTGGCCAGAGAGAAAGGCTGACACCTTGCGAGAAGCAGCCTTTGGGTACAGAGATCTAAAGAAATTGGAGTGTGAAATCTCAGCCTACAAAGATGATCCCAGATTGCCTTGTGACATTGCTCTCAAAAAAATGGTTGCTTTATCAGAGAAGATGGAGCGTAGTAGTTATAACCTTCTCCGGATGAGAGAATCATTGATGCGAAATTGCAAAGAGTTCCAAATTCCCACAGATTGGATGCTTGACTATGGAATTATAAGCAAGATAAAGTTGGGTTCGGTGAAGTTGGCAAAAATGTACATGAAGAGAATAGCAATGGAACTTCAATCAAAGGCTTCATCAGAGAAAGATCCCGCAATGGATTACATGCTTCTTCAAGGAGTGAGATTTGCCTTTAGAATTCATCAGTTTGCAGGAGGGTTCGATGCCGAAACAATGCATGCATTTGAGGATCTGCGAAACTTGGCCAACCTTCTGAACAAAAAGTCAGGGAACTATATTTTTCTCTTCAATGGGAAGGTTTTGGATTTGGAACTTAGAGCTGAGCGAGCTGCTGGGGAAGCCAGATCTTTAGCGGACTATATTTTTGTCCTCTGCCGTCCTTCCTCCTCTCTTCATGAAGCAAGCTCCTCCAAAGCACAGGCAGGTCCTACGGAAAGAGAGGTGGCAAACTGCCAGAGGGAGGAGGGTCGTTTTAATATATATTAA

Coding sequence (CDS)

ATGAAGGAAGATAACCCATCAGAAAACGGAGGGAAACCATCTAGGTTTGCTGATCAAAATCAAAATCCCAAGTGTCTAAATCAGAATAATGCCAAAGGAACTACTGGGAATAGTTCGAAATTGAGGGCTGCTTCTTCCTGGGGTTCTCACATTGTCAAAGGTTTCTCCACAGACAAGAGAACTAAAGCTCACAGTAATCTTCAACCCAAGAAAGCACCACCACTTGGGAATTCGGATTTAGCTAATCAGAAGGAGAAGTTTGTTCCTTCCCATTCTCGCATCAAGCGTTCCATCATTGGGGATTTAGCTTGTTCCGCCAATCCTGCTCAAGTTCATCCACAGTCTTATCAGACCCACCGCAGACAATCGTCTCGGGATTTGTTCGTCGAGCTCGATCAACTCAGAAGTTTGCTAAACGAATCTAAGCAGAGGGAATTCGAACTTCAGAACGAACTTGCAGAATTTAAGCGGAATACTAGAAATTATGAACTCGAAAGGGAACTTGAGGAAAAGAAAGCCGAATTAGACGGCCTTACCCAGAAAGTTAGTGTATTGGAGGAAGATAGAAGAGTACTGTCCGAGCAATTAGTGGCTCTATCATCAATTCCTGAGAAGCAAGAAGAGCCACAGACTGCGCCTGTAAACGTAGAGGTGGAAGTTGTTGAGTTGAGACGCTTGAATAAGGAACTTCAGCTTCAGAAGAGGAACCTCGCTTGCAGGCTTTCTTCGGTGGAGTCTGAGCTGGCTTGTCTAGCAAAGAATTCTGAGAGTGAAGCTATAGCAAAGATCAAAGCAGAGGCATCTTTGCTGAGACACACAAATGAAGATCTGTGCAAGCAAGTGGAAGGTCTGCAGATGAGCAGACTGAATGAGGTTGAGGAACTTGCATACCTTAGGTGGGTCAATTCCTGTTTAAGGAGTGAGCTTCGAAATTCTTGTCCCTCAGCGAATTCTGGTAGCCCATCCAGCCCTCAGCCAATTGAGAGGAGTAGTGAATCAGTTGGTTCATTATCCAGCCAAAAGGAGAACATGGAGTACAGTAGTGCAAAGAGAATAAATTTAATTAAGAAGTTGAAGAAATGGCCTATTACTGATGAGGACTTATCTAATTTAGATCGTTCTGATAATAGTCTTTTAGACAAAAATTGGGTTGATACAGAGGAAGGAAGAAGCCCCAGAAGAAGACACTCCATTAGTGGAGCCAAATGCTGGCCTGAAGAATTGGAGCCAAACAAGAGGAGGCAATCTGATGGCTTTATATGTGCAAAAGAGATGGAAAAGGAAGCAGATCCTCTTTCCTCTCAGAAATATGATTTGGGTGTGATTCAAAGGCCTCATGTTTTGGGAAATTGCCATGACACTAACAGGAGTTTTGCCTCTTTGGATGTGGAGAAACGAGCATTGCGTATACCAAATCCCCCTCCGAGGCCTTCTTGCTCGATTTCTAGTGAACCTAAAGAAGAAAACACAGCTCATGTCCCGCCACCTCTGCCGCCGCCGCCTCCGCCCCCTCCTCTTCCCAAGTTTGCTGTGAGGAGCGCCACAGGAATGGTACAGCGAGCTCCACAAGTTGTTGAATTCTACCATTCACTAATGAAGAGAGATTCTAGAAAAGAATCTTCTAATGGAGCCATATGCAATGTTCCAGACGTTTCAAATGTCCGGAGCAGCATGATTGGAGAAATTGAGAATCGATCATCTCATTTGCTTGCTATAAAGGCAGATATTGAGACACAGGGAGAGTTTGTAAATTCACTGATAAGAGAGGTCAACAATGCAGTTTATCTGAAGATCGAAGATATTGTGGAATTTGTGAAGTGGCTTGACGATGAACTTTGCTTTCTGGTTGATGAAAGGGCAGTTCTGAAGCACTTTGATTGGCCAGAGAGAAAGGCTGACACCTTGCGAGAAGCAGCCTTTGGGTACAGAGATCTAAAGAAATTGGAGTGTGAAATCTCAGCCTACAAAGATGATCCCAGATTGCCTTGTGACATTGCTCT
CAAAAAAATGGTTGCTTTATCAGAGAAGATGGAGCGTAGTAGTTATAACCTTCTCCGGATGAGAGAATCATTGATGCGAAATTGCAAAGAGTTCCAAATTCCCACAGATTGGATGCTTGACTATGGAATTATAAGCAAGATAAAGTTGGGTTCGGTGAAGTTGGCAAAAATGTACATGAAGAGAATAGCAATGGAACTTCAATCAAAGGCTTCATCAGAGAAAGATCCCGCAATGGATTACATGCTTCTTCAAGGAGTGAGATTTGCCTTTAGAATTCATCAGTTTGCAGGAGGGTTCGATGCCGAAACAATGCATGCATTTGAGGATCTGCGAAACTTGGCCAACCTTCTGAACAAAAAGTCAGGGAACTATATTTTTCTCTTCAATGGGAAGGTTTTGGATTTGGAACTTAGAGCTGAGCGAGCTGCTGGGGAAGCCAGATCTTTAGCGGACTATATTTTTGTCCTCTGCCGTCCTTCCTCCTCTCTTCATGAAGCAAGCTCCTCCAAAGCACAGGCAGGTCCTACGGAAAGAGAGGTGGCAAACTGCCAGAGGGAGGAGGGTCGTTTTAATATATATTAA

Protein sequence

MKEDNPSENGGKPSRFADQNQNPKCLNQNNAKGTTGNSSKLRAASSWGSHIVKGFSTDKRTKAHSNLQPKKAPPLGNSDLANQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHRRQSSRDLFVELDQLRSLLNESKQREFELQNELAEFKRNTRNYELERELEEKKAELDGLTQKVSVLEEDRRVLSEQLVALSSIPEKQEEPQTAPVNVEVEVVELRRLNKELQLQKRNLACRLSSVESELACLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRWVNSCLRSELRNSCPSANSGSPSSPQPIERSSESVGSLSSQKENMEYSSAKRINLIKKLKKWPITDEDLSNLDRSDNSLLDKNWVDTEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGFICAKEMEKEADPLSSQKYDLGVIQRPHVLGNCHDTNRSFASLDVEKRALRIPNPPPRPSCSISSEPKEENTAHVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRKESSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIEDIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDYGIISKIKLGSVKLAKMYMKRIAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKKSGNYIFLFNGKVLDLELRAERAAGEARSLADYIFVLCRPSSSLHEASSSKAQAGPTEREVANCQREEGRFNIY
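
The relationship between the coding sequence and the protein sequence can be checked with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1) and uses only the first 45 nucleotides of the CDS listed above; the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a DNA coding sequence codon by codon, stopping at a stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino = CODON_TABLE[cds[i:i + 3].upper()]
        if amino == "*":  # stop codon ends the polypeptide
            break
        protein.append(amino)
    return "".join(protein)


# First 15 codons of the CDS shown in this record.
cds_prefix = "ATGAAGGAAGATAACCCATCAGAAAACGGAGGGAAACCATCTAGG"
print(translate(cds_prefix))  # prints: MKEDNPSENGGKPSR
```

The output matches the first 15 residues of the protein sequence above. Running the same function over the full CDS would be the natural way to confirm the remainder.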
BLAST of LsiUNG001790 vs. Swiss-Prot
Match: CHUP1_ARATH (Protein CHUP1, chloroplastic OS=Arabidopsis thaliana GN=CHUP1 PE=1 SV=1)

HSP 1 Score: 342.8 bits (878), Expect = 1.1e-92
Identity = 190/350 (54.29%), Positives = 240/350 (68.57%), Query Frame = 1

Query: 461 LDVEKRALRIPNPPPRPSCS------ISSEPKEENTAHVPPPLPP-----------PPPP 520
           +D+EKR  R+P PPPR +         S+ P        PPP PP           PPPP
Sbjct: 642 VDIEKRPPRVPRPPPRSAGGGKSTNLPSARPPLPGGGPPPPPPPPGGGPPPPPGGGPPPP 701

Query: 521 PPLPKFAVRSATG--MVQRAPQVVEFYHSLMKRDSRKESSNGAICN-VPDVSNVRSSMIG 580
           PP P    R A G   V RAP++VEFY SLMKR+S+KE +   I +   + S  R++MIG
Sbjct: 702 PPPPGALGRGAGGGNKVHRAPELVEFYQSLMKRESKKEGAPSLISSGTGNSSAARNNMIG 761

Query: 581 EIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIEDIVEFVKWLDDELCFLVDERA 640
           EIENRS+ LLA+KAD+ETQG+FV SL  EV  + +  IED++ FV WLD+EL FLVDERA
Sbjct: 762 EIENRSTFLLAVKADVETQGDFVQSLATEVRASSFTDIEDLLAFVSWLDEELSFLVDERA 821

Query: 641 VLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERS 700
           VLKHFDWPE KAD LREAAF Y+DL KLE +++++ DDP L C+ ALKKM  L EK+E+S
Sbjct: 822 VLKHFDWPEGKADALREAAFEYQDLMKLEKQVTSFVDDPNLSCEPALKKMYKLLEKVEQS 881

Query: 701 SYNLLRMRESLMRNCKEFQIPTDWMLDYGIISKIKLGSVKLAKMYMKRIAMELQSKASSE 760
            Y LLR R+  +   KEF IP DW+ D G++ KIKL SV+LAK YMKR+A EL S + S+
Sbjct: 882 VYALLRTRDMAISRYKEFGIPVDWLSDTGVVGKIKLSSVQLAKKYMKRVAYELDSVSGSD 941

Query: 761 KDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKKSGN 791
           KDP  +++LLQGVRFAFR+HQFAGGFDAE+M AFE+LR+ A   +  + N
Sbjct: 942 KDPNREFLLLQGVRFAFRVHQFAGGFDAESMKAFEELRSRAKTESGDNNN 991
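
The percentages in an HSP summary line follow directly from the match counts over the alignment length. The sketch below recomputes them from the counts reported for this hit; `hsp_percentages` is a hypothetical helper that reads any `label = n/m` pair from the line rather than a parser for a specific BLAST output format.

```python
import re


def hsp_percentages(stats_line):
    """Return (label, percent) pairs for every 'label = n/m' fraction in the line."""
    results = []
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        # Percentage of aligned columns, rounded to two decimals as on the page.
        results.append((label, round(100.0 * int(num) / int(den), 2)))
    return results


line = "Identity = 190/350 (54.29%), Positives = 240/350 (68.57%), Query Frame = 1"
for label, pct in hsp_percentages(line):
    print(label, pct)
# prints:
# Identity 54.29
# Positives 68.57
```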

BLAST of LsiUNG001790 vs. TrEMBL
Match: A0A0A0KMA9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G526260 PE=4 SV=1)

HSP 1 Score: 1442.6 bits (3733), Expect = 0.0e+00
Identity = 747/787 (94.92%), Positives = 762/787 (96.82%), Query Frame = 1

Query: 1   MKEDNPSENGGKPSRFADQNQNPKCLNQNNAKGTTGNSSKLRAASSWGSHIVKGFSTDKR 60
           MKEDNP E  GKPSRFADQNQNPKCLNQNNAKG+TGN SKLRAASSWGSHIVKGFSTDKR
Sbjct: 1   MKEDNPLEIRGKPSRFADQNQNPKCLNQNNAKGSTGNGSKLRAASSWGSHIVKGFSTDKR 60

Query: 61  TKAHSNLQPKKAPPLGNSDLANQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120
           TKA SNLQPKKAPPLGNSDL NQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR
Sbjct: 61  TKAQSNLQPKKAPPLGNSDLVNQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120

Query: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAEFKRNTRNYELERELEEKKAELDGLTQ 180
           RQSSRDLFVELDQLRSLLNESKQREFELQNELAE KRNTRNYELERELEEKK ELD L +
Sbjct: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKVELDSLAK 180

Query: 181 KVSVLEEDRRVLSEQLVALSSIPEKQEEPQTAPVNVEVEVVELRRLNKELQLQKRNLACR 240
           KVSVLEEDRR LSEQLV L S+ EKQEE QTAP NVEVEVVELRRLNKELQLQKRNLACR
Sbjct: 181 KVSVLEEDRRALSEQLVTLPSVSEKQEEQQTAPGNVEVEVVELRRLNKELQLQKRNLACR 240

Query: 241 LSSVESELACLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRW 300
           LSSVESELACLAKNSESEA+AKIKAE SLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRW
Sbjct: 241 LSSVESELACLAKNSESEAVAKIKAEVSLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRW 300

Query: 301 VNSCLRSELRNSCPSANSGSPSSPQPIERSSESVGSLSSQKENMEYSSAKRINLIKKLKK 360
           VNSCLRSELRNS PSANSGSPSSPQP+ERSSE++GSLSSQKE MEYSSAKRINLIKKLKK
Sbjct: 301 VNSCLRSELRNSSPSANSGSPSSPQPVERSSEAIGSLSSQKEYMEYSSAKRINLIKKLKK 360

Query: 361 WPITDEDLSNLDRSDNSLLDKNWVDTEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGFI 420
           WPITDEDLSNLD SDN+LLDKNWVDTEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGF+
Sbjct: 361 WPITDEDLSNLDCSDNNLLDKNWVDTEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGFM 420

Query: 421 CAKEMEKEADPLSSQKYDLGVIQRPHVLGNCHDTNRSFASLDVEKRALRIPNPPPRPSCS 480
           CAKEMEK+ DPLSSQKYDLGVIQRPHVLGNCH+TNR+FASLDVEKRALRIPNPPPRPSCS
Sbjct: 421 CAKEMEKDVDPLSSQKYDLGVIQRPHVLGNCHETNRNFASLDVEKRALRIPNPPPRPSCS 480

Query: 481 ISSEPKEENTAHVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRKES 540
           ISSEPKEEN A VPPPLPPPPPPPPLPKF+VRSATGMVQRAPQVVEFYHSLMKRDSRK+S
Sbjct: 481 ISSEPKEENRAQVPPPLPPPPPPPPLPKFSVRSATGMVQRAPQVVEFYHSLMKRDSRKDS 540

Query: 541 SNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIED 600
           SNG ICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIED
Sbjct: 541 SNGTICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIED 600

Query: 601 IVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPR 660
           IVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPR
Sbjct: 601 IVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPR 660

Query: 661 LPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDYGIISKIKLGSVK 720
           LPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLD GIISKIKLGSVK
Sbjct: 661 LPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDNGIISKIKLGSVK 720

Query: 721 LAKMYMKRIAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNL 780
           LAKMYMKR+AMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNL
Sbjct: 721 LAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNL 780

Query: 781 ANLLNKK 788
           ANLLNKK
Sbjct: 781 ANLLNKK 787

BLAST of LsiUNG001790 vs. TrEMBL
Match: M5WMB7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001630mg PE=4 SV=1)

HSP 1 Score: 1075.5 bits (2780), Expect = 0.0e+00
Identity = 586/803 (72.98%), Positives = 672/803 (83.69%), Query Frame = 1

Query: 1   MKEDNPSENGGKPSRFADQNQNPKCLNQNNAKGTTGNSSKLRAASSWGSHIVKGFSTDKR 60
           M+E+NPSE+  +  +F+DQNQ PKC    N KG + N+SKLR+ASSWGSHIVKG + DK+
Sbjct: 1   MREENPSESRARSIKFSDQNQIPKC---QNVKGNS-NASKLRSASSWGSHIVKGLAGDKK 60

Query: 61  TKAHSNLQPKKAPPLGNSDLANQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120
           TK    +  KK PPL  SD+ANQK  FVPSH R+KRS+IGDL+CS N  QVHPQ + THR
Sbjct: 61  TKVQPIVTNKK-PPLMGSDMANQKNSFVPSHPRVKRSLIGDLSCSVNGNQVHPQMHPTHR 120

Query: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAEFKRNTRNYELERELEEKKAELDGLTQ 180
           RQSSRDLF+ELD LR+LL ESK+REF+LQ EL+E KRN +  +LERELE K+ ELDGL +
Sbjct: 121 RQSSRDLFIELDHLRNLLRESKEREFQLQAELSECKRNPKVLDLERELEVKRIELDGLAR 180

Query: 181 KVSVLEEDRRVLSEQLVALSSIPEKQE------EPQTAPV-----NVEVEVVELRRLNKE 240
           KV +LEE++  LSEQL AL+SI ++ E      E Q + V     +VE+EVVELRRLNKE
Sbjct: 181 KVELLEEEKTSLSEQLSALTSILDRNEGVTLKKEEQESSVASASGSVEMEVVELRRLNKE 240

Query: 241 LQLQKRNLACRLSSVESELACLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRL 300
           LQLQKRNLAC+LSSV S+LA LAK SES+ + KIKAEAS LRHTNEDLCKQVEGLQMSRL
Sbjct: 241 LQLQKRNLACKLSSVTSQLASLAKASESDIVEKIKAEASALRHTNEDLCKQVEGLQMSRL 300

Query: 301 NEVEELAYLRWVNSCLRSELRNS--CPSANSGSPSSPQPIERSSESVGSLSSQK-ENMEY 360
           NEVEELAYLRWVNSCLR+EL+NS  C + NS  P SP   ERSS+S G+L S+  E +EY
Sbjct: 301 NEVEELAYLRWVNSCLRNELQNSNSCSTTNSDKPLSPGSFERSSKSAGALPSRSSEYLEY 360

Query: 361 SSAKRINLIKKLKKWPITDEDLSNLDRSDNSLLDKNWVDTEEGRSPRRRHSISGAKCWPE 420
            S KR+NLIKKLKKWPI DEDL NL+  D  LLDK+WVD+EEGRSPRRRHSISG+KC  E
Sbjct: 361 GSVKRLNLIKKLKKWPIADEDLPNLECPDG-LLDKSWVDSEEGRSPRRRHSISGSKCCAE 420

Query: 421 EL-EPNKRRQSDGFICAKEMEKEADPLSSQKYDLGVIQRPHVLGNCHDTNRSFASLDVEK 480
           EL + NKRRQSDGF+CA+EMEK+ +P++S+ +DL         GNCH+ N+  ASLDVEK
Sbjct: 421 ELVQSNKRRQSDGFMCAQEMEKDTEPVASENFDL-------FFGNCHEINKIPASLDVEK 480

Query: 481 RALRIPNPPPRPSCSISSEPKEENTAHVPPPLPPPPPPPPLPKFAVR-SATGMVQRAPQV 540
           RALRIPNPPPRPSCSIS   K + +A VPPP PPPPPPPP PKFA++ S TGMVQRAPQV
Sbjct: 481 RALRIPNPPPRPSCSISRGTKVDGSAQVPPP-PPPPPPPPPPKFAMKTSTTGMVQRAPQV 540

Query: 541 VEFYHSLMKRDSRKESSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVN 600
           VEFYHSLMKRDSRK+SSNG +C+ PDV+NVRSSMIGEIENRSSHLLAIKAD+ETQGEFVN
Sbjct: 541 VEFYHSLMKRDSRKDSSNGGVCDGPDVANVRSSMIGEIENRSSHLLAIKADVETQGEFVN 600

Query: 601 SLIREVNNAVYLKIEDIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRD 660
           SLIREVNNAVY  I+D+V FVKWLDDELCFLVDERAVLKHFDWPE+KADTLREAAFGYRD
Sbjct: 601 SLIREVNNAVYQNIDDVVAFVKWLDDELCFLVDERAVLKHFDWPEKKADTLREAAFGYRD 660

Query: 661 LKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDW 720
           LKKLE E+S+YK+D RLPCDIALKKMVALSEKMER+ YNLLR RE LMR+CKEFQIPTDW
Sbjct: 661 LKKLESEVSSYKEDIRLPCDIALKKMVALSEKMERTVYNLLRTREPLMRHCKEFQIPTDW 720

Query: 721 MLDYGIISKIKLGSVKLAKMYMKRIAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG 780
           MLD GI+SKIK GSVKLAKMYMKR+AMELQSKA++EKDPAMDYMLLQGVRFAFRIHQFAG
Sbjct: 721 MLDNGILSKIKFGSVKLAKMYMKRVAMELQSKAAAEKDPAMDYMLLQGVRFAFRIHQFAG 780

Query: 781 GFDAETMHAFEDLRNLANLLNKK 788
           GFDA+TMHAFE+LR LA+LLNKK
Sbjct: 781 GFDADTMHAFEELRYLAHLLNKK 789

BLAST of LsiUNG001790 vs. TrEMBL
Match: F6HM16_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_10s0003g05680 PE=4 SV=1)

HSP 1 Score: 1068.5 bits (2762), Expect = 4.1e-309
Identity = 575/798 (72.06%), Positives = 656/798 (82.21%), Query Frame = 1

Query: 1   MKEDNPSENGGKPSRFADQNQNPKCLNQNNAKGTTGNSSKLRAASSWGSHIVKGFSTDKR 60
           M+E+NPSEN  K  +FADQNQ            T+ N S+LR+ASSWGSHIVKGFS DK+
Sbjct: 1   MREENPSENRVKSLKFADQNQGK----------TSSNPSRLRSASSWGSHIVKGFSADKK 60

Query: 61  TKAHSNLQPKKAPPLGNSDLANQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120
            K  +    KK P L +SD+ NQK   V SHSR+KRS+IGDL+CS N +QVHPQ Y ++R
Sbjct: 61  NKLQTTAAAKKVP-LTSSDIVNQKNPLVTSHSRVKRSLIGDLSCSVNASQVHPQLYNSNR 120

Query: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAEFKRNTRNYELERELEEKKAELDGLTQ 180
            +SSRDLF+ELD LRSLL ESK+REF+LQ EL EFKRN    +LERELE KK+E++ L+Q
Sbjct: 121 TKSSRDLFLELDHLRSLLQESKEREFKLQAELLEFKRNPEILDLERELEVKKSEVNELSQ 180

Query: 181 KVSVLEEDRRVLSEQLVALSSIPEKQEEP--------QTAPVN--VEVEVVELRRLNKEL 240
           KV +LE ++  LSEQL  L+SI E++EE          +AP    +E+EVVELRRLNKEL
Sbjct: 181 KVRLLESEKTSLSEQLSGLASIAERREELLKREDLEISSAPSQRTLEMEVVELRRLNKEL 240

Query: 241 QLQKRNLACRLSSVESELACLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLN 300
           QLQKR+L+CRLSS ES+L  L+K  ES+ +A I+AEASLLRHTNEDLCKQVE LQMSRLN
Sbjct: 241 QLQKRDLSCRLSSTESQLNTLSKVYESDTVANIQAEASLLRHTNEDLCKQVEDLQMSRLN 300

Query: 301 EVEELAYLRWVNSCLRSELRNSCPSANSGSPSSPQPIERSSESVGSLSSQKEN-MEYSSA 360
           EVEEL YLRWVNSCLR+ELRNSC   NS   SSP  IE S ES  S S Q ++ +EYSS 
Sbjct: 301 EVEELVYLRWVNSCLRNELRNSCSVTNSDKTSSPNSIEGSRESDSSFSCQTDDSLEYSSI 360

Query: 361 KRINLIKKLKKWPITDEDLSNLDRSDNSLLDKNWVDTEEGRSPRRRHSISGAKCWPEELE 420
           KR+NLIKKLKKWPI  EDL NLD  DN LL+K+WVD EEGRSPRRRHSISG+KC  E+L 
Sbjct: 361 KRLNLIKKLKKWPIISEDLPNLDCPDN-LLEKSWVDPEEGRSPRRRHSISGSKCSAEDLV 420

Query: 421 PNKRRQSDGFICAKEMEKEADPLSSQKYDLGVIQRPHVLGNCHDTNRSFASLDVEKRALR 480
            +KRRQSDGF+C KEMEKEA+PL SQKY+LG++Q+P + GNC +T +  ASLDVEKRALR
Sbjct: 421 QSKRRQSDGFMCPKEMEKEAEPLVSQKYELGIVQKPQLWGNCQETGKFMASLDVEKRALR 480

Query: 481 IPNPPPRPSCSISSEPKEENTAHVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYH 540
           IPNPPPRPS ++SS PKE   A +PPP PPPPPPPP PKF+ RS TG+VQRAPQVVEFYH
Sbjct: 481 IPNPPPRPSGALSSGPKEMVLAQIPPPPPPPPPPPPPPKFSARSTTGIVQRAPQVVEFYH 540

Query: 541 SLMKRDSRKESSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIRE 600
           SLMKRDSRK+SSNG I + PDV+NVRS+MIGEIENRSS+LLAIKAD+ETQGEFVNSLIRE
Sbjct: 541 SLMKRDSRKDSSNGGIYDTPDVANVRSNMIGEIENRSSYLLAIKADVETQGEFVNSLIRE 600

Query: 601 VNNAVYLKIEDIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLE 660
           VNNAVY  IED+V FVKWLDDELCFLVDERAVLKHFDWPE+KADTLREAAFGYRDLKKLE
Sbjct: 601 VNNAVYQNIEDVVAFVKWLDDELCFLVDERAVLKHFDWPEKKADTLREAAFGYRDLKKLE 660

Query: 661 CEISAYKDDPRLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDYG 720
            E+S YKDDPR+PCDIALKKMVALSEKMERS YNL R RESLMRNCKEFQIPTDWMLD G
Sbjct: 661 SEVSYYKDDPRVPCDIALKKMVALSEKMERSVYNLFRTRESLMRNCKEFQIPTDWMLDNG 720

Query: 721 IISKIKLGSVKLAKMYMKRIAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAE 780
           II+KIK GSVKLAK YM+R+AMELQSK + EKDPAMDYMLLQGVRFAFRIHQFAGGFD E
Sbjct: 721 IINKIKFGSVKLAKKYMRRVAMELQSKGAFEKDPAMDYMLLQGVRFAFRIHQFAGGFDVE 780

Query: 781 TMHAFEDLRNLANLLNKK 788
           TMHAFE+LRNLA+LLNKK
Sbjct: 781 TMHAFEELRNLAHLLNKK 786

BLAST of LsiUNG001790 vs. TrEMBL
Match: W9RI54_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_025949 PE=4 SV=1)

HSP 1 Score: 1059.7 bits (2739), Expect = 1.9e-306
Identity = 579/796 (72.74%), Positives = 660/796 (82.91%), Query Frame = 1

Query: 1   MKEDNPSENGGKPSRFADQNQNPKCLNQNNAKGTTGNSSKLRAASSWGSHIVKGFSTDK- 60
           M+E+NP E   K S+F+DQNQ PKC    N K  T NSSKLR    WGS IVK  + +K 
Sbjct: 1   MREENPPETRPKSSKFSDQNQPPKC---QNPKLNTTNSSKLR----WGSQIVKNLAGEKS 60

Query: 61  RTKAHSNLQPKKAPPLGNSDLA-NQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQS--- 120
           +TKA    QP  +  +  SD A NQK     SHSR KRS+IGDLACS N  QVHPQS   
Sbjct: 61  KTKAQLKKQPLAS--MATSDTASNQKNPLAHSHSRAKRSLIGDLACSVNATQVHPQSMTA 120

Query: 121 YQT-HRRQSSRDLFVELDQLRSLLNESKQREFELQNELAEFKRNTRNYELERELEEKKAE 180
           +QT HRRQSSRDLF ELD LRSLL ESK+REF+LQ EL+E+KRN R  ELE E+E K++E
Sbjct: 121 FQTAHRRQSSRDLFTELDHLRSLLQESKEREFKLQAELSEWKRNPRVLELETEVEVKRSE 180

Query: 181 LDGLTQKVSVLEEDRRVLSEQLVALSSIPEKQEEPQTAPVNVEVEVVELRRLNKELQLQK 240
           +DGL +++ +LEE++  LSEQL        ++ E      N+E+EVVELRRLNKELQLQK
Sbjct: 181 VDGLKRRLELLEEEKASLSEQLSGSERERSEESESSVVSQNLEMEVVELRRLNKELQLQK 240

Query: 241 RNLACRLSSVESELACLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEE 300
           RNLACRL+SVES+LA LAK SES+ +AKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEE
Sbjct: 241 RNLACRLASVESQLASLAKASESDIVAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEE 300

Query: 301 LAYLRWVNSCLRSELRNSCPSANS-GSPSSPQPIERSSES-VGSLSSQK-ENMEYSSAKR 360
           LAYLRWVNSCLR+EL+NSC + NS  +P+SP  +ERS+ES  GSLS Q  E +EYSS KR
Sbjct: 301 LAYLRWVNSCLRNELKNSCSTTNSIKTPTSPSSVERSAESSFGSLSCQSNEYLEYSSTKR 360

Query: 361 INLIKKLKKWPITDEDLSNLDRSDNSLLDKNWVDTEEGRSPRRRHSISGAKCWPEELEPN 420
           +NLIKKLKKWP+TDEDLSNL+ SD+SLLDKNWVD EE  SPRRRHSISG+KC  EEL P+
Sbjct: 361 VNLIKKLKKWPLTDEDLSNLECSDHSLLDKNWVDEEERISPRRRHSISGSKCSTEELMPD 420

Query: 421 KRRQSDGFICAKEMEKEADPLSSQKYDLGVIQRPHVLGNCHDTNRSFASLDVEKRALRIP 480
           KRRQSDGF+ ++E EKEA+PL+SQK+DL +  R  + G+CH++ +   S+DVEKRALRIP
Sbjct: 421 KRRQSDGFMLSRETEKEAEPLASQKFDLEIGHRSQLYGSCHESVKFSTSMDVEKRALRIP 480

Query: 481 NPPPRPSCSISSEPKEENTAHVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSL 540
           NPPPRP  S+SS  KEE +A + PP PPPPPPPP PKFA R+ +G  QRAPQVVEFYHSL
Sbjct: 481 NPPPRPYRSVSSGTKEEGSAQILPPPPPPPPPPPPPKFAARNTSGTFQRAPQVVEFYHSL 540

Query: 541 MKRDSRKESSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVN 600
           MKRDSRK+SSNG IC+ PDV+NVRSSMIGEIENRSSHLLAIKAD+ETQGEFVNSLIREVN
Sbjct: 541 MKRDSRKDSSNGGICDAPDVANVRSSMIGEIENRSSHLLAIKADVETQGEFVNSLIREVN 600

Query: 601 NAVYLKIEDIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECE 660
           +AVY  IED+V FVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGY+D+K+LE E
Sbjct: 601 DAVYQNIEDVVAFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYKDIKRLESE 660

Query: 661 ISAYKDDPRLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDYGII 720
           +S+YKDD RLPCDIALKKMVALSEKMER+ YNLLR RESLMRNCKEFQIPTDWMLD GII
Sbjct: 661 VSSYKDDLRLPCDIALKKMVALSEKMERTVYNLLRTRESLMRNCKEFQIPTDWMLDNGII 720

Query: 721 SKIKLGSVKLAKMYMKRIAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETM 780
           SKIK GSVKLAKMYMKR+AMELQSKA+ EKDPAMDYMLLQGVRFAFRIHQFAGGFDAETM
Sbjct: 721 SKIKFGSVKLAKMYMKRVAMELQSKAAIEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETM 780

Query: 781 HAFEDLRNLANLLNKK 788
           HAFE+LRNLA+LLNKK
Sbjct: 781 HAFEELRNLAHLLNKK 787

BLAST of LsiUNG001790 vs. TrEMBL
Match: A0A061G7Q8_THECC (Hydroxyproline-rich glycoprotein family protein isoform 1 OS=Theobroma cacao GN=TCM_026987 PE=4 SV=1)

HSP 1 Score: 1038.9 bits (2685), Expect = 3.5e-300
Identity = 563/797 (70.64%), Positives = 652/797 (81.81%), Query Frame = 1

Query: 1   MKEDNPSENGGKPSRFADQNQNPKCLNQNNAKGTTGNSSKLRAASSWGSHIVKGFSTDKR 60
           M+E+NPSEN  K S+FADQNQ P+    +N K TT  S   +  SSWGSHIVKGF+ DK+
Sbjct: 1   MREENPSENRAKASKFADQNQAPR---SHNTKTTTHQS---KPKSSWGSHIVKGFTADKK 60

Query: 61  TKAHSNLQPKKAPPLGNSDLANQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120
           TK  +   P K   + NSD  NQK   + SHSR+KRS+I DLACS N  QVHPQ YQTHR
Sbjct: 61  TKVQTITVPTKKETISNSDAGNQKNPSLASHSRVKRSLISDLACSVNANQVHPQVYQTHR 120

Query: 121 RQSS--RDLFVELDQLRSLLNESKQREFELQNELAEFKRNTRNYELERELEEKKAELDGL 180
           RQSS  RDLF+ELD +RSLL ESK+RE +LQ ELAE+K N +  +LER+L+ + +E+D L
Sbjct: 121 RQSSGSRDLFIELDHVRSLLQESKERELKLQAELAEWKTNAKVLDLERQLQRRNSEVDDL 180

Query: 181 TQKVSVLEEDRRVLSEQLVALSSIPEKQE-------EPQTAPVNVEVEVVELRRLNKELQ 240
           + +V +LE ++  L EQ+  LSSI E+ E       EPQ+   N+E+EVVELRRLNKELQ
Sbjct: 181 SHRVGLLESEKTSLCEQVATLSSILERNEDNLEISKEPQSIR-NLEMEVVELRRLNKELQ 240

Query: 241 LQKRNLACRLSSVESELACLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNE 300
           LQKRNLAC+LSS+ESELA LAK +ES+ +AKIKAEAS+LRHTNE+L KQVEGLQMSRLNE
Sbjct: 241 LQKRNLACKLSSLESELASLAKANESDVVAKIKAEASMLRHTNENLSKQVEGLQMSRLNE 300

Query: 301 VEELAYLRWVNSCLRSELRNSCPSANSGSPSSPQPIERSSESVGSLSSQK-ENMEYSSAK 360
           VEELAYLRWVNSCLR ELRNSC + N     S  P +   E V + +S   ++ EYSS  
Sbjct: 301 VEELAYLRWVNSCLRDELRNSCSTMNFDKTLS--PAQSKGEYVDTPNSLSCKSPEYSSVM 360

Query: 361 RINLIKKLKKWPITDEDLSNLDRSDNSLLDKNWVDTEEGRSPRRRHSISGAKCWPEELEP 420
           R++LIKKLKKWPI+ +D S+ + + N L+DK+WV  EEGRSP RRHSISG+KC+ EEL P
Sbjct: 361 RLSLIKKLKKWPISSQDFSSTECAAN-LVDKDWVHLEEGRSPGRRHSISGSKCYVEELIP 420

Query: 421 NKRRQSDGFICAKEMEKEADPLSSQKYDLGVIQRPHVLGNCHDTNRSFASLDVEKRALRI 480
           NKRRQSDGF+C KE+E+EA+PLSSQKY  G +QR    GNC +TN+  ASLDVEKRALRI
Sbjct: 421 NKRRQSDGFMCTKEVEREAEPLSSQKY--GSVQRMRFFGNCQETNKPAASLDVEKRALRI 480

Query: 481 PNPPPRPSCSISSEPKEENTAHVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHS 540
           PNPPPRPSCSIS+ PKEE++  +PP  PPPPPPPP PKF+VRS  G+VQRAPQVVEFYHS
Sbjct: 481 PNPPPRPSCSISNGPKEESSTQIPP--PPPPPPPPPPKFSVRSGAGLVQRAPQVVEFYHS 540

Query: 541 LMKRDSRKESSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREV 600
           LMKRDSRK+S+NG IC+VPDV+NVRSSMIGEIENRSSHLLAIKAD+ETQGEFVNSLIREV
Sbjct: 541 LMKRDSRKDSTNGGICDVPDVANVRSSMIGEIENRSSHLLAIKADVETQGEFVNSLIREV 600

Query: 601 NNAVYLKIEDIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLEC 660
           NNAVY  IED+V FVKWLDDELC+LVDERAVLKHF WPE+KADTLREAAFGYRDLKKLE 
Sbjct: 601 NNAVYQNIEDVVAFVKWLDDELCYLVDERAVLKHFAWPEKKADTLREAAFGYRDLKKLES 660

Query: 661 EISAYKDDPRLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDYGI 720
           E+  YKDD R+PCDIALKKMVALSEKMER+ YNLLR RES MRNCK+FQIPTDWMLD GI
Sbjct: 661 EVLYYKDDSRMPCDIALKKMVALSEKMERTVYNLLRTRESSMRNCKQFQIPTDWMLDNGI 720

Query: 721 ISKIKLGSVKLAKMYMKRIAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAET 780
           ISKIKLGSVKLAK YMKR+AMELQ KA+ EKDP+MDYMLLQGVRFAFRIHQFAGGFD+ET
Sbjct: 721 ISKIKLGSVKLAKKYMKRVAMELQLKATLEKDPSMDYMLLQGVRFAFRIHQFAGGFDSET 780

Query: 781 MHAFEDLRNLANLLNKK 788
           MHAFE+LRNLANLLNKK
Sbjct: 781 MHAFEELRNLANLLNKK 783

BLAST of LsiUNG001790 vs. TAIR10
Match: AT3G25690.1 (AT3G25690.1 Hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 342.8 bits (878), Expect = 6.1e-94
Identity = 190/350 (54.29%), Positives = 240/350 (68.57%), Query Frame = 1

Query: 461 LDVEKRALRIPNPPPRPSCS------ISSEPKEENTAHVPPPLPP-----------PPPP 520
           +D+EKR  R+P PPPR +         S+ P        PPP PP           PPPP
Sbjct: 642 VDIEKRPPRVPRPPPRSAGGGKSTNLPSARPPLPGGGPPPPPPPPGGGPPPPPGGGPPPP 701

Query: 521 PPLPKFAVRSATG--MVQRAPQVVEFYHSLMKRDSRKESSNGAICN-VPDVSNVRSSMIG 580
           PP P    R A G   V RAP++VEFY SLMKR+S+KE +   I +   + S  R++MIG
Sbjct: 702 PPPPGALGRGAGGGNKVHRAPELVEFYQSLMKRESKKEGAPSLISSGTGNSSAARNNMIG 761

Query: 581 EIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIEDIVEFVKWLDDELCFLVDERA 640
           EIENRS+ LLA+KAD+ETQG+FV SL  EV  + +  IED++ FV WLD+EL FLVDERA
Sbjct: 762 EIENRSTFLLAVKADVETQGDFVQSLATEVRASSFTDIEDLLAFVSWLDEELSFLVDERA 821

Query: 641 VLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERS 700
           VLKHFDWPE KAD LREAAF Y+DL KLE +++++ DDP L C+ ALKKM  L EK+E+S
Sbjct: 822 VLKHFDWPEGKADALREAAFEYQDLMKLEKQVTSFVDDPNLSCEPALKKMYKLLEKVEQS 881

Query: 701 SYNLLRMRESLMRNCKEFQIPTDWMLDYGIISKIKLGSVKLAKMYMKRIAMELQSKASSE 760
            Y LLR R+  +   KEF IP DW+ D G++ KIKL SV+LAK YMKR+A EL S + S+
Sbjct: 882 VYALLRTRDMAISRYKEFGIPVDWLSDTGVVGKIKLSSVQLAKKYMKRVAYELDSVSGSD 941

Query: 761 KDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKKSGN 791
           KDP  +++LLQGVRFAFR+HQFAGGFDAE+M AFE+LR+ A   +  + N
Sbjct: 942 KDPNREFLLLQGVRFAFRVHQFAGGFDAESMKAFEELRSRAKTESGDNNN 991

BLAST of LsiUNG001790 vs. TAIR10
Match: AT4G18570.1 (AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 330.9 bits (847), Expect = 2.4e-90
Identity = 190/361 (52.63%), Positives = 236/361 (65.37%), Query Frame = 1

Query: 450 NCHDTNRSFASLDVEKRALRIPNPPPRPSCSIS------SEPKEENTAHVPPPLP----- 509
           N  +   S +   V  R  R+P PPP+ S S+       ++P  + +   PPP P     
Sbjct: 262 NSEELTESSSLSTVRSRVPRVPKPPPKRSISLGDSTENRADPPPQKSIPPPPPPPPPPLL 321

Query: 510 ------------PPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRD---SRKESSNG 569
                       PPPPPPP P  ++  A+  V+R P+VVEFYHSLM+RD   SR++S+ G
Sbjct: 322 QQPPPPPSVSKAPPPPPPPPPPKSLSIASAKVRRVPEVVEFYHSLMRRDSTNSRRDSTGG 381

Query: 570 AICNVPDV---SNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIED 629
                  +   SN R  MIGEIENRS +LLAIK D+ETQG+F+  LI+EV NA +  IED
Sbjct: 382 GNAAAEAILANSNAR-DMIGEIENRSVYLLAIKTDVETQGDFIRFLIKEVGNAAFSDIED 441

Query: 630 IVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPR 689
           +V FVKWLDDEL +LVDERAVLKHF+WPE+KAD LREAAF Y DLKKL  E S +++DPR
Sbjct: 442 VVPFVKWLDDELSYLVDERAVLKHFEWPEQKADALREAAFCYFDLKKLISEASRFREDPR 501

Query: 690 LPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDYGIISKIKLGSVK 749
                ALKKM AL EK+E   Y+L RMRES     K FQIP DWML+ GI S+IKL SVK
Sbjct: 502 QSSSSALKKMQALFEKLEHGVYSLSRMRESAATKFKSFQIPVDWMLETGITSQIKLASVK 561

Query: 750 LAKMYMKRIAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNL 782
           LA  YMKR++ EL+  A     P  + +++QGVRFAFR+HQFAGGFDAETM AFE+LR+ 
Sbjct: 562 LAMKYMKRVSAELE--AIEGGGPEEEELIVQGVRFAFRVHQFAGGFDAETMKAFEELRDK 619

BLAST of LsiUNG001790 vs. TAIR10
Match: AT1G48280.1 (AT1G48280.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 296.6 bits (758), Expect = 5.0e-80
Identity = 155/314 (49.36%), Positives = 218/314 (69.43%), Query Frame = 1

Query: 469 RIPNPPPRPSCSISSEP----KEENTAHVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQV 528
           R+P  PP P   +S       ++EN++   PP PPPPPPPP P+   ++A    Q++P V
Sbjct: 228 RLPPTPPLPKFLVSPASSLGKRDENSSPFAPPTPPPPPPPPPPRPLAKAARA--QKSPPV 287

Query: 529 VEFYHSLMKRDSRKESSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVN 588
            + +  L K+D+ +  S     N   V++  +S++GEI+NRS+HL+AIKADIET+GEF+N
Sbjct: 288 SQLFQLLNKQDNSRNLSQSVNGNKSQVNSAHNSIVGEIQNRSAHLIAIKADIETKGEFIN 347

Query: 589 SLIREVNNAVYLKIEDIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRD 648
            LI++V    +  +ED+++FV WLD EL  L DERAVLKHF WPE+KADTL+EAA  YR+
Sbjct: 348 DLIQKVLTTCFSDMEDVMKFVDWLDKELATLADERAVLKHFKWPEKKADTLQEAAVEYRE 407

Query: 649 LKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDW 708
           LKKLE E+S+Y DDP +   +ALKKM  L +K E+    L+R+R S MR+ ++F+IP +W
Sbjct: 408 LKKLEKELSSYSDDPNIHYGVALKKMANLLDKSEQRIRRLVRLRGSSMRSYQDFKIPVEW 467

Query: 709 MLDYGIISKIKLGSVKLAKMYMKRIAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG 768
           MLD G+I KIK  S+KLAK YM R+A ELQS  + +++   + +LLQGVRFA+R HQFAG
Sbjct: 468 MLDSGMICKIKRASIKLAKTYMNRVANELQSARNLDRESTKEALLLQGVRFAYRTHQFAG 527

Query: 769 GFDAETMHAFEDLR 779
           G D ET+ A E+++
Sbjct: 528 GLDPETLCALEEIK 539

BLAST of LsiUNG001790 vs. TAIR10
Match: AT1G07120.1 (AT1G07120.1 FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 234.6 bits (597), Expect = 2.3e-61
Identity = 139/325 (42.77%), Positives = 198/325 (60.92%), Query Frame = 1

Query: 460 SLDVEKRALRIPNPPPRPSCSISSEPKEENTAHVPPPLPPPPPPPPLPKFAVRSATGMVQ 519
           S+    +   + NP P+P+    S   +            PPPPPPLP          V+
Sbjct: 83  SVKSNTKGQEVRNPNPKPTIQGQSTATK------------PPPPPPLPSKRTLGKRS-VR 142

Query: 520 RAPQVVEFYHSLMKRDS---RKESSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADI 579
           RAP+VVEFY +L KR+S    K + NG +           +MIGEIENRS +L  IK+D 
Sbjct: 143 RAPEVVEFYRALTKRESHMGNKINQNGVLSPA-----FNRNMIGEIENRSKYLSDIKSDT 202

Query: 580 ETQGEFVNSLIREVNNAVYLKIEDIVEFVKWLDDELCFLVDERAVLKHF-DWPERKADTL 639
           +   + ++ LI +V  A +  I ++  FVKW+D+EL  LVDERAVLKHF  WPERK D+L
Sbjct: 203 DRHRDHIHILISKVEAATFTDISEVETFVKWIDEELSSLVDERAVLKHFPKWPERKVDSL 262

Query: 640 REAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNC 699
           REAA  Y+  K L  EI ++KD+P+     AL+++ +L +++E S  N  +MR+S  +  
Sbjct: 263 REAACNYKRPKNLGNEILSFKDNPKDSLTQALQRIQSLQDRLEESVNNTEKMRDSTGKRY 322

Query: 700 KEFQIPTDWMLDYGIISKIKLGSVKLAKMYMKRIAMELQSKASSEKDPAMDYMLLQGVRF 759
           K+FQIP +WMLD G+I ++K  S++LA+ YMKRIA EL+S  S ++      ++LQGVRF
Sbjct: 323 KDFQIPWEWMLDTGLIGQLKYSSLRLAQEYMKRIAKELESNGSGKE----GNLMLQGVRF 382

Query: 760 AFRIHQFAGGFDAETMHAFEDLRNL 781
           A+ IHQFAGGFD ET+  F +L+ +
Sbjct: 383 AYTIHQFAGGFDGETLSIFHELKKI 385

BLAST of LsiUNG001790 vs. TAIR10
Match: AT1G52080.1 (AT1G52080.1 actin binding protein family)

HSP 1 Score: 62.4 bits (150), Expect = 1.6e-09
Identity = 77/282 (27.30%), Positives = 135/282 (47.87%), Query Frame = 1

Query: 129 VELDQLRSLLNESKQREFELQNELAEFKRNTRNYELERELEEKKAELDGLTQKVSVLEED 188
           ++L+Q+ + +   K ++ + +NE    K     +E  + L     ELD    +V VL++ 
Sbjct: 182 LKLNQMETKVFNFKIKKLQAENE----KLKAECFEHSKVL----LELDMAKSQVQVLKKK 241

Query: 189 RRVLSEQLVA-LSSIPEK----QEEPQTAPV-------------NVEVEVVELRRLNKEL 248
             + ++Q VA + S+ ++    QEE   A +             ++E E+ EL   N  L
Sbjct: 242 LNINTQQHVAQILSLKQRVARLQEEEIKAVLPDLEADKMMQRLRDLESEINELTDTNTRL 301

Query: 249 QLQKRNLACRLSSVESELACLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLN 308
           Q +   L+ +L SV+  +   +K  E E I  ++ + + LR  NE+L K VE LQ  R  
Sbjct: 302 QFENFELSEKLESVQ--IIANSKLEEPEEIETLREDCNRLRSENEELKKDVEQLQGDRCT 361

Query: 309 EVEELAYLRWVNSCLRSELRNSCPSANS------GSPSSPQPIERSSESVGSLSSQKENM 368
           ++E+L YLRW+N+CLR ELR   P A         +  SP   E++ + +   +  ++N 
Sbjct: 362 DLEQLVYLRWINACLRYELRTYQPPAGKTVARDLSTTLSPTSEEKAKQLILEYAHSEDNT 421

Query: 369 EYSSAKRINLIKKLKKWPITDEDLSNLDRSDNSLLDKNWVDT 387
           +Y             +W  + E+ S +  +D+  LD + VDT
Sbjct: 422 DYD------------RWSSSQEESSMI--TDSMFLDDSSVDT 439

BLAST of LsiUNG001790 vs. NCBI nr
Match: gi|449433527|ref|XP_004134549.1| (PREDICTED: protein CHUP1, chloroplastic-like [Cucumis sativus])

HSP 1 Score: 1442.6 bits (3733), Expect = 0.0e+00
Identity = 747/787 (94.92%), Positives = 762/787 (96.82%), Query Frame = 1

Query: 1   MKEDNPSENGGKPSRFADQNQNPKCLNQNNAKGTTGNSSKLRAASSWGSHIVKGFSTDKR 60
           MKEDNP E  GKPSRFADQNQNPKCLNQNNAKG+TGN SKLRAASSWGSHIVKGFSTDKR
Sbjct: 1   MKEDNPLEIRGKPSRFADQNQNPKCLNQNNAKGSTGNGSKLRAASSWGSHIVKGFSTDKR 60

Query: 61  TKAHSNLQPKKAPPLGNSDLANQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120
           TKA SNLQPKKAPPLGNSDL NQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR
Sbjct: 61  TKAQSNLQPKKAPPLGNSDLVNQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120

Query: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAEFKRNTRNYELERELEEKKAELDGLTQ 180
           RQSSRDLFVELDQLRSLLNESKQREFELQNELAE KRNTRNYELERELEEKK ELD L +
Sbjct: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKVELDSLAK 180

Query: 181 KVSVLEEDRRVLSEQLVALSSIPEKQEEPQTAPVNVEVEVVELRRLNKELQLQKRNLACR 240
           KVSVLEEDRR LSEQLV L S+ EKQEE QTAP NVEVEVVELRRLNKELQLQKRNLACR
Sbjct: 181 KVSVLEEDRRALSEQLVTLPSVSEKQEEQQTAPGNVEVEVVELRRLNKELQLQKRNLACR 240

Query: 241 LSSVESELACLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRW 300
           LSSVESELACLAKNSESEA+AKIKAE SLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRW
Sbjct: 241 LSSVESELACLAKNSESEAVAKIKAEVSLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRW 300

Query: 301 VNSCLRSELRNSCPSANSGSPSSPQPIERSSESVGSLSSQKENMEYSSAKRINLIKKLKK 360
           VNSCLRSELRNS PSANSGSPSSPQP+ERSSE++GSLSSQKE MEYSSAKRINLIKKLKK
Sbjct: 301 VNSCLRSELRNSSPSANSGSPSSPQPVERSSEAIGSLSSQKEYMEYSSAKRINLIKKLKK 360

Query: 361 WPITDEDLSNLDRSDNSLLDKNWVDTEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGFI 420
           WPITDEDLSNLD SDN+LLDKNWVDTEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGF+
Sbjct: 361 WPITDEDLSNLDCSDNNLLDKNWVDTEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGFM 420

Query: 421 CAKEMEKEADPLSSQKYDLGVIQRPHVLGNCHDTNRSFASLDVEKRALRIPNPPPRPSCS 480
           CAKEMEK+ DPLSSQKYDLGVIQRPHVLGNCH+TNR+FASLDVEKRALRIPNPPPRPSCS
Sbjct: 421 CAKEMEKDVDPLSSQKYDLGVIQRPHVLGNCHETNRNFASLDVEKRALRIPNPPPRPSCS 480

Query: 481 ISSEPKEENTAHVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRKES 540
           ISSEPKEEN A VPPPLPPPPPPPPLPKF+VRSATGMVQRAPQVVEFYHSLMKRDSRK+S
Sbjct: 481 ISSEPKEENRAQVPPPLPPPPPPPPLPKFSVRSATGMVQRAPQVVEFYHSLMKRDSRKDS 540

Query: 541 SNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIED 600
           SNG ICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIED
Sbjct: 541 SNGTICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIED 600

Query: 601 IVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPR 660
           IVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPR
Sbjct: 601 IVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPR 660

Query: 661 LPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDYGIISKIKLGSVK 720
           LPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLD GIISKIKLGSVK
Sbjct: 661 LPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDNGIISKIKLGSVK 720

Query: 721 LAKMYMKRIAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNL 780
           LAKMYMKR+AMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNL
Sbjct: 721 LAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNL 780

Query: 781 ANLLNKK 788
           ANLLNKK
Sbjct: 781 ANLLNKK 787

BLAST of LsiUNG001790 vs. NCBI nr
Match: gi|659078023|ref|XP_008439508.1| (PREDICTED: protein CHUP1, chloroplastic-like [Cucumis melo])

HSP 1 Score: 1428.3 bits (3696), Expect = 0.0e+00
Identity = 744/788 (94.42%), Positives = 758/788 (96.19%), Query Frame = 1

Query: 1   MKEDNPSENGGKPSRFADQNQNPKCLNQNNAKGTTGNSSKLRAASSWGSHIVKGFSTDKR 60
           MKEDNP E  GKPSRFADQNQNPKCLNQNNAKG++GN SKLRAASSWGSHIVKGFSTDKR
Sbjct: 1   MKEDNPLEIRGKPSRFADQNQNPKCLNQNNAKGSSGNGSKLRAASSWGSHIVKGFSTDKR 60

Query: 61  TKAHSNLQPKKAPPLGNSDLANQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120
            K  SNLQPKKAPPLGNSDL NQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR
Sbjct: 61  AKTQSNLQPKKAPPLGNSDLVNQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120

Query: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAEFKRNTRNYELERELEEKKAELDGLTQ 180
           RQSSRDLFVELDQLRSLLNESKQREFELQNELAE KRNTRNYELERELEEKK ELD L +
Sbjct: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKVELDSLAK 180

Query: 181 KVSVLEEDRRVLSEQLVALSSIPEKQEEPQTAPVNVEVEVVELRRLNKELQLQKRNLACR 240
           KVSVLEEDRR LSEQLV LSS+ EKQEE QTAP NVEVEVVELRRLNKELQLQKRNLACR
Sbjct: 181 KVSVLEEDRRALSEQLVTLSSVSEKQEEQQTAPGNVEVEVVELRRLNKELQLQKRNLACR 240

Query: 241 LSSVESELACLAKN-SESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLR 300
           LSSVESELACLAKN SESEA+AK+KAE SLLRHTNEDLCKQVEGLQMSRLNEVEELAYLR
Sbjct: 241 LSSVESELACLAKNNSESEAVAKVKAEVSLLRHTNEDLCKQVEGLQMSRLNEVEELAYLR 300

Query: 301 WVNSCLRSELRNSCPSANSGSPSSPQPIERSSESVGSLSSQKENMEYSSAKRINLIKKLK 360
           WVNSCLRSELRNSCPSANSGSPSSPQP+ERSSE V SLSSQKE MEYSSAKRINLIKKLK
Sbjct: 301 WVNSCLRSELRNSCPSANSGSPSSPQPVERSSEPVCSLSSQKEYMEYSSAKRINLIKKLK 360

Query: 361 KWPITDEDLSNLDRSDNSLLDKNWVDTEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGF 420
           KWPITDEDLSNLD SDN+LLDK WVDTEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGF
Sbjct: 361 KWPITDEDLSNLDCSDNTLLDKKWVDTEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGF 420

Query: 421 ICAKEMEKEADPLSSQKYDLGVIQRPHVLGNCHDTNRSFASLDVEKRALRIPNPPPRPSC 480
           +CAKEMEK+ DPLSSQKYDLGVIQRPHVLGN H+TNR+FASLDVEKRALRIPNPPPRPSC
Sbjct: 421 MCAKEMEKDVDPLSSQKYDLGVIQRPHVLGNFHETNRNFASLDVEKRALRIPNPPPRPSC 480

Query: 481 SISSEPKEENTAHVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRKE 540
           SISSEPKEEN A VPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRK+
Sbjct: 481 SISSEPKEENRAQVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRKD 540

Query: 541 SSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIE 600
           SSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIE
Sbjct: 541 SSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIE 600

Query: 601 DIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDP 660
           DIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDP
Sbjct: 601 DIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDP 660

Query: 661 RLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDYGIISKIKLGSV 720
           RLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLD GIISKIKLGSV
Sbjct: 661 RLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDSGIISKIKLGSV 720

Query: 721 KLAKMYMKRIAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRN 780
           KLAKMYMKR+A ELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRN
Sbjct: 721 KLAKMYMKRVATELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRN 780

Query: 781 LANLLNKK 788
           LANLLNKK
Sbjct: 781 LANLLNKK 788

BLAST of LsiUNG001790 vs. NCBI nr
Match: gi|645238462|ref|XP_008225690.1| (PREDICTED: protein CHUP1, chloroplastic-like [Prunus mume])

HSP 1 Score: 1075.8 bits (2781), Expect = 0.0e+00
Identity = 584/803 (72.73%), Positives = 675/803 (84.06%), Query Frame = 1

Query: 1   MKEDNPSENGGKPSRFADQNQNPKCLNQNNAKGTTGNSSKLRAASSWGSHIVKGFSTDKR 60
           M+E+NPSE+  +  +F+DQNQ PKC    N KG + N+SKLR+ASSWGSHIVKG + DK+
Sbjct: 1   MREENPSESRARSIKFSDQNQIPKC---QNVKGNS-NASKLRSASSWGSHIVKGLAGDKK 60

Query: 61  TKAHSNLQPKKAPPLGNSDLANQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120
           TK    +  KK PPL  SD+ANQK  FVPSH R+KRS+IGDL+CS N  QVHPQ + THR
Sbjct: 61  TKVQPTVTNKK-PPLMGSDMANQKNSFVPSHPRVKRSLIGDLSCSVNGNQVHPQMHPTHR 120

Query: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAEFKRNTRNYELERELEEKKAELDGLTQ 180
           RQSSRDLF+ELD LR+LL ESK+REF+LQ EL+E KRN +  +LERELE K+ ELDGL +
Sbjct: 121 RQSSRDLFIELDHLRNLLRESKEREFQLQAELSECKRNPKVLDLERELEVKRIELDGLAR 180

Query: 181 KVSVLEEDRRVLSEQLVALSSIPE-------KQEEPQTAPVN----VEVEVVELRRLNKE 240
           KV +LEE++  LSEQL AL+SI +       K+EE +++  +    VE+EVVELRRLNKE
Sbjct: 181 KVELLEEEKTSLSEQLSALTSILDRNEGVTLKKEEQESSAASASGSVEMEVVELRRLNKE 240

Query: 241 LQLQKRNLACRLSSVESELACLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRL 300
           LQLQKRNLAC+LSSV S+LA LAK SES+ + KIKAEAS LRHTNEDLCKQVEGLQMSRL
Sbjct: 241 LQLQKRNLACKLSSVTSQLASLAKASESDIVEKIKAEASALRHTNEDLCKQVEGLQMSRL 300

Query: 301 NEVEELAYLRWVNSCLRSELRNS--CPSANSGSPSSPQPIERSSESVGSLSSQK-ENMEY 360
           NEVEELAYLRWVNSCLR+EL+NS  C + NS  P SP   ERSS+S G+L S+  E +EY
Sbjct: 301 NEVEELAYLRWVNSCLRNELQNSNSCSTTNSDKPLSPGSFERSSKSAGALPSRSSEYLEY 360

Query: 361 SSAKRINLIKKLKKWPITDEDLSNLDRSDNSLLDKNWVDTEEGRSPRRRHSISGAKCWPE 420
            S KR+NLIKKLKKWPI D+DL NL+  D  LLDK+WVD+EEGRSPRRRHSISG+KC  E
Sbjct: 361 GSIKRLNLIKKLKKWPIADDDLPNLECPDG-LLDKSWVDSEEGRSPRRRHSISGSKCCAE 420

Query: 421 EL-EPNKRRQSDGFICAKEMEKEADPLSSQKYDLGVIQRPHVLGNCHDTNRSFASLDVEK 480
           EL + NKRRQSDGF+CA+EMEK+ +P++S+ +DL         GNCH+ N+  ASLDVEK
Sbjct: 421 ELVQSNKRRQSDGFMCAQEMEKDTEPVASENFDL-------FFGNCHEINKIPASLDVEK 480

Query: 481 RALRIPNPPPRPSCSISSEPKEENTAHVPPPLPPPPPPPPLPKFAVR-SATGMVQRAPQV 540
           RALRIPNPPPRPSCSISS  K + +A VPPP PPPPPPPP PKFA++ S+TGMVQRAPQV
Sbjct: 481 RALRIPNPPPRPSCSISSGTKVDGSAQVPPP-PPPPPPPPPPKFAMKTSSTGMVQRAPQV 540

Query: 541 VEFYHSLMKRDSRKESSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVN 600
           VEFYHSLMKRDSRK+SSNG +C+ PDV+NVRSSMIGEIENRSSHLLAIKAD+ETQGEFVN
Sbjct: 541 VEFYHSLMKRDSRKDSSNGGVCDGPDVANVRSSMIGEIENRSSHLLAIKADVETQGEFVN 600

Query: 601 SLIREVNNAVYLKIEDIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRD 660
           SLIREVNNAVY  I+D+V FVKWLDDELCFLVDERAVLKHFDWPE+KADTLREAAFGYRD
Sbjct: 601 SLIREVNNAVYQNIDDVVAFVKWLDDELCFLVDERAVLKHFDWPEKKADTLREAAFGYRD 660

Query: 661 LKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDW 720
           LKKLE E+S+YK+D RLPCDIALKKMV+LSEKMER+ YNLLR RE LMR+CKEFQIPTDW
Sbjct: 661 LKKLESEVSSYKEDIRLPCDIALKKMVSLSEKMERTVYNLLRTREPLMRHCKEFQIPTDW 720

Query: 721 MLDYGIISKIKLGSVKLAKMYMKRIAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG 780
           MLD GI+SKIK GSVKLAKMYMKR+AMELQSKA++EKDPAMDYMLLQGVRFAFRIHQFAG
Sbjct: 721 MLDNGILSKIKFGSVKLAKMYMKRVAMELQSKAAAEKDPAMDYMLLQGVRFAFRIHQFAG 780

Query: 781 GFDAETMHAFEDLRNLANLLNKK 788
           GFDA+TMHAFE+LR LA+LLNKK
Sbjct: 781 GFDADTMHAFEELRYLAHLLNKK 789

BLAST of LsiUNG001790 vs. NCBI nr
Match: gi|595893770|ref|XP_007213645.1| (hypothetical protein PRUPE_ppa001630mg [Prunus persica])

HSP 1 Score: 1075.5 bits (2780), Expect = 0.0e+00
Identity = 586/803 (72.98%), Positives = 672/803 (83.69%), Query Frame = 1

Query: 1   MKEDNPSENGGKPSRFADQNQNPKCLNQNNAKGTTGNSSKLRAASSWGSHIVKGFSTDKR 60
           M+E+NPSE+  +  +F+DQNQ PKC    N KG + N+SKLR+ASSWGSHIVKG + DK+
Sbjct: 1   MREENPSESRARSIKFSDQNQIPKC---QNVKGNS-NASKLRSASSWGSHIVKGLAGDKK 60

Query: 61  TKAHSNLQPKKAPPLGNSDLANQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120
           TK    +  KK PPL  SD+ANQK  FVPSH R+KRS+IGDL+CS N  QVHPQ + THR
Sbjct: 61  TKVQPIVTNKK-PPLMGSDMANQKNSFVPSHPRVKRSLIGDLSCSVNGNQVHPQMHPTHR 120

Query: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAEFKRNTRNYELERELEEKKAELDGLTQ 180
           RQSSRDLF+ELD LR+LL ESK+REF+LQ EL+E KRN +  +LERELE K+ ELDGL +
Sbjct: 121 RQSSRDLFIELDHLRNLLRESKEREFQLQAELSECKRNPKVLDLERELEVKRIELDGLAR 180

Query: 181 KVSVLEEDRRVLSEQLVALSSIPEKQE------EPQTAPV-----NVEVEVVELRRLNKE 240
           KV +LEE++  LSEQL AL+SI ++ E      E Q + V     +VE+EVVELRRLNKE
Sbjct: 181 KVELLEEEKTSLSEQLSALTSILDRNEGVTLKKEEQESSVASASGSVEMEVVELRRLNKE 240

Query: 241 LQLQKRNLACRLSSVESELACLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRL 300
           LQLQKRNLAC+LSSV S+LA LAK SES+ + KIKAEAS LRHTNEDLCKQVEGLQMSRL
Sbjct: 241 LQLQKRNLACKLSSVTSQLASLAKASESDIVEKIKAEASALRHTNEDLCKQVEGLQMSRL 300

Query: 301 NEVEELAYLRWVNSCLRSELRNS--CPSANSGSPSSPQPIERSSESVGSLSSQK-ENMEY 360
           NEVEELAYLRWVNSCLR+EL+NS  C + NS  P SP   ERSS+S G+L S+  E +EY
Sbjct: 301 NEVEELAYLRWVNSCLRNELQNSNSCSTTNSDKPLSPGSFERSSKSAGALPSRSSEYLEY 360

Query: 361 SSAKRINLIKKLKKWPITDEDLSNLDRSDNSLLDKNWVDTEEGRSPRRRHSISGAKCWPE 420
            S KR+NLIKKLKKWPI DEDL NL+  D  LLDK+WVD+EEGRSPRRRHSISG+KC  E
Sbjct: 361 GSVKRLNLIKKLKKWPIADEDLPNLECPDG-LLDKSWVDSEEGRSPRRRHSISGSKCCAE 420

Query: 421 EL-EPNKRRQSDGFICAKEMEKEADPLSSQKYDLGVIQRPHVLGNCHDTNRSFASLDVEK 480
           EL + NKRRQSDGF+CA+EMEK+ +P++S+ +DL         GNCH+ N+  ASLDVEK
Sbjct: 421 ELVQSNKRRQSDGFMCAQEMEKDTEPVASENFDL-------FFGNCHEINKIPASLDVEK 480

Query: 481 RALRIPNPPPRPSCSISSEPKEENTAHVPPPLPPPPPPPPLPKFAVR-SATGMVQRAPQV 540
           RALRIPNPPPRPSCSIS   K + +A VPPP PPPPPPPP PKFA++ S TGMVQRAPQV
Sbjct: 481 RALRIPNPPPRPSCSISRGTKVDGSAQVPPP-PPPPPPPPPPKFAMKTSTTGMVQRAPQV 540

Query: 541 VEFYHSLMKRDSRKESSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVN 600
           VEFYHSLMKRDSRK+SSNG +C+ PDV+NVRSSMIGEIENRSSHLLAIKAD+ETQGEFVN
Sbjct: 541 VEFYHSLMKRDSRKDSSNGGVCDGPDVANVRSSMIGEIENRSSHLLAIKADVETQGEFVN 600

Query: 601 SLIREVNNAVYLKIEDIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRD 660
           SLIREVNNAVY  I+D+V FVKWLDDELCFLVDERAVLKHFDWPE+KADTLREAAFGYRD
Sbjct: 601 SLIREVNNAVYQNIDDVVAFVKWLDDELCFLVDERAVLKHFDWPEKKADTLREAAFGYRD 660

Query: 661 LKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDW 720
           LKKLE E+S+YK+D RLPCDIALKKMVALSEKMER+ YNLLR RE LMR+CKEFQIPTDW
Sbjct: 661 LKKLESEVSSYKEDIRLPCDIALKKMVALSEKMERTVYNLLRTREPLMRHCKEFQIPTDW 720

Query: 721 MLDYGIISKIKLGSVKLAKMYMKRIAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG 780
           MLD GI+SKIK GSVKLAKMYMKR+AMELQSKA++EKDPAMDYMLLQGVRFAFRIHQFAG
Sbjct: 721 MLDNGILSKIKFGSVKLAKMYMKRVAMELQSKAAAEKDPAMDYMLLQGVRFAFRIHQFAG 780

Query: 781 GFDAETMHAFEDLRNLANLLNKK 788
           GFDA+TMHAFE+LR LA+LLNKK
Sbjct: 781 GFDADTMHAFEELRYLAHLLNKK 789

BLAST of LsiUNG001790 vs. NCBI nr
Match: gi|225444169|ref|XP_002268607.1| (PREDICTED: protein CHUP1, chloroplastic-like [Vitis vinifera])

HSP 1 Score: 1068.5 bits (2762), Expect = 5.9e-309
Identity = 575/798 (72.06%), Positives = 656/798 (82.21%), Query Frame = 1

Query: 1   MKEDNPSENGGKPSRFADQNQNPKCLNQNNAKGTTGNSSKLRAASSWGSHIVKGFSTDKR 60
           M+E+NPSEN  K  +FADQNQ            T+ N S+LR+ASSWGSHIVKGFS DK+
Sbjct: 1   MREENPSENRVKSLKFADQNQGK----------TSSNPSRLRSASSWGSHIVKGFSADKK 60

Query: 61  TKAHSNLQPKKAPPLGNSDLANQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120
            K  +    KK P L +SD+ NQK   V SHSR+KRS+IGDL+CS N +QVHPQ Y ++R
Sbjct: 61  NKLQTTAAAKKVP-LTSSDIVNQKNPLVTSHSRVKRSLIGDLSCSVNASQVHPQLYNSNR 120

Query: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAEFKRNTRNYELERELEEKKAELDGLTQ 180
            +SSRDLF+ELD LRSLL ESK+REF+LQ EL EFKRN    +LERELE KK+E++ L+Q
Sbjct: 121 TKSSRDLFLELDHLRSLLQESKEREFKLQAELLEFKRNPEILDLERELEVKKSEVNELSQ 180

Query: 181 KVSVLEEDRRVLSEQLVALSSIPEKQEEP--------QTAPVN--VEVEVVELRRLNKEL 240
           KV +LE ++  LSEQL  L+SI E++EE          +AP    +E+EVVELRRLNKEL
Sbjct: 181 KVRLLESEKTSLSEQLSGLASIAERREELLKREDLEISSAPSQRTLEMEVVELRRLNKEL 240

Query: 241 QLQKRNLACRLSSVESELACLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLN 300
           QLQKR+L+CRLSS ES+L  L+K  ES+ +A I+AEASLLRHTNEDLCKQVE LQMSRLN
Sbjct: 241 QLQKRDLSCRLSSTESQLNTLSKVYESDTVANIQAEASLLRHTNEDLCKQVEDLQMSRLN 300

Query: 301 EVEELAYLRWVNSCLRSELRNSCPSANSGSPSSPQPIERSSESVGSLSSQKEN-MEYSSA 360
           EVEEL YLRWVNSCLR+ELRNSC   NS   SSP  IE S ES  S S Q ++ +EYSS 
Sbjct: 301 EVEELVYLRWVNSCLRNELRNSCSVTNSDKTSSPNSIEGSRESDSSFSCQTDDSLEYSSI 360

Query: 361 KRINLIKKLKKWPITDEDLSNLDRSDNSLLDKNWVDTEEGRSPRRRHSISGAKCWPEELE 420
           KR+NLIKKLKKWPI  EDL NLD  DN LL+K+WVD EEGRSPRRRHSISG+KC  E+L 
Sbjct: 361 KRLNLIKKLKKWPIISEDLPNLDCPDN-LLEKSWVDPEEGRSPRRRHSISGSKCSAEDLV 420

Query: 421 PNKRRQSDGFICAKEMEKEADPLSSQKYDLGVIQRPHVLGNCHDTNRSFASLDVEKRALR 480
            +KRRQSDGF+C KEMEKEA+PL SQKY+LG++Q+P + GNC +T +  ASLDVEKRALR
Sbjct: 421 QSKRRQSDGFMCPKEMEKEAEPLVSQKYELGIVQKPQLWGNCQETGKFMASLDVEKRALR 480

Query: 481 IPNPPPRPSCSISSEPKEENTAHVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYH 540
           IPNPPPRPS ++SS PKE   A +PPP PPPPPPPP PKF+ RS TG+VQRAPQVVEFYH
Sbjct: 481 IPNPPPRPSGALSSGPKEMVLAQIPPPPPPPPPPPPPPKFSARSTTGIVQRAPQVVEFYH 540

Query: 541 SLMKRDSRKESSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIRE 600
           SLMKRDSRK+SSNG I + PDV+NVRS+MIGEIENRSS+LLAIKAD+ETQGEFVNSLIRE
Sbjct: 541 SLMKRDSRKDSSNGGIYDTPDVANVRSNMIGEIENRSSYLLAIKADVETQGEFVNSLIRE 600

Query: 601 VNNAVYLKIEDIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLE 660
           VNNAVY  IED+V FVKWLDDELCFLVDERAVLKHFDWPE+KADTLREAAFGYRDLKKLE
Sbjct: 601 VNNAVYQNIEDVVAFVKWLDDELCFLVDERAVLKHFDWPEKKADTLREAAFGYRDLKKLE 660

Query: 661 CEISAYKDDPRLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDYG 720
            E+S YKDDPR+PCDIALKKMVALSEKMERS YNL R RESLMRNCKEFQIPTDWMLD G
Sbjct: 661 SEVSYYKDDPRVPCDIALKKMVALSEKMERSVYNLFRTRESLMRNCKEFQIPTDWMLDNG 720

Query: 721 IISKIKLGSVKLAKMYMKRIAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAE 780
           II+KIK GSVKLAK YM+R+AMELQSK + EKDPAMDYMLLQGVRFAFRIHQFAGGFD E
Sbjct: 721 IINKIKFGSVKLAKKYMRRVAMELQSKGAFEKDPAMDYMLLQGVRFAFRIHQFAGGFDVE 780

Query: 781 TMHAFEDLRNLANLLNKK 788
           TMHAFE+LRNLA+LLNKK
Sbjct: 781 TMHAFEELRNLAHLLNKK 786

The following BLAST results are available for this feature:
Match Name         E-value    Identity  Description
CHUP1_ARATH        1.1e-92    54.29     Protein CHUP1, chloroplastic OS=Arabidopsis thaliana GN=CHUP1 PE=1 SV=1

Match Name         E-value    Identity  Description
A0A0A0KMA9_CUCSA   0.0e+00    94.92     Uncharacterized protein OS=Cucumis sativus GN=Csa_6G526260 PE=4 SV=1
M5WMB7_PRUPE       0.0e+00    72.98     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001630mg PE=4 SV=1
F6HM16_VITVI       4.1e-309   72.06     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_10s0003g05680 PE=4 SV=...
W9RI54_9ROSA       1.9e-306   72.74     Uncharacterized protein OS=Morus notabilis GN=L484_025949 PE=4 SV=1
A0A061G7Q8_THECC   3.5e-300   70.64     Hydroxyproline-rich glycoprotein family protein isoform 1 OS=Theobroma cacao GN=...

Match Name         E-value    Identity  Description
AT3G25690.1        6.1e-94    54.29     Hydroxyproline-rich glycoprotein family protein
AT4G18570.1        2.4e-90    52.63     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G48280.1        5.0e-80    49.36     hydroxyproline-rich glycoprotein family protein
AT1G07120.1        2.3e-61    42.77     FUNCTIONS IN: molecular_function unknown
AT1G52080.1        1.6e-09    27.30     actin binding protein family

Match Name                         E-value    Identity  Description
gi|449433527|ref|XP_004134549.1|   0.0e+00    94.92     PREDICTED: protein CHUP1, chloroplastic-like [Cucumis sativus]
gi|659078023|ref|XP_008439508.1|   0.0e+00    94.42     PREDICTED: protein CHUP1, chloroplastic-like [Cucumis melo]
gi|645238462|ref|XP_008225690.1|   0.0e+00    72.73     PREDICTED: protein CHUP1, chloroplastic-like [Prunus mume]
gi|595893770|ref|XP_007213645.1|   0.0e+00    72.98     hypothetical protein PRUPE_ppa001630mg [Prunus persica]
gi|225444169|ref|XP_002268607.1|   5.9e-309   72.06     PREDICTED: protein CHUP1, chloroplastic-like [Vitis vinifera]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009658 chloroplast organization
biological_process GO:0008150 biological_process
cellular_component GO:0009707 chloroplast outer membrane
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
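
Three of the five GO rows above are ontology root terms (the term name equals the category), so they carry no information beyond the category itself. The following is a small sketch, not part of the original record, that splits each row into category, accession, and term name and keeps only the informative annotations; the names `GO_ROWS` and `parse_go_rows` are illustrative assumptions.

```python
# The GO assignment rows copied from the table above.
GO_ROWS = """\
biological_process GO:0009658 chloroplast organization
biological_process GO:0008150 biological_process
cellular_component GO:0009707 chloroplast outer membrane
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
"""


def parse_go_rows(text):
    """Split each row into category, accession, and term name (hypothetical helper)."""
    records = []
    for line in text.splitlines():
        if not line.strip():
            continue
        # The term name may contain spaces, so split at most twice.
        category, accession, name = line.split(None, 2)
        records.append(
            {"category": category, "accession": accession, "name": name}
        )
    return records


records = parse_go_rows(GO_ROWS)

# A root term repeats its category as the term name; drop those rows.
informative = [r for r in records if r["name"] != r["category"]]

for r in informative:
    print(r["category"], r["accession"], r["name"])
```

With the rows listed here, only the chloroplast organization and chloroplast outer membrane annotations remain after filtering.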

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
LsiUNG001790.1  LsiUNG001790.1  mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term  IPR Description   Source   Source Term     Source Description   Alignment
None      No IPR available  unknown  Coil            Coil                 coord: 131..195, score: -; coord: 216..247, score: -; coord: 859..860, score: ...
None      No IPR available  PANTHER  PTHR31342       FAMILY NOT NAMED     coord: 454..796, score: 0.0; coord: 12..392, score: ...
None      No IPR available  PANTHER  PTHR31342:SF11  SUBFAMILY NOT NAMED  coord: 454..796, score: 0.0; coord: 12..392, score: ...