Csa7G009210 (gene) Cucumber (Chinese Long) v2

Name: Csa7G009210
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Putative neutral invertase; contains IPR008928 (Six-hairpin glycosidase-like), IPR024746 (Glycosyl hydrolase family 100)
Location: Chr7 : 596427 .. 599827 (-)
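The Location field can be turned into a feature length with a few lines of code. The sketch below assumes the coordinates are 1-based and inclusive, as is usual for GFF-style annotation; the page itself does not state this. The function names `parse_location` and `span_length` are illustrative and not part of any database API.

```python
import re


def parse_location(text):
    """Parse a string such as 'Chr7 : 596427 .. 599827 (-)'.

    Returns (seqid, start, end, strand) with integer coordinates.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of a 1-based, inclusive interval (an assumption, see above)."""
    return end - start + 1


seqid, start, end, strand = parse_location("Chr7 : 596427 .. 599827 (-)")
print(seqid, strand, span_length(start, end))  # Chr7 - 3401
```

Under that assumption the gene spans 3401 bp on the minus strand of Chr7.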
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGTCGAATTCCTCTTCGAACATGCCCCAGAATGGGAATGTTAAAAACAATGATACGTTATTCACAGTGGATGAGATTGAAGAAAGTGAGTTTTCGAAGCTATTGGATAGGCCAAGGCCGTTGAACATGGAGAGACAGAGGTCGTTCGATGAAAGATCGCTTGGGGATTTGGCGATTGGTTTTTCTCCACGGTTATCATCGAGAGTTTCTTCAGAAAACTTTGGTCGATTGAGTGATAATTACGATCATTCTCCATCGCCGGGTAGAAAATCGGATTTCAATACTCCAAGATCACACACTGGATTCGAGCAGCATCCTATGGTGGCAGAAGCTTGGGAAGCTTTGAGGCGTTCGTTGGTCTATTTCCGTGGCCAGCCAGTTGGTACTATTGCGGCGTTGGACAGTACTGAAGAAAATCTTAACTATGATCAAGTAAGAACGAACACTTTATGAATAGTTGCTTACTTGCATTCATTGATCAATCCCTGTTTTGTTGTGTTTATAATAATGCAGAGTAGTGTCTATGTTCTTGCTTGTATTCTATTGCATTGAGAATCAATTTGTCATTCAGAAAACATTTCATCTGTTGAAACTTTGAAACATGATATCTCTATAGATTCTCACAACAATAATGATACTCACAGGAGTACTTCCGTTAATAGTATTTCGAACGTTCGTTTTCTGTTTACTCTTTGAAACTTAGTGAGCATACGTTGACCACTTGTTGAACGCCGTAACTCATGTCATACACTGATGTACTTTTCTAGAGTAAGTAAATTTAGATGTTGAAGCATTATTGACTTGATATTTCCTTTGCATTAATGTGTGAAACTATTTGAGTTGGTTTGAATCATAAATGCACAAACATTGTATCTTTTGAGATAAAACGTTGTATCATAAATGCGTAAACATCATCAACTAAAAGGGGATTTTTGCATGAAAACCACAGAAATGTATAAATATTTAGAAATGAACTTATGTATACCAAGTATCAGTATGTGCGCTAATGACTTAATTTTTAGTTAAGGGGTTTTCATTGATCTGAACCGAGAGCAAAAGCTAACATATGGATTTACTTCTTCTGTTTGCATAGGTGTTTGTAAGAGACTTCGTCCCAAGTGCGTTTGCTTTTCTAATGAATGGGGAGCCTGAAATAGTAAAGAACTTTATCCTAAAAACTCTTCGGCTTCAGTCATGGGAGAAAAAGATTGATAGATTCCAGCTTGGAGAAGGGGTGATGCCTGCTAGTTTCAAAGTTCTTCATGATCCAGTGAGGAACACTGAAACTTTAATTGCAGATTTTGGAGAGAGTGCAATAGGAAGAGTTGCTCCTGTTGACTCTGGATTTTGGTGGATTATACTGCTTAGAGCATACACAAAGTCCACTGGTGACAGTTCATTGGCTGAACTACCAGAATGCCAAAAGGGAATGCGCCTTATTTTGAGTCTGTGTCTTTCAGAAGGTTTTGACACATTCCCAACCCTTCTCTGTGCCGATGGATGCTGCATGATTGATCGACGAATGGTGGGTTCTTTCAACCTTTCCCTTTTAATATTTAATAATGAAACTAACATGAAGCTTGACTTCTGAAATTCACTCGGGAGACCATTTGAACTGTTAATAGAATGATCTAATCATAGTTCAAATCCTCTCCTCTTTTGGTTCTGAACTTTCAGATTTCTCTGATTCATTTTAGTACTTTCAAAAGGCCTATTTAATTTTTAGATTTTAGATAGAGCGATCGTTGTCTTTAAAAAAAGTAGCTATTTAGGTTATTGCTATTATTTTTGTCCCCATTTATATAAAACAACTACAGCTTTTTAGAATTCAGGAAAACAAAATAGAGTAACTTTGAATGTTTAAAAGTCAAAATAAAATTTAAACCAAAATTATCTACATTGAAAATTCAGGAAAACAAAGTAGAGTAACTTTGAACGTTTAAAAATCAAAATAAGATTTAAACCAAAATTATTTACATCGTCTTGCGGCCTCCGTTTGA
TGTACATTTTTTTTTTCTTACTTTTTGGGAGACAAAAGGTTTCACTACAATATTAGGTTTGCCTAACGAAATGAGCTGTATGTACTGTGCAGTCTTGAAATCATTATTATTATTTATTCATTTCTTCATTCTTACTGCTTTTACTATTGGTTTCAGGGTGTATATGGCTACCCAATTGAAATTCAGGCACTTTTCTTTATGGCTTTAAGGTGTGCTTTGATTTTGCTTAAGCAAGACCATGAGGGGAAGGACTTTGTAGAAAGAATAACAAAACGGCTTCACGCCATGAGCTATCACATGAGAACTTACTTTTGGATCGACCTAAAGCAACTAAATGATATATACCGATATAAAACTGAAGAATACTCTCATACTGCTTTGAACAAGTTCAACGTAATACCTGATTCTCTTCCTGAATGGATTTTTGACTTCATGCCAACTCGTGGTGGATACTTCATTGGAAATGTCAGCCCTGCAAGAATGGACTTCCGTTGGTTTTGCTTGGGAAATTGCATTGCAATTCTTTCTGCCTTGGCAACACCAGAACAAGCCACTGCTATTATGGATCTTATTGAATCGCGGTGGGAAGAGCTGGTTGGAGAAATGCCGTTAAAGGTTTGTTACCCTGCCATTGAAAGCCACGAGTGGCGAATTGTAACTGGATGTGACCCAAAAAATACAAGATGGAGTTACCATAACGGTGGTTCTTGGCCAGGTAAACATCTCTACTACCTCTAATTGCTTGCAAATTTATAGTTCCCATCTGTTCCCATTTACAACTTGAGTATGCAAATTAGGAGGCTTTGTCACGACCTTACTTAAACTAGATCGGTTCTATAGGTTGTACAAGTGACTTTGATTATAAAAAAAGTTTAGGAGGCTTAGTCCTAATGACAAGTAGACTCCGAAGTTGCACTAGAACAAAGATATTGACATTGCTGCAATATTCTTGCAGTTCTCCTATGGCTTTTAACAGCTGCATGTATCAAGACTGGGCGACCGCAGATTGCAAGACGTGCACTCGAACTGGCTGAATCCAGGCTACTGAAAGACAGCTGGCCAGAATATTATGATGGGACACTTGGACGGTACATCGGGAAACAGGCACGAAAGTTTCAGACGTGGTCAATTGCAGGTTACCTAGTTGCAAAGATGATGTTGGAAGACCCTTCTCATTCAGGCATGGTGTCCTTGGAGGAAGATAAGCAGATGAAGCCTTTAATGAAAAGGTCACATTCATGGACTTGTTAATGCTCCAAACTTATAGTTTTGGTTAGTAGTTGAACTTTTGTCTTTACAGCCTTCTTCCCACAGTAATTTTCATATATAATGTTTAGTTTTGTTTTCTAGAGGGATATTTGTTTGTGTTTTTTCTGGCTAACAGATAAGTATAAATGACGA
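The feature is annotated on the minus strand, and the sequence above starts with ATG, so it appears to be shown in gene orientation rather than chromosome orientation. If that reading is right, mapping it back onto the Chr7 reference requires a reverse complement. A minimal sketch, with `reverse_complement` as an illustrative helper name:

```python
# Translation table for the four unambiguous bases, both cases.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A/C/G/T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


# First 12 bases of the gene sequence shown above.
print(reverse_complement("ATGTCGAATTCC"))  # GGAATTCGACAT
```

Ambiguity codes such as N or R are left unchanged by this table; a production tool would handle them explicitly.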

mRNA sequence

ATGTCGAATTCCTCTTCGAACATGCCCCAGAATGGGAATGTTAAAAACAATGATACGTTATTCACAGTGGATGAGATTGAAGAAAGTGAGTTTTCGAAGCTATTGGATAGGCCAAGGCCGTTGAACATGGAGAGACAGAGGTCGTTCGATGAAAGATCGCTTGGGGATTTGGCGATTGGTTTTTCTCCACGGTTATCATCGAGAGTTTCTTCAGAAAACTTTGGTCGATTGAGTGATAATTACGATCATTCTCCATCGCCGGGTAGAAAATCGGATTTCAATACTCCAAGATCACACACTGGATTCGAGCAGCATCCTATGGTGGCAGAAGCTTGGGAAGCTTTGAGGCGTTCGTTGGTCTATTTCCGTGGCCAGCCAGTTGGTACTATTGCGGCGTTGGACAGTACTGAAGAAAATCTTAACTATGATCAAGTGTTTGTAAGAGACTTCGTCCCAAGTGCGTTTGCTTTTCTAATGAATGGGGAGCCTGAAATAGTAAAGAACTTTATCCTAAAAACTCTTCGGCTTCAGTCATGGGAGAAAAAGATTGATAGATTCCAGCTTGGAGAAGGGGTGATGCCTGCTAGTTTCAAAGTTCTTCATGATCCAGTGAGGAACACTGAAACTTTAATTGCAGATTTTGGAGAGAGTGCAATAGGAAGAGTTGCTCCTGTTGACTCTGGATTTTGGTGGATTATACTGCTTAGAGCATACACAAAGTCCACTGGTGACAGTTCATTGGCTGAACTACCAGAATGCCAAAAGGGAATGCGCCTTATTTTGAGTCTGTGTCTTTCAGAAGGTTTTGACACATTCCCAACCCTTCTCTGTGCCGATGGATGCTGCATGATTGATCGACGAATGGGTGTATATGGCTACCCAATTGAAATTCAGGCACTTTTCTTTATGGCTTTAAGGTGTGCTTTGATTTTGCTTAAGCAAGACCATGAGGGGAAGGACTTTGTAGAAAGAATAACAAAACGGCTTCACGCCATGAGCTATCACATGAGAACTTACTTTTGGATCGACCTAAAGCAACTAAATGATATATACCGATATAAAACTGAAGAATACTCTCATACTGCTTTGAACAAGTTCAACGTAATACCTGATTCTCTTCCTGAATGGATTTTTGACTTCATGCCAACTCGTGGTGGATACTTCATTGGAAATGTCAGCCCTGCAAGAATGGACTTCCGTTGGTTTTGCTTGGGAAATTGCATTGCAATTCTTTCTGCCTTGGCAACACCAGAACAAGCCACTGCTATTATGGATCTTATTGAATCGCGGTGGGAAGAGCTGGTTGGAGAAATGCCGTTAAAGGTTTGTTACCCTGCCATTGAAAGCCACGAGTGGCGAATTGTAACTGGATGTGACCCAAAAAATACAAGATGGAGTTACCATAACGGTGGTTCTTGGCCAGTTCTCCTATGGCTTTTAACAGCTGCATGTATCAAGACTGGGCGACCGCAGATTGCAAGACGTGCACTCGAACTGGCTGAATCCAGGCTACTGAAAGACAGCTGGCCAGAATATTATGATGGGACACTTGGACGGTACATCGGGAAACAGGCACGAAAGTTTCAGACGTGGTCAATTGCAGGTTACCTAGTTGCAAAGATGATGTTGGAAGACCCTTCTCATTCAGGCATGGTGTCCTTGGAGGAAGATAAGCAGATGAAGCCTTTAATGAAAAGGTCACATTCATGGACTTGTTAA

Coding sequence (CDS)

ATGTCGAATTCCTCTTCGAACATGCCCCAGAATGGGAATGTTAAAAACAATGATACGTTATTCACAGTGGATGAGATTGAAGAAAGTGAGTTTTCGAAGCTATTGGATAGGCCAAGGCCGTTGAACATGGAGAGACAGAGGTCGTTCGATGAAAGATCGCTTGGGGATTTGGCGATTGGTTTTTCTCCACGGTTATCATCGAGAGTTTCTTCAGAAAACTTTGGTCGATTGAGTGATAATTACGATCATTCTCCATCGCCGGGTAGAAAATCGGATTTCAATACTCCAAGATCACACACTGGATTCGAGCAGCATCCTATGGTGGCAGAAGCTTGGGAAGCTTTGAGGCGTTCGTTGGTCTATTTCCGTGGCCAGCCAGTTGGTACTATTGCGGCGTTGGACAGTACTGAAGAAAATCTTAACTATGATCAAGTGTTTGTAAGAGACTTCGTCCCAAGTGCGTTTGCTTTTCTAATGAATGGGGAGCCTGAAATAGTAAAGAACTTTATCCTAAAAACTCTTCGGCTTCAGTCATGGGAGAAAAAGATTGATAGATTCCAGCTTGGAGAAGGGGTGATGCCTGCTAGTTTCAAAGTTCTTCATGATCCAGTGAGGAACACTGAAACTTTAATTGCAGATTTTGGAGAGAGTGCAATAGGAAGAGTTGCTCCTGTTGACTCTGGATTTTGGTGGATTATACTGCTTAGAGCATACACAAAGTCCACTGGTGACAGTTCATTGGCTGAACTACCAGAATGCCAAAAGGGAATGCGCCTTATTTTGAGTCTGTGTCTTTCAGAAGGTTTTGACACATTCCCAACCCTTCTCTGTGCCGATGGATGCTGCATGATTGATCGACGAATGGGTGTATATGGCTACCCAATTGAAATTCAGGCACTTTTCTTTATGGCTTTAAGGTGTGCTTTGATTTTGCTTAAGCAAGACCATGAGGGGAAGGACTTTGTAGAAAGAATAACAAAACGGCTTCACGCCATGAGCTATCACATGAGAACTTACTTTTGGATCGACCTAAAGCAACTAAATGATATATACCGATATAAAACTGAAGAATACTCTCATACTGCTTTGAACAAGTTCAACGTAATACCTGATTCTCTTCCTGAATGGATTTTTGACTTCATGCCAACTCGTGGTGGATACTTCATTGGAAATGTCAGCCCTGCAAGAATGGACTTCCGTTGGTTTTGCTTGGGAAATTGCATTGCAATTCTTTCTGCCTTGGCAACACCAGAACAAGCCACTGCTATTATGGATCTTATTGAATCGCGGTGGGAAGAGCTGGTTGGAGAAATGCCGTTAAAGGTTTGTTACCCTGCCATTGAAAGCCACGAGTGGCGAATTGTAACTGGATGTGACCCAAAAAATACAAGATGGAGTTACCATAACGGTGGTTCTTGGCCAGTTCTCCTATGGCTTTTAACAGCTGCATGTATCAAGACTGGGCGACCGCAGATTGCAAGACGTGCACTCGAACTGGCTGAATCCAGGCTACTGAAAGACAGCTGGCCAGAATATTATGATGGGACACTTGGACGGTACATCGGGAAACAGGCACGAAAGTTTCAGACGTGGTCAATTGCAGGTTACCTAGTTGCAAAGATGATGTTGGAAGACCCTTCTCATTCAGGCATGGTGTCCTTGGAGGAAGATAAGCAGATGAAGCCTTTAATGAAAAGGTCACATTCATGGACTTGTTAA

Protein sequence

MSNSSSNMPQNGNVKNNDTLFTVDEIEESEFSKLLDRPRPLNMERQRSFDERSLGDLAIGFSPRLSSRVSSENFGRLSDNYDHSPSPGRKSDFNTPRSHTGFEQHPMVAEAWEALRRSLVYFRGQPVGTIAALDSTEENLNYDQVFVRDFVPSAFAFLMNGEPEIVKNFILKTLRLQSWEKKIDRFQLGEGVMPASFKVLHDPVRNTETLIADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDSSLAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQALFFMALRCALILLKQDHEGKDFVERITKRLHAMSYHMRTYFWIDLKQLNDIYRYKTEEYSHTALNKFNVIPDSLPEWIFDFMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSALATPEQATAIMDLIESRWEELVGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRALELAESRLLKDSWPEYYDGTLGRYIGKQARKFQTWSIAGYLVAKMMLEDPSHSGMVSLEEDKQMKPLMKRSHSWTC*
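The protein sequence can be checked against the CDS by translating it with the standard genetic code. The sketch below builds the codon table from the conventional TCAG ordering and translates the first and last codons of the CDS shown on this page. `translate` is an illustrative helper, not a database function, and it assumes an uppercase or lowercase A/C/G/T sequence whose length is a multiple of three.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a coding sequence; stop codons are written as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


print(translate("ATGTCGAATTCCTCTTCGAAC"))  # MSNSSSN  (start of the CDS)
print(translate("TGGACTTGTTAA"))           # WTC*     (end of the CDS)
```

Both fragments agree with the start (MSNSSSN) and end (WTC*) of the protein sequence listed above.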
BLAST of Csa7G009210 vs. Swiss-Prot
Match: INVB_ARATH (Probable alkaline/neutral invertase B OS=Arabidopsis thaliana GN=INVB PE=1 SV=1)

HSP 1 Score: 986.5 bits (2549), Expect = 1.2e-286
Identity = 472/572 (82.52%), Positives = 523/572 (91.43%), Query Frame = 1

Query: 3   NSSSNMPQNGNVKNNDTLFTVDEIEESEFSKLLDRPRPLNMERQRSFDERSLGDLAIGFS 62
           N S ++ QNGN+KN D+L T+D+I++ +F+KLL++PRPLN++R RS DERSL +L    S
Sbjct: 5   NLSVDVNQNGNIKNVDSLSTLDDIDDIDFAKLLEKPRPLNIDRLRSLDERSLTELT--GS 64

Query: 63  PRLSSRVSSENFGRLSDNYDH--SPSPGRKSDFNTPRSHTGFEQHPMVAEAWEALRRSLV 122
           P+L +   ++N  R  D+ D+  SPS GR+S FNTPRS  GFE HPMV EAW+ALRRS+V
Sbjct: 65  PQLRN---ADNASRAPDHADYVISPSFGRRSGFNTPRSQPGFESHPMVGEAWDALRRSMV 124

Query: 123 YFRGQPVGTIAALDSTEENLNYDQVFVRDFVPSAFAFLMNGEPEIVKNFILKTLRLQSWE 182
           YFRGQPVGTIAA+D++EE LNYDQVFVRDFVPSA AFLMNGEP+IVKNF+LKTLRLQSWE
Sbjct: 125 YFRGQPVGTIAAVDNSEEKLNYDQVFVRDFVPSALAFLMNGEPDIVKNFLLKTLRLQSWE 184

Query: 183 KKIDRFQLGEGVMPASFKVLHDPVRNTETLIADFGESAIGRVAPVDSGFWWIILLRAYTK 242
           KKIDRFQLGEGVMPASFKV HDPVRN ETLIADFGESAIGRVAPVDSGFWWIILLRAYTK
Sbjct: 185 KKIDRFQLGEGVMPASFKVFHDPVRNHETLIADFGESAIGRVAPVDSGFWWIILLRAYTK 244

Query: 243 STGDSSLAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQAL 302
           STGDSSLA++PECQKG+RLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQAL
Sbjct: 245 STGDSSLADMPECQKGIRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQAL 304

Query: 303 FFMALRCALILLKQDHEGKDFVERITKRLHAMSYHMRTYFWIDLKQLNDIYRYKTEEYSH 362
           FFMALRCAL+LLK D EGK+ VE+I KRLHA+SYHMR+YFW+DLKQLNDIYRYKTEEYSH
Sbjct: 305 FFMALRCALLLLKHDGEGKEMVEQIVKRLHALSYHMRSYFWLDLKQLNDIYRYKTEEYSH 364

Query: 363 TALNKFNVIPDSLPEWIFDFMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSALATPEQA 422
           TA+NKFNVIPDSLPEW+FDFMP  GG+FIGNVSPARMDFRWF LGNCIAILS+LATPEQ+
Sbjct: 365 TAVNKFNVIPDSLPEWVFDFMPPHGGFFIGNVSPARMDFRWFALGNCIAILSSLATPEQS 424

Query: 423 TAIMDLIESRWEELVGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLL 482
           TAIMDLIESRWEELVGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLL
Sbjct: 425 TAIMDLIESRWEELVGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLL 484

Query: 483 TAACIKTGRPQIARRALELAESRLLKDSWPEYYDGTLGRYIGKQARKFQTWSIAGYLVAK 542
           TAACIKTGRPQIARRA+E+AE+RL KD WPEYYDG +GRY+GKQ+RK QTWS+AGYLVAK
Sbjct: 485 TAACIKTGRPQIARRAIEVAEARLHKDHWPEYYDGKVGRYVGKQSRKNQTWSVAGYLVAK 544

Query: 543 MMLEDPSHSGMVSLEEDKQMKPLMKRSHSWTC 573
           MMLEDPSH GMV LEEDKQMKP+M+RS+SWTC
Sbjct: 545 MMLEDPSHVGMVCLEEDKQMKPVMRRSNSWTC 571
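The Identity and Positives figures in the HSP header are ratios over the alignment length. In BLAST pairwise output, the middle line of each block marks identical residues with the residue letter and conservative substitutions with '+', so positives are identities plus '+' columns. A small sketch of both calculations follows; the helper names are illustrative.

```python
def percent(matches, alignment_length):
    """Percentage of alignment columns that match, to two decimals."""
    return round(100.0 * matches / alignment_length, 2)


def count_midline(midline):
    """Count (identities, positives) from a BLAST midline string.

    Letters mark identities; '+' marks conservative substitutions.
    Positives include identities.
    """
    identities = sum(ch.isalpha() for ch in midline)
    conservative = midline.count("+")
    return identities, identities + conservative


print(percent(472, 572))  # 82.52
print(percent(523, 572))  # 91.43

# First 14 columns of the midline in the alignment above.
print(count_midline("N S ++ QNGN+KN"))  # (8, 11)
```

Summing `count_midline` over every block of an HSP should reproduce the numerators in its header, provided the midlines are copied with their spacing intact.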

BLAST of Csa7G009210 vs. Swiss-Prot
Match: CINV2_ARATH (Alkaline/neutral invertase CINV2 OS=Arabidopsis thaliana GN=CINV2 PE=1 SV=1)

HSP 1 Score: 900.6 bits (2326), Expect = 8.9e-261
Identity = 423/553 (76.49%), Positives = 486/553 (87.88%), Query Frame = 1

Query: 22  TVDEIEESEFSKLLDRPRPLNMERQRSFDERSLGDLAIGFSPRLSSRVSSENFGRLSDNY 81
           ++ E+++ + ++ L++PR L +ER+RSFDERS+ +L+ G+              R     
Sbjct: 19  SLSEMDDFDLTRALEKPRQLKIERKRSFDERSMSELSTGYV-------------RQDSIL 78

Query: 82  DHSPSPGRKSDFNTPRS-HTGFEQHPMVAEAWEALRRSLVYFRGQPVGTIAALD-STEEN 141
           + + SPG +S  +TP S    FE HPMVAEAWEALRRS+V+FRGQPVGTIAA D ++EE 
Sbjct: 79  EMAHSPGSRSMVDTPLSVRNSFEPHPMVAEAWEALRRSMVFFRGQPVGTIAAYDHASEEV 138

Query: 142 LNYDQVFVRDFVPSAFAFLMNGEPEIVKNFILKTLRLQSWEKKIDRFQLGEGVMPASFKV 201
           LNYDQVFVRDFVPSA AFLMNGEP+IVKNF+LKTL+LQ WEK++DRF+LGEGVMPASFKV
Sbjct: 139 LNYDQVFVRDFVPSALAFLMNGEPDIVKNFLLKTLQLQGWEKRVDRFKLGEGVMPASFKV 198

Query: 202 LHDPVRNTETLIADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDSSLAELPECQKGMRL 261
           LHDPVR T+T+IADFGESAIGRVAPVDSGFWWIILLRAYTKSTGD +L+E PECQ+GMRL
Sbjct: 199 LHDPVRKTDTIIADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLTLSETPECQRGMRL 258

Query: 262 ILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQALFFMALRCALILLKQDHEGK 321
           ILSLCLSEGFDTFPTLLCADGC M+DRRMGVYGYPIEIQALFFMALRCAL +LK D EG+
Sbjct: 259 ILSLCLSEGFDTFPTLLCADGCSMVDRRMGVYGYPIEIQALFFMALRCALSMLKPDEEGR 318

Query: 322 DFVERITKRLHAMSYHMRTYFWIDLKQLNDIYRYKTEEYSHTALNKFNVIPDSLPEWIFD 381
           DF+ERI KRLHA+S+HMR+YFW+D +QLNDIYRYKTEEYSHTA+NKFNV+PDS+P+W+FD
Sbjct: 319 DFIERIVKRLHALSFHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVMPDSIPDWVFD 378

Query: 382 FMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSALATPEQATAIMDLIESRWEELVGEMP 441
           FMP RGGYF+GNVSPARMDFRWF LGNC++ILS+LATP+Q+ AIMDL+E RWEELVGEMP
Sbjct: 379 FMPLRGGYFVGNVSPARMDFRWFSLGNCVSILSSLATPDQSMAIMDLLEHRWEELVGEMP 438

Query: 442 LKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRALEL 501
           LK+CYP IESHEWRIVTGCDPKNTRWSYHNGGSWPVLLW LTAACIKTGRPQIARRA++L
Sbjct: 439 LKICYPCIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWTLTAACIKTGRPQIARRAIDL 498

Query: 502 AESRLLKDSWPEYYDGTLGRYIGKQARKFQTWSIAGYLVAKMMLEDPSHSGMVSLEEDKQ 561
            ESRL +D WPEYYDG  GRY+GKQARK+QTWSIAGYLVAKMMLEDPSH GM+SLEEDKQ
Sbjct: 499 IESRLHRDCWPEYYDGKQGRYVGKQARKYQTWSIAGYLVAKMMLEDPSHIGMISLEEDKQ 558

Query: 562 MKPLMKRSHSWTC 573
           MKP++KRS SWTC
Sbjct: 559 MKPVIKRSASWTC 558
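The bit score and Expect value in each HSP header are linked by the Karlin-Altschul relation E = m * n * 2^(-S'), where S' is the bit score and m * n is the effective search space (query length times database length, after edge corrections). The sketch below solves that relation for the search space implied by the figures reported for this hit. It works in log10 to avoid numerical underflow, and it is only an approximation because the page rounds both numbers.

```python
import math


def log10_search_space(bit_score, e_value):
    """Solve E = m*n * 2**(-bit_score) for log10(m*n)."""
    return math.log10(e_value) + bit_score * math.log10(2)


def log10_e_value(bit_score, log10_mn):
    """Inverse: log10 of the expected E for a given search space."""
    return log10_mn - bit_score * math.log10(2)


# Values reported for the CINV2_ARATH hit above.
space = log10_search_space(900.6, 8.9e-261)
print(round(space, 2))  # 11.06
```

A value near 11 corresponds to a search space of roughly 10^11 residue pairs, which is the order of magnitude one would expect for a query of a few hundred residues against a database of a few hundred million residues.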

BLAST of Csa7G009210 vs. Swiss-Prot
Match: CINV1_ORYSJ (Cytosolic invertase 1 OS=Oryza sativa subsp. japonica GN=CINV1 PE=1 SV=1)

HSP 1 Score: 892.1 bits (2304), Expect = 3.2e-258
Identity = 431/565 (76.28%), Positives = 487/565 (86.19%), Query Frame = 1

Query: 12  GNVKNNDTLFTVDEIEESEFSKLLDRPRPLNMERQRSFDERSLGDLAIGFSPRLSSRVSS 71
           G ++ + +  ++ E ++ + S+LL++PR +N+ERQRSFD+RSL D++           S 
Sbjct: 8   GGMRRSASHTSLSESDDFDLSRLLNKPR-INVERQRSFDDRSLSDVSY----------SG 67

Query: 72  ENFGRLSDNYD--HSPSPGRKSDFNTPRSHT--GFEQHPMVAEAWEALRRSLVYFRGQPV 131
              G     +D  +SP  G +S   TP S     FE HP+V +AWEALRRSLV+FRGQP+
Sbjct: 68  GGHGGTRGGFDGMYSPGGGLRSLVGTPASSALHSFEPHPIVGDAWEALRRSLVFFRGQPL 127

Query: 132 GTIAALD-STEENLNYDQVFVRDFVPSAFAFLMNGEPEIVKNFILKTLRLQSWEKKIDRF 191
           GTIAA D ++EE LNYDQVFVRDFVPSA AFLMNGEPEIV++F+LKTL LQ WEKK+DRF
Sbjct: 128 GTIAAFDHASEEVLNYDQVFVRDFVPSALAFLMNGEPEIVRHFLLKTLLLQGWEKKVDRF 187

Query: 192 QLGEGVMPASFKVLHDPVRNTETLIADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDSS 251
           +LGEG MPASFKVLHD  +  +TL ADFGESAIGRVAPVDSGFWWIILLRAYTKSTGD +
Sbjct: 188 KLGEGAMPASFKVLHDSKKGVDTLHADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLT 247

Query: 252 LAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQALFFMALR 311
           LAE PECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQALFFMALR
Sbjct: 248 LAETPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQALFFMALR 307

Query: 312 CALILLKQDHEGKDFVERITKRLHAMSYHMRTYFWIDLKQLNDIYRYKTEEYSHTALNKF 371
           CAL LLK D+EGK+FVERI  RLHA+SYHMR+Y+W+D +QLNDIYRYKTEEYSHTA+NKF
Sbjct: 308 CALQLLKHDNEGKEFVERIATRLHALSYHMRSYYWLDFQQLNDIYRYKTEEYSHTAVNKF 367

Query: 372 NVIPDSLPEWIFDFMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSALATPEQATAIMDL 431
           NVIPDS+P+W+FDFMP +GG+FIGNVSPARMDFRWF LGN IAILS+LATPEQ+TAIMDL
Sbjct: 368 NVIPDSIPDWLFDFMPCQGGFFIGNVSPARMDFRWFALGNMIAILSSLATPEQSTAIMDL 427

Query: 432 IESRWEELVGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIK 491
           IE RWEEL+GEMPLK+CYPAIE+HEWRIVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIK
Sbjct: 428 IEERWEELIGEMPLKICYPAIENHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIK 487

Query: 492 TGRPQIARRALELAESRLLKDSWPEYYDGTLGRYIGKQARKFQTWSIAGYLVAKMMLEDP 551
           TGRPQIARRA++LAE RLLKD WPEYYDG LGRY+GKQARKFQTWSIAGYLVAKMMLEDP
Sbjct: 488 TGRPQIARRAIDLAERRLLKDGWPEYYDGKLGRYVGKQARKFQTWSIAGYLVAKMMLEDP 547

Query: 552 SHSGMVSLEEDKQMKPLMKRSHSWT 572
           SH GM+SLEEDK MKP++KRS SWT
Sbjct: 548 SHLGMISLEEDKAMKPVLKRSASWT 561
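The Swiss-Prot and TrEMBL "Match:" lines in this report follow the UniProt FASTA header layout: entry name, protein name, then OS= (organism), GN= (gene name), PE= (protein existence level) and SV= (sequence version). A hedged parsing sketch follows. The regular expression is written for the lines as they appear on this page and is not a general UniProt parser; `parse_match` is an illustrative name.

```python
import re

MATCH_RE = re.compile(
    r"^(?P<entry>\S+) \((?P<name>.+?) OS=(?P<os>.+?) "
    r"GN=(?P<gn>\S+) PE=(?P<pe>\d+) SV=(?P<sv>\d+)\)$"
)


def parse_match(line):
    """Parse a 'Match: ENTRY (Name OS=... GN=... PE=n SV=n)' line.

    Returns a dict of fields, or None if the line has another layout
    (for example the TAIR10 match lines, which carry no OS=/GN= tags).
    """
    text = line.split("Match: ", 1)[-1].strip()
    m = MATCH_RE.match(text)
    if m is None:
        return None
    fields = m.groupdict()
    fields["pe"] = int(fields["pe"])
    fields["sv"] = int(fields["sv"])
    return fields


hit = parse_match(
    "Match: CINV1_ORYSJ (Cytosolic invertase 1 "
    "OS=Oryza sativa subsp. japonica GN=CINV1 PE=1 SV=1)"
)
print(hit["entry"], "|", hit["os"], "|", hit["gn"])
# CINV1_ORYSJ | Oryza sativa subsp. japonica | CINV1
```

Returning None for unrecognised layouts keeps the helper safe to run over every match line in the report.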

BLAST of Csa7G009210 vs. Swiss-Prot
Match: INVD_ARATH (Probable alkaline/neutral invertase D OS=Arabidopsis thaliana GN=INVD PE=2 SV=1)

HSP 1 Score: 878.2 bits (2268), Expect = 4.7e-254
Identity = 423/545 (77.61%), Positives = 476/545 (87.34%), Query Frame = 1

Query: 30  EFSKLLDRPRPLNMERQRSFDERSLGDLAIGFSPRLSSRVSSENFGRLSDNYDHSPSPGR 89
           E ++LLDRPR +N+ER+RSFDERS  ++ I                     +D+  SPG 
Sbjct: 15  ELARLLDRPR-VNIERKRSFDERSFSEMGI---------------------FDNVNSPG- 74

Query: 90  KSDFNTPRS--HTGFEQHPMVAEAWEALRRSLVYFRGQPVGTIAALD-STEENLNYDQVF 149
              + TP S     FE HPMVAEAW+ALRRSLVYFRGQPVGTIAA D +TEE LNYDQVF
Sbjct: 75  --GWETPVSSARNSFEPHPMVAEAWDALRRSLVYFRGQPVGTIAAYDHATEEVLNYDQVF 134

Query: 150 VRDFVPSAFAFLMNGEPEIVKNFILKTLRLQSWEKKIDRFQLGEGVMPASFKVLHDPVRN 209
           VRDFVPSA AFLMNGEP+IVKNF+LKT+++Q  EK+IDRF+LGEG MPASFKV+HDP++ 
Sbjct: 135 VRDFVPSALAFLMNGEPDIVKNFLLKTIQIQGREKRIDRFKLGEGAMPASFKVIHDPIKE 194

Query: 210 TETLIADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDSSLAELPECQKGMRLILSLCLS 269
           T+++ ADFGESAIGRVAPVDSGFWWIILLRAYTKSTGD+SLAE  ECQKGMRLILSLCLS
Sbjct: 195 TDSINADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDTSLAETSECQKGMRLILSLCLS 254

Query: 270 EGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQALFFMALRCALILLKQDHEGKDFVERIT 329
           EGFDTFPTLLCADGC MIDRRMGVYGYPIEIQALFFMALR A+ +LK D EGK+F+ERI 
Sbjct: 255 EGFDTFPTLLCADGCSMIDRRMGVYGYPIEIQALFFMALRSAMSMLKHDAEGKEFMERIV 314

Query: 330 KRLHAMSYHMRTYFWIDLKQLNDIYRYKTEEYSHTALNKFNVIPDSLPEWIFDFMPTRGG 389
           KRLHA+S+HMR+YFW+D +QLNDIYRYKTEEYSHTA+NKFNVIPDS+PEW+FDFMP RGG
Sbjct: 315 KRLHALSFHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVFDFMPLRGG 374

Query: 390 YFIGNVSPARMDFRWFCLGNCIAILSALATPEQATAIMDLIESRWEELVGEMPLKVCYPA 449
           YFIGNVSPARMDFRWF LGNC+AIL++LATPEQ+ +IMDLIE RWEELVGEMP+K+C+PA
Sbjct: 375 YFIGNVSPARMDFRWFALGNCVAILASLATPEQSASIMDLIEERWEELVGEMPVKICHPA 434

Query: 450 IESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRALELAESRLLK 509
           IESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRA++LAE+RLLK
Sbjct: 435 IESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIDLAEARLLK 494

Query: 510 DSWPEYYDGTLGRYIGKQARKFQTWSIAGYLVAKMMLEDPSHSGMVSLEEDKQMKPLMKR 569
           D WPEYYDG  GR+IGKQARKFQTWSIAGYLVAKM+LEDPSH GM+SLEEDKQ KP++KR
Sbjct: 495 DGWPEYYDGKSGRFIGKQARKFQTWSIAGYLVAKMLLEDPSHLGMISLEEDKQTKPVIKR 534

Query: 570 SHSWT 572
           S+SWT
Sbjct: 555 SYSWT 534

BLAST of Csa7G009210 vs. Swiss-Prot
Match: CINV1_ARATH (Alkaline/neutral invertase CINV1 OS=Arabidopsis thaliana GN=CINV1 PE=1 SV=1)

HSP 1 Score: 859.0 bits (2218), Expect = 3.0e-248
Identity = 406/552 (73.55%), Positives = 476/552 (86.23%), Query Frame = 1

Query: 22  TVDEIEESEFSKLLDRPRPLNMERQRSFDERSLGDLAIGFSPRLSSRVSSENFGRLSDNY 81
           ++ E+++ + ++ LD+PR L +ER+RSFDERS+ +L+ G+S             R    +
Sbjct: 14  SLSEMDDLDLTRALDKPR-LKIERKRSFDERSMSELSTGYS-------------RHDGIH 73

Query: 82  DHSPSPGRKSDFNTPRS--HTGFEQHPMVAEAWEALRRSLVYFRGQPVGTIAALDST-EE 141
           D   SP  +S  +TP S     FE HPM+AEAWEALRRS+V+FRGQPVGT+AA+D+T +E
Sbjct: 74  D---SPRGRSVLDTPLSSARNSFEPHPMMAEAWEALRRSMVFFRGQPVGTLAAVDNTTDE 133

Query: 142 NLNYDQVFVRDFVPSAFAFLMNGEPEIVKNFILKTLRLQSWEKKIDRFQLGEGVMPASFK 201
            LNYDQVFVRDFVPSA AFLMNGEP+IVK+F+LKTL+LQ WEK++DRF+LGEGVMPASFK
Sbjct: 134 VLNYDQVFVRDFVPSALAFLMNGEPDIVKHFLLKTLQLQGWEKRVDRFKLGEGVMPASFK 193

Query: 202 VLHDPVRNTETLIADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDSSLAELPECQKGMR 261
           VLHDP+R T+ ++ADFGESAIGRVAPVDSGFWWIILLRAYTKSTGD +L+E PECQKGM+
Sbjct: 194 VLHDPIRETDNIVADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLTLSETPECQKGMK 253

Query: 262 LILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQALFFMALRCALILLKQDHEG 321
           LILSLCL+EGFDTFPTLLCADGC MIDRRMGVYGYPIEIQALFFMALR AL +LK D +G
Sbjct: 254 LILSLCLAEGFDTFPTLLCADGCSMIDRRMGVYGYPIEIQALFFMALRSALSMLKPDGDG 313

Query: 322 KDFVERITKRLHAMSYHMRTYFWIDLKQLNDIYRYKTEEYSHTALNKFNVIPDSLPEWIF 381
           ++ +ERI KRLHA+S+HMR YFW+D + LNDIYR+KTEEYSHTA+NKFNV+PDS+PEW+F
Sbjct: 314 REVIERIVKRLHALSFHMRNYFWLDHQNLNDIYRFKTEEYSHTAVNKFNVMPDSIPEWVF 373

Query: 382 DFMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSALATPEQATAIMDLIESRWEELVGEM 441
           DFMP RGGYF+GNV PA MDFRWF LGNC++ILS+LATP+Q+ AIMDL+E RW ELVGEM
Sbjct: 374 DFMPLRGGYFVGNVGPAHMDFRWFALGNCVSILSSLATPDQSMAIMDLLEHRWAELVGEM 433

Query: 442 PLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRALE 501
           PLK+CYP +E HEWRIVTGCDPKNTRWSYHNGGSWPVLLW LTAACIKTGRPQIARRA++
Sbjct: 434 PLKICYPCLEGHEWRIVTGCDPKNTRWSYHNGGSWPVLLWQLTAACIKTGRPQIARRAVD 493

Query: 502 LAESRLLKDSWPEYYDGTLGRYIGKQARKFQTWSIAGYLVAKMMLEDPSHSGMVSLEEDK 561
           L ESRL +D WPEYYDG LGRY+GKQARK+QTWSIAGYLVAKM+LEDPSH GM+SLEEDK
Sbjct: 494 LIESRLHRDCWPEYYDGKLGRYVGKQARKYQTWSIAGYLVAKMLLEDPSHIGMISLEEDK 548

Query: 562 QMKPLMKRSHSW 571
            MKP++KRS SW
Sbjct: 554 LMKPVIKRSASW 548

BLAST of Csa7G009210 vs. TrEMBL
Match: A0A061DYK5_THECC (Plant neutral invertase family protein isoform 1 OS=Theobroma cacao GN=TCM_006699 PE=4 SV=1)

HSP 1 Score: 1023.5 bits (2645), Expect = 1.0e-295
Identity = 492/573 (85.86%), Positives = 532/573 (92.84%), Query Frame = 1

Query: 1   MSNSSSNMPQNGNVKNNDTLFTVDEIEESEFSKLLDRP-RPLNMERQRSFDERSLGDLAI 60
           MS  + ++ QNGNVK  DTL T+ E EE +FSKLL++P R LNMERQRS DERSL DL+I
Sbjct: 1   MSTPTVDVNQNGNVKTEDTLCTLAEFEECDFSKLLEKPPRILNMERQRSLDERSLSDLSI 60

Query: 61  GFSPRLSSRVSSENFGRLSDNYDHSPSP-GRKSDFNTPRSHTGFEQHPMVAEAWEALRRS 120
           G SPRLS+R +  N  R+ +  D   SP GR+S FNTPRS TGFE HPMVAEAW+ALRRS
Sbjct: 61  GISPRLSARATDINTSRIFEPLDFICSPVGRRSGFNTPRSQTGFEPHPMVAEAWDALRRS 120

Query: 121 LVYFRGQPVGTIAALDSTEENLNYDQVFVRDFVPSAFAFLMNGEPEIVKNFILKTLRLQS 180
           LVYFRGQPVGTIAALD++EE LNYDQVFVRDFVPS  AFLMNGEPEIVKNFILKTLRLQS
Sbjct: 121 LVYFRGQPVGTIAALDNSEEKLNYDQVFVRDFVPSGLAFLMNGEPEIVKNFILKTLRLQS 180

Query: 181 WEKKIDRFQLGEGVMPASFKVLHDPVRNTETLIADFGESAIGRVAPVDSGFWWIILLRAY 240
           WEKKIDRFQLGEGVMPASFKVLHDPVRN ETL+ADFGESAIGRVAPVDSGFWWIILLRAY
Sbjct: 181 WEKKIDRFQLGEGVMPASFKVLHDPVRNNETLMADFGESAIGRVAPVDSGFWWIILLRAY 240

Query: 241 TKSTGDSSLAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQ 300
           TKSTGD+SLAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQ
Sbjct: 241 TKSTGDTSLAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQ 300

Query: 301 ALFFMALRCALILLKQDHEGKDFVERITKRLHAMSYHMRTYFWIDLKQLNDIYRYKTEEY 360
           ALFFMALRCAL+LLKQD EGK+F+ERI KRLHA+S+HMR+YFW+DLKQLNDIYRYKTEEY
Sbjct: 301 ALFFMALRCALLLLKQDDEGKEFIERIVKRLHALSFHMRSYFWLDLKQLNDIYRYKTEEY 360

Query: 361 SHTALNKFNVIPDSLPEWIFDFMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSALATPE 420
           SHTALNKFNV+PDSLPEWIFDFMP RGGYFIGNVSPARMDFRWFCLGNCIAILS+LATPE
Sbjct: 361 SHTALNKFNVMPDSLPEWIFDFMPVRGGYFIGNVSPARMDFRWFCLGNCIAILSSLATPE 420

Query: 421 QATAIMDLIESRWEELVGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLW 480
           Q+TAIMDLIESRWEELVGEMPLKVCYPAIE+HEWRI TGCDPKNTRWSYHNGGSWPVLLW
Sbjct: 421 QSTAIMDLIESRWEELVGEMPLKVCYPAIENHEWRITTGCDPKNTRWSYHNGGSWPVLLW 480

Query: 481 LLTAACIKTGRPQIARRALELAESRLLKDSWPEYYDGTLGRYIGKQARKFQTWSIAGYLV 540
           LLTAAC+KTGRPQIARRALE+AE+RLLKD+WPEYYDG LGRYIGKQ+RK QTWSIAGYLV
Sbjct: 481 LLTAACVKTGRPQIARRALEIAETRLLKDNWPEYYDGKLGRYIGKQSRKVQTWSIAGYLV 540

Query: 541 AKMMLEDPSHSGMVSLEEDKQMKPLMKRSHSWT 572
           AKM+LEDPSH GM++LEEDKQMKPL++RS+SWT
Sbjct: 541 AKMLLEDPSHLGMIALEEDKQMKPLLRRSNSWT 573

BLAST of Csa7G009210 vs. TrEMBL
Match: M5W112_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003483mg PE=4 SV=1)

HSP 1 Score: 1020.8 bits (2638), Expect = 6.6e-295
Identity = 490/572 (85.66%), Positives = 530/572 (92.66%), Query Frame = 1

Query: 1   MSNSSSNMPQNGNVKNNDTLFTVDEIEESEFSKLLDRPRPLNMERQRSFDERSLGDLAIG 60
           MS  +S+M QNGN+++ D+L +V EIEE +FSKLLDRP  LNMER+RSFDERSL +L++ 
Sbjct: 1   MSIPNSDMSQNGNIRHVDSLCSVAEIEEIDFSKLLDRPSLLNMERKRSFDERSLSELSVA 60

Query: 61  FSPRLSSRVSSENFGRLSDNYDHSPSPGRKSDFNTPRSHTGFEQHPMVAEAWEALRRSLV 120
            SPR SSR +  +F +  D+ ++  SP R+S   TPRS TGFE HPMVAEAWE LRRSLV
Sbjct: 61  LSPRHSSRNADNSF-KFFDHPEYVFSPSRRSLIGTPRSLTGFEPHPMVAEAWETLRRSLV 120

Query: 121 YFRGQPVGTIAALDSTEENLNYDQVFVRDFVPSAFAFLMNGEPEIVKNFILKTLRLQSWE 180
           +FRGQPVGTIAA D++EE LNYDQVFVRDFVPS  AFLMNGEPEIVKNFILKTLRLQSWE
Sbjct: 121 FFRGQPVGTIAATDTSEEKLNYDQVFVRDFVPSGLAFLMNGEPEIVKNFILKTLRLQSWE 180

Query: 181 KKIDRFQLGEGVMPASFKVLHDPVRNTETLIADFGESAIGRVAPVDSGFWWIILLRAYTK 240
           KKIDRFQLGEGVMPASFKVLHDPVRN+ETLIADFGESAIGRVAPVDSGFWWIILLRAYTK
Sbjct: 181 KKIDRFQLGEGVMPASFKVLHDPVRNSETLIADFGESAIGRVAPVDSGFWWIILLRAYTK 240

Query: 241 STGDSSLAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQAL 300
           STGDSSLAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQAL
Sbjct: 241 STGDSSLAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQAL 300

Query: 301 FFMALRCALILLKQDHEGKDFVERITKRLHAMSYHMRTYFWIDLKQLNDIYRYKTEEYSH 360
           FFMALRCAL+LLK D EGK+FVERI KRLHA+SYHMR+YFW+D KQLNDIYRYKTEEYSH
Sbjct: 301 FFMALRCALLLLKHDDEGKEFVERIVKRLHALSYHMRSYFWLDFKQLNDIYRYKTEEYSH 360

Query: 361 TALNKFNVIPDSLPEWIFDFMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSALATPEQA 420
           TA+NKFNVIPDSLPEW+FDFMPTRGGYFIGN+SPARMDFRWFCLGNCIAILS+LATPEQ+
Sbjct: 361 TAVNKFNVIPDSLPEWVFDFMPTRGGYFIGNISPARMDFRWFCLGNCIAILSSLATPEQS 420

Query: 421 TAIMDLIESRWEELVGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLL 480
            AIMDLIESRWEEL GEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLL
Sbjct: 421 MAIMDLIESRWEELAGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLL 480

Query: 481 TAACIKTGRPQIARRALELAESRLLKDSWPEYYDGTLGRYIGKQARKFQTWSIAGYLVAK 540
           TAACIKTGRPQIARRA+ELAESRLLKD+WPEYYDG LGRYIGKQARKFQTWS+AGYLVAK
Sbjct: 481 TAACIKTGRPQIARRAIELAESRLLKDNWPEYYDGKLGRYIGKQARKFQTWSVAGYLVAK 540

Query: 541 MMLEDPSHSGMVSLEEDKQMKPLMKRSHSWTC 573
           M+LEDPSH GM++LEEDKQMKP MKRS+SWTC
Sbjct: 541 MLLEDPSHLGMIALEEDKQMKPAMKRSNSWTC 571

BLAST of Csa7G009210 vs. TrEMBL
Match: A0A059BCZ8_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_G01751 PE=4 SV=1)

HSP 1 Score: 1008.8 bits (2607), Expect = 2.6e-291
Identity = 485/565 (85.84%), Positives = 525/565 (92.92%), Query Frame = 1

Query: 7   NMPQNGNVKNNDTLFTVDEIEESEFSKLLDRPRPLNMERQRSFDERSLGDLAIGFSPRLS 66
           ++ +NGNV+ +D L T+ E EE +FSKL++RPRPLNMER+RS DERSL +L+   SP LS
Sbjct: 8   SVTENGNVRGSDLLCTLAESEECDFSKLMERPRPLNMERKRSLDERSLNELSTALSPHLS 67

Query: 67  SRVSSENFGRLSDNYDHSPSPGRKSDFNTPRSHTGFEQHPMVAEAWEALRRSLVYFRGQP 126
            R +SE+  RL+D +    SP R+S +NTPR+  GF+ HPMVAEAWEALRRSLVYFRGQP
Sbjct: 68  LR-NSESSSRLTDPFGSFLSPDRRSGYNTPRADNGFDTHPMVAEAWEALRRSLVYFRGQP 127

Query: 127 VGTIAALDSTEENLNYDQVFVRDFVPSAFAFLMNGEPEIVKNFILKTLRLQSWEKKIDRF 186
           VGTIAALDS+EENLNYDQVFVRDF PSA AFLMNGE EIVKNFILKTLRLQSWEKKIDRF
Sbjct: 128 VGTIAALDSSEENLNYDQVFVRDFFPSALAFLMNGESEIVKNFILKTLRLQSWEKKIDRF 187

Query: 187 QLGEGVMPASFKVLHDPVRNTETLIADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDSS 246
           QLGEGVMPASFKVLHDPVRN +TLIADFGESAIGRVAPVDSGFWWIILLRAYTKSTGD+S
Sbjct: 188 QLGEGVMPASFKVLHDPVRNNDTLIADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDTS 247

Query: 247 LAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQALFFMALR 306
           LAEL ECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQALFFMALR
Sbjct: 248 LAELSECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQALFFMALR 307

Query: 307 CALILLKQDHEGKDFVERITKRLHAMSYHMRTYFWIDLKQLNDIYRYKTEEYSHTALNKF 366
           CAL+LLKQD EGK+FVERI KRLHA++YHMR YFWIDLK LNDIYRYKTEEYSHTA+NKF
Sbjct: 308 CALLLLKQDVEGKEFVERIVKRLHALTYHMRGYFWIDLKHLNDIYRYKTEEYSHTAVNKF 367

Query: 367 NVIPDSLPEWIFDFMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSALATPEQATAIMDL 426
           NVIPDSLPEWIFDFMPTRGGYF+GNVSPARMDFRWFCLGNCIAILS+LATPEQ+TAIMDL
Sbjct: 368 NVIPDSLPEWIFDFMPTRGGYFVGNVSPARMDFRWFCLGNCIAILSSLATPEQSTAIMDL 427

Query: 427 IESRWEELVGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIK 486
           IESRWEELVGEMPLKVCYPAIE+HEW+IVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIK
Sbjct: 428 IESRWEELVGEMPLKVCYPAIENHEWKIVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIK 487

Query: 487 TGRPQIARRALELAESRLLKDSWPEYYDGTLGRYIGKQARKFQTWSIAGYLVAKMMLEDP 546
           TGRP IARRA+ELAE+RLLKD+WPEYYDG LG YIGKQARKFQTWSIAGYLVAKMMLEDP
Sbjct: 488 TGRPLIARRAIELAEARLLKDNWPEYYDGKLGCYIGKQARKFQTWSIAGYLVAKMMLEDP 547

Query: 547 SHSGMVSLEEDKQMKPLMKRSHSWT 572
           SH GMVSLEED+QMKP+M+RS+SWT
Sbjct: 548 SHIGMVSLEEDRQMKPVMRRSNSWT 571

BLAST of Csa7G009210 vs. TrEMBL
Match: V4UC11_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10030393mg PE=4 SV=1)

HSP 1 Score: 1005.7 bits (2599), Expect = 2.2e-290
Identity = 488/565 (86.37%), Positives = 521/565 (92.21%), Query Frame = 1

Query: 18  DTLFTVDEIEESEFSKLLDRPRPLNM--ERQRSFDERSLGDLAIGFSPRLSSRVSSE--- 77
           DTL TV E  E +FSKL ++PR LNM  ERQRSFDERSL +L+IGFSPR+ +R +     
Sbjct: 2   DTLCTVAECNECDFSKLSEKPRSLNMDRERQRSFDERSLSELSIGFSPRVMTRSADNANA 61

Query: 78  --NFGRLSDNYDHSP----SPGRKSDFNTPRSHTGFEQHPMVAEAWEALRRSLVYFRGQP 137
             NF RL    DH+P    SPGR+S FNTPRS  G+E HPMV EAW+ALRRSLVYFRG P
Sbjct: 62  NANFSRLV--IDHNPDAPFSPGRRSGFNTPRSLIGYEPHPMVGEAWDALRRSLVYFRGNP 121

Query: 138 VGTIAALDSTEENLNYDQVFVRDFVPSAFAFLMNGEPEIVKNFILKTLRLQSWEKKIDRF 197
           VGTIAALDS+EE LNYDQVFVRDFVPSA AFLMNGEPEIVKNFILKTLRLQSWEKKIDRF
Sbjct: 122 VGTIAALDSSEEELNYDQVFVRDFVPSALAFLMNGEPEIVKNFILKTLRLQSWEKKIDRF 181

Query: 198 QLGEGVMPASFKVLHDPVRNTETLIADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDSS 257
           QLGEGVMPASFKVLHDP+RNTETLIADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDSS
Sbjct: 182 QLGEGVMPASFKVLHDPIRNTETLIADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDSS 241

Query: 258 LAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQALFFMALR 317
           LAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQALFFMALR
Sbjct: 242 LAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQALFFMALR 301

Query: 318 CALILLKQDHEGKDFVERITKRLHAMSYHMRTYFWIDLKQLNDIYRYKTEEYSHTALNKF 377
           CAL+LLKQD EGK+FVERI KRLHA++YHMR+YFW+DLKQLNDIYRYKTEEYSHTA+NKF
Sbjct: 302 CALVLLKQDDEGKEFVERIVKRLHALNYHMRSYFWLDLKQLNDIYRYKTEEYSHTAVNKF 361

Query: 378 NVIPDSLPEWIFDFMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSALATPEQATAIMDL 437
           NVIPDSLPEW+FDFMP RGGYFIGNVSPA+MDFRWF LGNCIAILS+LAT EQ+ AIMDL
Sbjct: 362 NVIPDSLPEWVFDFMPIRGGYFIGNVSPAKMDFRWFALGNCIAILSSLATEEQSNAIMDL 421

Query: 438 IESRWEELVGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIK 497
           IESRWEELVGEMP+KVCYPAIESH+WRI+TGCDPKNTRWSYHNGGSWPVLLWLLTAACIK
Sbjct: 422 IESRWEELVGEMPIKVCYPAIESHDWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAACIK 481

Query: 498 TGRPQIARRALELAESRLLKDSWPEYYDGTLGRYIGKQARKFQTWSIAGYLVAKMMLEDP 557
           TGRPQIARRA+ELAESRLLKDSWPEYYDG LGRYIGKQARKFQTWSIAGYLVAKMMLEDP
Sbjct: 482 TGRPQIARRAIELAESRLLKDSWPEYYDGKLGRYIGKQARKFQTWSIAGYLVAKMMLEDP 541

Query: 558 SHSGMVSLEEDKQMKPLMKRSHSWT 572
           SH GM+SLEEDKQ+KPL++RSHSWT
Sbjct: 542 SHLGMISLEEDKQLKPLLRRSHSWT 564

BLAST of Csa7G009210 vs. TrEMBL
Match: A0A0D2NI51_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_005G252100 PE=4 SV=1)

HSP 1 Score: 1005.0 bits (2597), Expect = 3.7e-290
Identity = 479/568 (84.33%), Positives = 523/568 (92.08%), Query Frame = 1

Query: 5   SSNMPQNGNVKNNDTLFTVDEIEESEFSKLLDRPRPLNMERQRSFDERSLGDLAIGFSPR 64
           + N  QN NVK  D L  + E EE +FSKLL++PR LN++RQRS DERSL +L+IG SPR
Sbjct: 9   NQNQNQNKNVKAEDILCPLAEYEECDFSKLLEKPRLLNIDRQRSLDERSLSELSIGISPR 68

Query: 65  LSSRVSSENFGRLSDNYDHSPSP-GRKSDFNTPRSHTGFEQHPMVAEAWEALRRSLVYFR 124
            ++R    N  R  +  D   SP GR+S F+TPRS  GF+ HPMVAEAWEALRRSLVYFR
Sbjct: 69  HATRAIDPNSYRFFEQLDSICSPVGRRSGFSTPRSQIGFDPHPMVAEAWEALRRSLVYFR 128

Query: 125 GQPVGTIAALDSTEENLNYDQVFVRDFVPSAFAFLMNGEPEIVKNFILKTLRLQSWEKKI 184
           GQPVGTIAALD+TEENLNYDQVFVRDFVPSA AFLMNGEPEIVKNFILKTLRLQSWEKKI
Sbjct: 129 GQPVGTIAALDNTEENLNYDQVFVRDFVPSALAFLMNGEPEIVKNFILKTLRLQSWEKKI 188

Query: 185 DRFQLGEGVMPASFKVLHDPVRNTETLIADFGESAIGRVAPVDSGFWWIILLRAYTKSTG 244
           DRFQLGEGVMPASFKVLHDPVRN ETLIADFGESAIGRVAPVDSGFWWIILLRAYTKSTG
Sbjct: 189 DRFQLGEGVMPASFKVLHDPVRNNETLIADFGESAIGRVAPVDSGFWWIILLRAYTKSTG 248

Query: 245 DSSLAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQALFFM 304
           D+SLAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQALFFM
Sbjct: 249 DTSLAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQALFFM 308

Query: 305 ALRCALILLKQDHEGKDFVERITKRLHAMSYHMRTYFWIDLKQLNDIYRYKTEEYSHTAL 364
           ALRCAL+LLKQD EGK+F+ERI KRLHA+SYHMR+YFW+DLKQLNDIYR+KTEEYSHTA+
Sbjct: 309 ALRCALLLLKQDDEGKEFIERIVKRLHALSYHMRSYFWLDLKQLNDIYRFKTEEYSHTAV 368

Query: 365 NKFNVIPDSLPEWIFDFMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSALATPEQATAI 424
           NKFNV+PDSLPEW+FDFMP  GGYFIGNVSPARMDFRWFCLGNCIAILS+LATPEQ+TAI
Sbjct: 369 NKFNVMPDSLPEWVFDFMPVYGGYFIGNVSPARMDFRWFCLGNCIAILSSLATPEQSTAI 428

Query: 425 MDLIESRWEELVGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLLTAA 484
           MDLIESRWEELVGEMPLKVCYPA+E+HEWRI+TGCDPKNTRWSYHNGGSWPVLLWLLTAA
Sbjct: 429 MDLIESRWEELVGEMPLKVCYPAMETHEWRIITGCDPKNTRWSYHNGGSWPVLLWLLTAA 488

Query: 485 CIKTGRPQIARRALELAESRLLKDSWPEYYDGTLGRYIGKQARKFQTWSIAGYLVAKMML 544
           C+KTGRPQIARRA+E+AE+RLLKD WPEYYDG LGRYIGKQ+RK QTWSIAGYLVAKMML
Sbjct: 489 CVKTGRPQIARRAIEIAEARLLKDHWPEYYDGKLGRYIGKQSRKAQTWSIAGYLVAKMML 548

Query: 545 EDPSHSGMVSLEEDKQMKPLMKRSHSWT 572
           EDPSH GM+++EEDKQMKP+++RS+SWT
Sbjct: 549 EDPSHLGMIAIEEDKQMKPILRRSYSWT 576

BLAST of Csa7G009210 vs. TAIR10
Match: AT4G34860.1 (AT4G34860.1 Plant neutral invertase family protein)

HSP 1 Score: 986.5 bits (2549), Expect = 7.0e-288
Identity = 472/572 (82.52%), Positives = 523/572 (91.43%), Query Frame = 1

Query: 3   NSSSNMPQNGNVKNNDTLFTVDEIEESEFSKLLDRPRPLNMERQRSFDERSLGDLAIGFS 62
           N S ++ QNGN+KN D+L T+D+I++ +F+KLL++PRPLN++R RS DERSL +L    S
Sbjct: 5   NLSVDVNQNGNIKNVDSLSTLDDIDDIDFAKLLEKPRPLNIDRLRSLDERSLTELT--GS 64

Query: 63  PRLSSRVSSENFGRLSDNYDH--SPSPGRKSDFNTPRSHTGFEQHPMVAEAWEALRRSLV 122
           P+L +   ++N  R  D+ D+  SPS GR+S FNTPRS  GFE HPMV EAW+ALRRS+V
Sbjct: 65  PQLRN---ADNASRAPDHADYVISPSFGRRSGFNTPRSQPGFESHPMVGEAWDALRRSMV 124

Query: 123 YFRGQPVGTIAALDSTEENLNYDQVFVRDFVPSAFAFLMNGEPEIVKNFILKTLRLQSWE 182
           YFRGQPVGTIAA+D++EE LNYDQVFVRDFVPSA AFLMNGEP+IVKNF+LKTLRLQSWE
Sbjct: 125 YFRGQPVGTIAAVDNSEEKLNYDQVFVRDFVPSALAFLMNGEPDIVKNFLLKTLRLQSWE 184

Query: 183 KKIDRFQLGEGVMPASFKVLHDPVRNTETLIADFGESAIGRVAPVDSGFWWIILLRAYTK 242
           KKIDRFQLGEGVMPASFKV HDPVRN ETLIADFGESAIGRVAPVDSGFWWIILLRAYTK
Sbjct: 185 KKIDRFQLGEGVMPASFKVFHDPVRNHETLIADFGESAIGRVAPVDSGFWWIILLRAYTK 244

Query: 243 STGDSSLAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQAL 302
           STGDSSLA++PECQKG+RLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQAL
Sbjct: 245 STGDSSLADMPECQKGIRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQAL 304

Query: 303 FFMALRCALILLKQDHEGKDFVERITKRLHAMSYHMRTYFWIDLKQLNDIYRYKTEEYSH 362
           FFMALRCAL+LLK D EGK+ VE+I KRLHA+SYHMR+YFW+DLKQLNDIYRYKTEEYSH
Sbjct: 305 FFMALRCALLLLKHDGEGKEMVEQIVKRLHALSYHMRSYFWLDLKQLNDIYRYKTEEYSH 364

Query: 363 TALNKFNVIPDSLPEWIFDFMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSALATPEQA 422
           TA+NKFNVIPDSLPEW+FDFMP  GG+FIGNVSPARMDFRWF LGNCIAILS+LATPEQ+
Sbjct: 365 TAVNKFNVIPDSLPEWVFDFMPPHGGFFIGNVSPARMDFRWFALGNCIAILSSLATPEQS 424

Query: 423 TAIMDLIESRWEELVGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLL 482
           TAIMDLIESRWEELVGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLL
Sbjct: 425 TAIMDLIESRWEELVGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLL 484

Query: 483 TAACIKTGRPQIARRALELAESRLLKDSWPEYYDGTLGRYIGKQARKFQTWSIAGYLVAK 542
           TAACIKTGRPQIARRA+E+AE+RL KD WPEYYDG +GRY+GKQ+RK QTWS+AGYLVAK
Sbjct: 485 TAACIKTGRPQIARRAIEVAEARLHKDHWPEYYDGKVGRYVGKQSRKNQTWSVAGYLVAK 544

Query: 543 MMLEDPSHSGMVSLEEDKQMKPLMKRSHSWTC 573
           MMLEDPSH GMV LEEDKQMKP+M+RS+SWTC
Sbjct: 545 MMLEDPSHVGMVCLEEDKQMKPVMRRSNSWTC 571

BLAST of Csa7G009210 vs. TAIR10
Match: AT4G09510.1 (AT4G09510.1 cytosolic invertase 2)

HSP 1 Score: 900.6 bits (2326), Expect = 5.0e-262
Identity = 423/553 (76.49%), Positives = 486/553 (87.88%), Query Frame = 1

Query: 22  TVDEIEESEFSKLLDRPRPLNMERQRSFDERSLGDLAIGFSPRLSSRVSSENFGRLSDNY 81
           ++ E+++ + ++ L++PR L +ER+RSFDERS+ +L+ G+              R     
Sbjct: 19  SLSEMDDFDLTRALEKPRQLKIERKRSFDERSMSELSTGYV-------------RQDSIL 78

Query: 82  DHSPSPGRKSDFNTPRS-HTGFEQHPMVAEAWEALRRSLVYFRGQPVGTIAALD-STEEN 141
           + + SPG +S  +TP S    FE HPMVAEAWEALRRS+V+FRGQPVGTIAA D ++EE 
Sbjct: 79  EMAHSPGSRSMVDTPLSVRNSFEPHPMVAEAWEALRRSMVFFRGQPVGTIAAYDHASEEV 138

Query: 142 LNYDQVFVRDFVPSAFAFLMNGEPEIVKNFILKTLRLQSWEKKIDRFQLGEGVMPASFKV 201
           LNYDQVFVRDFVPSA AFLMNGEP+IVKNF+LKTL+LQ WEK++DRF+LGEGVMPASFKV
Sbjct: 139 LNYDQVFVRDFVPSALAFLMNGEPDIVKNFLLKTLQLQGWEKRVDRFKLGEGVMPASFKV 198

Query: 202 LHDPVRNTETLIADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDSSLAELPECQKGMRL 261
           LHDPVR T+T+IADFGESAIGRVAPVDSGFWWIILLRAYTKSTGD +L+E PECQ+GMRL
Sbjct: 199 LHDPVRKTDTIIADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLTLSETPECQRGMRL 258

Query: 262 ILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQALFFMALRCALILLKQDHEGK 321
           ILSLCLSEGFDTFPTLLCADGC M+DRRMGVYGYPIEIQALFFMALRCAL +LK D EG+
Sbjct: 259 ILSLCLSEGFDTFPTLLCADGCSMVDRRMGVYGYPIEIQALFFMALRCALSMLKPDEEGR 318

Query: 322 DFVERITKRLHAMSYHMRTYFWIDLKQLNDIYRYKTEEYSHTALNKFNVIPDSLPEWIFD 381
           DF+ERI KRLHA+S+HMR+YFW+D +QLNDIYRYKTEEYSHTA+NKFNV+PDS+P+W+FD
Sbjct: 319 DFIERIVKRLHALSFHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVMPDSIPDWVFD 378

Query: 382 FMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSALATPEQATAIMDLIESRWEELVGEMP 441
           FMP RGGYF+GNVSPARMDFRWF LGNC++ILS+LATP+Q+ AIMDL+E RWEELVGEMP
Sbjct: 379 FMPLRGGYFVGNVSPARMDFRWFSLGNCVSILSSLATPDQSMAIMDLLEHRWEELVGEMP 438

Query: 442 LKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRALEL 501
           LK+CYP IESHEWRIVTGCDPKNTRWSYHNGGSWPVLLW LTAACIKTGRPQIARRA++L
Sbjct: 439 LKICYPCIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWTLTAACIKTGRPQIARRAIDL 498

Query: 502 AESRLLKDSWPEYYDGTLGRYIGKQARKFQTWSIAGYLVAKMMLEDPSHSGMVSLEEDKQ 561
            ESRL +D WPEYYDG  GRY+GKQARK+QTWSIAGYLVAKMMLEDPSH GM+SLEEDKQ
Sbjct: 499 IESRLHRDCWPEYYDGKQGRYVGKQARKYQTWSIAGYLVAKMMLEDPSHIGMISLEEDKQ 558

Query: 562 MKPLMKRSHSWTC 573
           MKP++KRS SWTC
Sbjct: 559 MKPVIKRSASWTC 558

BLAST of Csa7G009210 vs. TAIR10
Match: AT1G22650.1 (AT1G22650.1 Plant neutral invertase family protein)

HSP 1 Score: 878.2 bits (2268), Expect = 2.7e-255
Identity = 423/545 (77.61%), Positives = 476/545 (87.34%), Query Frame = 1

Query: 30  EFSKLLDRPRPLNMERQRSFDERSLGDLAIGFSPRLSSRVSSENFGRLSDNYDHSPSPGR 89
           E ++LLDRPR +N+ER+RSFDERS  ++ I                     +D+  SPG 
Sbjct: 15  ELARLLDRPR-VNIERKRSFDERSFSEMGI---------------------FDNVNSPG- 74

Query: 90  KSDFNTPRS--HTGFEQHPMVAEAWEALRRSLVYFRGQPVGTIAALD-STEENLNYDQVF 149
              + TP S     FE HPMVAEAW+ALRRSLVYFRGQPVGTIAA D +TEE LNYDQVF
Sbjct: 75  --GWETPVSSARNSFEPHPMVAEAWDALRRSLVYFRGQPVGTIAAYDHATEEVLNYDQVF 134

Query: 150 VRDFVPSAFAFLMNGEPEIVKNFILKTLRLQSWEKKIDRFQLGEGVMPASFKVLHDPVRN 209
           VRDFVPSA AFLMNGEP+IVKNF+LKT+++Q  EK+IDRF+LGEG MPASFKV+HDP++ 
Sbjct: 135 VRDFVPSALAFLMNGEPDIVKNFLLKTIQIQGREKRIDRFKLGEGAMPASFKVIHDPIKE 194

Query: 210 TETLIADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDSSLAELPECQKGMRLILSLCLS 269
           T+++ ADFGESAIGRVAPVDSGFWWIILLRAYTKSTGD+SLAE  ECQKGMRLILSLCLS
Sbjct: 195 TDSINADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDTSLAETSECQKGMRLILSLCLS 254

Query: 270 EGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQALFFMALRCALILLKQDHEGKDFVERIT 329
           EGFDTFPTLLCADGC MIDRRMGVYGYPIEIQALFFMALR A+ +LK D EGK+F+ERI 
Sbjct: 255 EGFDTFPTLLCADGCSMIDRRMGVYGYPIEIQALFFMALRSAMSMLKHDAEGKEFMERIV 314

Query: 330 KRLHAMSYHMRTYFWIDLKQLNDIYRYKTEEYSHTALNKFNVIPDSLPEWIFDFMPTRGG 389
           KRLHA+S+HMR+YFW+D +QLNDIYRYKTEEYSHTA+NKFNVIPDS+PEW+FDFMP RGG
Sbjct: 315 KRLHALSFHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPEWVFDFMPLRGG 374

Query: 390 YFIGNVSPARMDFRWFCLGNCIAILSALATPEQATAIMDLIESRWEELVGEMPLKVCYPA 449
           YFIGNVSPARMDFRWF LGNC+AIL++LATPEQ+ +IMDLIE RWEELVGEMP+K+C+PA
Sbjct: 375 YFIGNVSPARMDFRWFALGNCVAILASLATPEQSASIMDLIEERWEELVGEMPVKICHPA 434

Query: 450 IESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRALELAESRLLK 509
           IESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRA++LAE+RLLK
Sbjct: 435 IESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIDLAEARLLK 494

Query: 510 DSWPEYYDGTLGRYIGKQARKFQTWSIAGYLVAKMMLEDPSHSGMVSLEEDKQMKPLMKR 569
           D WPEYYDG  GR+IGKQARKFQTWSIAGYLVAKM+LEDPSH GM+SLEEDKQ KP++KR
Sbjct: 495 DGWPEYYDGKSGRFIGKQARKFQTWSIAGYLVAKMLLEDPSHLGMISLEEDKQTKPVIKR 534

Query: 570 SHSWT 572
           S+SWT
Sbjct: 555 SYSWT 534

BLAST of Csa7G009210 vs. TAIR10
Match: AT1G35580.1 (AT1G35580.1 cytosolic invertase 1)

HSP 1 Score: 859.0 bits (2218), Expect = 1.7e-249
Identity = 406/552 (73.55%), Positives = 476/552 (86.23%), Query Frame = 1

Query: 22  TVDEIEESEFSKLLDRPRPLNMERQRSFDERSLGDLAIGFSPRLSSRVSSENFGRLSDNY 81
           ++ E+++ + ++ LD+PR L +ER+RSFDERS+ +L+ G+S             R    +
Sbjct: 14  SLSEMDDLDLTRALDKPR-LKIERKRSFDERSMSELSTGYS-------------RHDGIH 73

Query: 82  DHSPSPGRKSDFNTPRS--HTGFEQHPMVAEAWEALRRSLVYFRGQPVGTIAALDST-EE 141
           D   SP  +S  +TP S     FE HPM+AEAWEALRRS+V+FRGQPVGT+AA+D+T +E
Sbjct: 74  D---SPRGRSVLDTPLSSARNSFEPHPMMAEAWEALRRSMVFFRGQPVGTLAAVDNTTDE 133

Query: 142 NLNYDQVFVRDFVPSAFAFLMNGEPEIVKNFILKTLRLQSWEKKIDRFQLGEGVMPASFK 201
            LNYDQVFVRDFVPSA AFLMNGEP+IVK+F+LKTL+LQ WEK++DRF+LGEGVMPASFK
Sbjct: 134 VLNYDQVFVRDFVPSALAFLMNGEPDIVKHFLLKTLQLQGWEKRVDRFKLGEGVMPASFK 193

Query: 202 VLHDPVRNTETLIADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDSSLAELPECQKGMR 261
           VLHDP+R T+ ++ADFGESAIGRVAPVDSGFWWIILLRAYTKSTGD +L+E PECQKGM+
Sbjct: 194 VLHDPIRETDNIVADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLTLSETPECQKGMK 253

Query: 262 LILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQALFFMALRCALILLKQDHEG 321
           LILSLCL+EGFDTFPTLLCADGC MIDRRMGVYGYPIEIQALFFMALR AL +LK D +G
Sbjct: 254 LILSLCLAEGFDTFPTLLCADGCSMIDRRMGVYGYPIEIQALFFMALRSALSMLKPDGDG 313

Query: 322 KDFVERITKRLHAMSYHMRTYFWIDLKQLNDIYRYKTEEYSHTALNKFNVIPDSLPEWIF 381
           ++ +ERI KRLHA+S+HMR YFW+D + LNDIYR+KTEEYSHTA+NKFNV+PDS+PEW+F
Sbjct: 314 REVIERIVKRLHALSFHMRNYFWLDHQNLNDIYRFKTEEYSHTAVNKFNVMPDSIPEWVF 373

Query: 382 DFMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSALATPEQATAIMDLIESRWEELVGEM 441
           DFMP RGGYF+GNV PA MDFRWF LGNC++ILS+LATP+Q+ AIMDL+E RW ELVGEM
Sbjct: 374 DFMPLRGGYFVGNVGPAHMDFRWFALGNCVSILSSLATPDQSMAIMDLLEHRWAELVGEM 433

Query: 442 PLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRALE 501
           PLK+CYP +E HEWRIVTGCDPKNTRWSYHNGGSWPVLLW LTAACIKTGRPQIARRA++
Sbjct: 434 PLKICYPCLEGHEWRIVTGCDPKNTRWSYHNGGSWPVLLWQLTAACIKTGRPQIARRAVD 493

Query: 502 LAESRLLKDSWPEYYDGTLGRYIGKQARKFQTWSIAGYLVAKMMLEDPSHSGMVSLEEDK 561
           L ESRL +D WPEYYDG LGRY+GKQARK+QTWSIAGYLVAKM+LEDPSH GM+SLEEDK
Sbjct: 494 LIESRLHRDCWPEYYDGKLGRYVGKQARKYQTWSIAGYLVAKMLLEDPSHIGMISLEEDK 548

Query: 562 QMKPLMKRSHSW 571
            MKP++KRS SW
Sbjct: 554 LMKPVIKRSASW 548

BLAST of Csa7G009210 vs. TAIR10
Match: AT1G72000.1 (AT1G72000.1 Plant neutral invertase family protein)

HSP 1 Score: 831.6 bits (2147), Expect = 2.9e-241
Identity = 393/494 (79.55%), Positives = 443/494 (89.68%), Query Frame = 1

Query: 81  YDHSPSPGRKSDFNTP--RSHTGFEQHPMVAEAWEALRRSLVYFRGQPVGTIAALD-STE 140
           YD + S   KS ++TP        +++PMV EAWEAL +S VYFRG+PVGTIAA D ++E
Sbjct: 6   YDSAHSLDGKSGWDTPVFSMKDSMDRNPMVTEAWEALCQSQVYFRGKPVGTIAAYDHASE 65

Query: 141 ENLNYDQVFVRDFVPSAFAFLMNGEPEIVKNFILKTLRLQSWEKKIDRFQLGEGVMPASF 200
           E LNYDQVFVRDFVPSA AFLMNGEPEIVKNF+LKTL +Q  +K ID+F+LG+G MPASF
Sbjct: 66  EVLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLHIQGQDKMIDKFKLGDGAMPASF 125

Query: 201 KVLHDPVRNTETLIADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDSSLAELPECQKGM 260
           KVLH+P++ T+T+IADFGESAIGRVAPVDSGFWWIILLRAYTKSTGD SLAE PECQKGM
Sbjct: 126 KVLHNPIKKTDTIIADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDHSLAERPECQKGM 185

Query: 261 RLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQALFFMALRCALILLKQDHE 320
           RLILSLCLSEGFDTFPTLLCADGC M+DRRMG+YGYPIEIQALFFMALR AL +LK D E
Sbjct: 186 RLILSLCLSEGFDTFPTLLCADGCSMVDRRMGIYGYPIEIQALFFMALRSALSMLKHDSE 245

Query: 321 GKDFVERITKRLHAMSYHMRTYFWIDLKQLNDIYRYKTEEYSHTALNKFNVIPDSLPEWI 380
           GK+F+E+I KRLHA+S+HMR+YFW+D +QLNDIYRYKTEEYSHTA+NKFNVIPDS+P+WI
Sbjct: 246 GKEFMEKIVKRLHALSFHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPDWI 305

Query: 381 FDFMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSALATPEQATAIMDLIESRWEELVGE 440
           FDFMP RGGYF+GNVSPARMDFRWF LGNCIAILS+LATPEQ+ AIMDLIE+RWEELVGE
Sbjct: 306 FDFMPLRGGYFVGNVSPARMDFRWFALGNCIAILSSLATPEQSMAIMDLIEARWEELVGE 365

Query: 441 MPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAL 500
           MPLK+CYPA+ESHEW IVTGCDPKNTRWSYHNGGSWPVLLWLLTAA IKTGRPQIARRA+
Sbjct: 366 MPLKICYPAMESHEWGIVTGCDPKNTRWSYHNGGSWPVLLWLLTAASIKTGRPQIARRAI 425

Query: 501 ELAESRLLKDSWPEYYDGTLGRYIGKQARKFQTWSIAGYLVAKMMLEDPSHSGMVSLEED 560
           ELAE+RLLKD WPEYYDG  GR+IGKQARK QTWSIAGYLVAKMM++DP+H GM+S+EE+
Sbjct: 426 ELAEARLLKDGWPEYYDGKSGRFIGKQARKSQTWSIAGYLVAKMMMDDPTHVGMISMEEE 485

Query: 561 KQMKPLMKRSHSWT 572
           K MKP ++RS SWT
Sbjct: 486 KHMKPPLRRSSSWT 499

BLAST of Csa7G009210 vs. NCBI nr
Match: gi|449454175|ref|XP_004144831.1| (PREDICTED: probable alkaline/neutral invertase B [Cucumis sativus])

HSP 1 Score: 1180.6 bits (3053), Expect = 0.0e+00
Identity = 572/572 (100.00%), Positives = 572/572 (100.00%), Query Frame = 1

Query: 1   MSNSSSNMPQNGNVKNNDTLFTVDEIEESEFSKLLDRPRPLNMERQRSFDERSLGDLAIG 60
           MSNSSSNMPQNGNVKNNDTLFTVDEIEESEFSKLLDRPRPLNMERQRSFDERSLGDLAIG
Sbjct: 1   MSNSSSNMPQNGNVKNNDTLFTVDEIEESEFSKLLDRPRPLNMERQRSFDERSLGDLAIG 60

Query: 61  FSPRLSSRVSSENFGRLSDNYDHSPSPGRKSDFNTPRSHTGFEQHPMVAEAWEALRRSLV 120
           FSPRLSSRVSSENFGRLSDNYDHSPSPGRKSDFNTPRSHTGFEQHPMVAEAWEALRRSLV
Sbjct: 61  FSPRLSSRVSSENFGRLSDNYDHSPSPGRKSDFNTPRSHTGFEQHPMVAEAWEALRRSLV 120

Query: 121 YFRGQPVGTIAALDSTEENLNYDQVFVRDFVPSAFAFLMNGEPEIVKNFILKTLRLQSWE 180
           YFRGQPVGTIAALDSTEENLNYDQVFVRDFVPSAFAFLMNGEPEIVKNFILKTLRLQSWE
Sbjct: 121 YFRGQPVGTIAALDSTEENLNYDQVFVRDFVPSAFAFLMNGEPEIVKNFILKTLRLQSWE 180

Query: 181 KKIDRFQLGEGVMPASFKVLHDPVRNTETLIADFGESAIGRVAPVDSGFWWIILLRAYTK 240
           KKIDRFQLGEGVMPASFKVLHDPVRNTETLIADFGESAIGRVAPVDSGFWWIILLRAYTK
Sbjct: 181 KKIDRFQLGEGVMPASFKVLHDPVRNTETLIADFGESAIGRVAPVDSGFWWIILLRAYTK 240

Query: 241 STGDSSLAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQAL 300
           STGDSSLAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQAL
Sbjct: 241 STGDSSLAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQAL 300

Query: 301 FFMALRCALILLKQDHEGKDFVERITKRLHAMSYHMRTYFWIDLKQLNDIYRYKTEEYSH 360
           FFMALRCALILLKQDHEGKDFVERITKRLHAMSYHMRTYFWIDLKQLNDIYRYKTEEYSH
Sbjct: 301 FFMALRCALILLKQDHEGKDFVERITKRLHAMSYHMRTYFWIDLKQLNDIYRYKTEEYSH 360

Query: 361 TALNKFNVIPDSLPEWIFDFMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSALATPEQA 420
           TALNKFNVIPDSLPEWIFDFMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSALATPEQA
Sbjct: 361 TALNKFNVIPDSLPEWIFDFMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSALATPEQA 420

Query: 421 TAIMDLIESRWEELVGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLL 480
           TAIMDLIESRWEELVGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLL
Sbjct: 421 TAIMDLIESRWEELVGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLL 480

Query: 481 TAACIKTGRPQIARRALELAESRLLKDSWPEYYDGTLGRYIGKQARKFQTWSIAGYLVAK 540
           TAACIKTGRPQIARRALELAESRLLKDSWPEYYDGTLGRYIGKQARKFQTWSIAGYLVAK
Sbjct: 481 TAACIKTGRPQIARRALELAESRLLKDSWPEYYDGTLGRYIGKQARKFQTWSIAGYLVAK 540

Query: 541 MMLEDPSHSGMVSLEEDKQMKPLMKRSHSWTC 573
           MMLEDPSHSGMVSLEEDKQMKPLMKRSHSWTC
Sbjct: 541 MMLEDPSHSGMVSLEEDKQMKPLMKRSHSWTC 572

BLAST of Csa7G009210 vs. NCBI nr
Match: gi|659094308|ref|XP_008447991.1| (PREDICTED: alkaline/neutral invertase CINV2 [Cucumis melo])

HSP 1 Score: 1169.5 bits (3024), Expect = 0.0e+00
Identity = 566/572 (98.95%), Positives = 570/572 (99.65%), Query Frame = 1

Query: 1   MSNSSSNMPQNGNVKNNDTLFTVDEIEESEFSKLLDRPRPLNMERQRSFDERSLGDLAIG 60
           MSNSSSNM QNGNVKNNDTLFTVDEIEESEFSKLLDRPR LNMERQRSFDERSLGDLAIG
Sbjct: 1   MSNSSSNMNQNGNVKNNDTLFTVDEIEESEFSKLLDRPRHLNMERQRSFDERSLGDLAIG 60

Query: 61  FSPRLSSRVSSENFGRLSDNYDHSPSPGRKSDFNTPRSHTGFEQHPMVAEAWEALRRSLV 120
           FSPRLS+RVSSENFGRLSDNYDHSPSPGRKSDFNTPRSHTGFEQHPMVAEAWEALRRSLV
Sbjct: 61  FSPRLSTRVSSENFGRLSDNYDHSPSPGRKSDFNTPRSHTGFEQHPMVAEAWEALRRSLV 120

Query: 121 YFRGQPVGTIAALDSTEENLNYDQVFVRDFVPSAFAFLMNGEPEIVKNFILKTLRLQSWE 180
           YFRGQPVGTIAALDSTEENLNYDQVFVRDFVPSAFAFLMNGEPEIVKNFILKTLRLQSWE
Sbjct: 121 YFRGQPVGTIAALDSTEENLNYDQVFVRDFVPSAFAFLMNGEPEIVKNFILKTLRLQSWE 180

Query: 181 KKIDRFQLGEGVMPASFKVLHDPVRNTETLIADFGESAIGRVAPVDSGFWWIILLRAYTK 240
           KKIDRFQLGEGVMPASFKVLHDPVRNTETLIADFGESAIGRVAPVDSGFWWIILLRAYTK
Sbjct: 181 KKIDRFQLGEGVMPASFKVLHDPVRNTETLIADFGESAIGRVAPVDSGFWWIILLRAYTK 240

Query: 241 STGDSSLAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQAL 300
           STGDSSLAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQAL
Sbjct: 241 STGDSSLAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQAL 300

Query: 301 FFMALRCALILLKQDHEGKDFVERITKRLHAMSYHMRTYFWIDLKQLNDIYRYKTEEYSH 360
           FFMALRCAL+LLKQDHEGKDFVERITKRLHAMSYHMRTYFWIDLKQLNDIYRYKTEEYSH
Sbjct: 301 FFMALRCALLLLKQDHEGKDFVERITKRLHAMSYHMRTYFWIDLKQLNDIYRYKTEEYSH 360

Query: 361 TALNKFNVIPDSLPEWIFDFMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSALATPEQA 420
           TALNKFNVIPDSLPEWIFDFMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSALATPEQ+
Sbjct: 361 TALNKFNVIPDSLPEWIFDFMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSALATPEQS 420

Query: 421 TAIMDLIESRWEELVGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLL 480
           TAIMDLIESRWEELVGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLL
Sbjct: 421 TAIMDLIESRWEELVGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLL 480

Query: 481 TAACIKTGRPQIARRALELAESRLLKDSWPEYYDGTLGRYIGKQARKFQTWSIAGYLVAK 540
           TAACIKTGRPQIARRALELAESRLLKD+WPEYYDGTLGRYIGKQARKFQTWSIAGYLVAK
Sbjct: 481 TAACIKTGRPQIARRALELAESRLLKDNWPEYYDGTLGRYIGKQARKFQTWSIAGYLVAK 540

Query: 541 MMLEDPSHSGMVSLEEDKQMKPLMKRSHSWTC 573
           MMLEDPSHSGMVSLEEDKQMKPLMKRSHSWTC
Sbjct: 541 MMLEDPSHSGMVSLEEDKQMKPLMKRSHSWTC 572

BLAST of Csa7G009210 vs. NCBI nr
Match: gi|590684809|ref|XP_007041939.1| (Plant neutral invertase family protein isoform 1 [Theobroma cacao])

HSP 1 Score: 1023.5 bits (2645), Expect = 1.5e-295
Identity = 492/573 (85.86%), Positives = 532/573 (92.84%), Query Frame = 1

Query: 1   MSNSSSNMPQNGNVKNNDTLFTVDEIEESEFSKLLDRP-RPLNMERQRSFDERSLGDLAI 60
           MS  + ++ QNGNVK  DTL T+ E EE +FSKLL++P R LNMERQRS DERSL DL+I
Sbjct: 1   MSTPTVDVNQNGNVKTEDTLCTLAEFEECDFSKLLEKPPRILNMERQRSLDERSLSDLSI 60

Query: 61  GFSPRLSSRVSSENFGRLSDNYDHSPSP-GRKSDFNTPRSHTGFEQHPMVAEAWEALRRS 120
           G SPRLS+R +  N  R+ +  D   SP GR+S FNTPRS TGFE HPMVAEAW+ALRRS
Sbjct: 61  GISPRLSARATDINTSRIFEPLDFICSPVGRRSGFNTPRSQTGFEPHPMVAEAWDALRRS 120

Query: 121 LVYFRGQPVGTIAALDSTEENLNYDQVFVRDFVPSAFAFLMNGEPEIVKNFILKTLRLQS 180
           LVYFRGQPVGTIAALD++EE LNYDQVFVRDFVPS  AFLMNGEPEIVKNFILKTLRLQS
Sbjct: 121 LVYFRGQPVGTIAALDNSEEKLNYDQVFVRDFVPSGLAFLMNGEPEIVKNFILKTLRLQS 180

Query: 181 WEKKIDRFQLGEGVMPASFKVLHDPVRNTETLIADFGESAIGRVAPVDSGFWWIILLRAY 240
           WEKKIDRFQLGEGVMPASFKVLHDPVRN ETL+ADFGESAIGRVAPVDSGFWWIILLRAY
Sbjct: 181 WEKKIDRFQLGEGVMPASFKVLHDPVRNNETLMADFGESAIGRVAPVDSGFWWIILLRAY 240

Query: 241 TKSTGDSSLAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQ 300
           TKSTGD+SLAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQ
Sbjct: 241 TKSTGDTSLAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQ 300

Query: 301 ALFFMALRCALILLKQDHEGKDFVERITKRLHAMSYHMRTYFWIDLKQLNDIYRYKTEEY 360
           ALFFMALRCAL+LLKQD EGK+F+ERI KRLHA+S+HMR+YFW+DLKQLNDIYRYKTEEY
Sbjct: 301 ALFFMALRCALLLLKQDDEGKEFIERIVKRLHALSFHMRSYFWLDLKQLNDIYRYKTEEY 360

Query: 361 SHTALNKFNVIPDSLPEWIFDFMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSALATPE 420
           SHTALNKFNV+PDSLPEWIFDFMP RGGYFIGNVSPARMDFRWFCLGNCIAILS+LATPE
Sbjct: 361 SHTALNKFNVMPDSLPEWIFDFMPVRGGYFIGNVSPARMDFRWFCLGNCIAILSSLATPE 420

Query: 421 QATAIMDLIESRWEELVGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLW 480
           Q+TAIMDLIESRWEELVGEMPLKVCYPAIE+HEWRI TGCDPKNTRWSYHNGGSWPVLLW
Sbjct: 421 QSTAIMDLIESRWEELVGEMPLKVCYPAIENHEWRITTGCDPKNTRWSYHNGGSWPVLLW 480

Query: 481 LLTAACIKTGRPQIARRALELAESRLLKDSWPEYYDGTLGRYIGKQARKFQTWSIAGYLV 540
           LLTAAC+KTGRPQIARRALE+AE+RLLKD+WPEYYDG LGRYIGKQ+RK QTWSIAGYLV
Sbjct: 481 LLTAACVKTGRPQIARRALEIAETRLLKDNWPEYYDGKLGRYIGKQSRKVQTWSIAGYLV 540

Query: 541 AKMMLEDPSHSGMVSLEEDKQMKPLMKRSHSWT 572
           AKM+LEDPSH GM++LEEDKQMKPL++RS+SWT
Sbjct: 541 AKMLLEDPSHLGMIALEEDKQMKPLLRRSNSWT 573

BLAST of Csa7G009210 vs. NCBI nr
Match: gi|595800735|ref|XP_007201719.1| (hypothetical protein PRUPE_ppa003483mg [Prunus persica])

HSP 1 Score: 1020.8 bits (2638), Expect = 9.5e-295
Identity = 490/572 (85.66%), Positives = 530/572 (92.66%), Query Frame = 1

Query: 1   MSNSSSNMPQNGNVKNNDTLFTVDEIEESEFSKLLDRPRPLNMERQRSFDERSLGDLAIG 60
           MS  +S+M QNGN+++ D+L +V EIEE +FSKLLDRP  LNMER+RSFDERSL +L++ 
Sbjct: 1   MSIPNSDMSQNGNIRHVDSLCSVAEIEEIDFSKLLDRPSLLNMERKRSFDERSLSELSVA 60

Query: 61  FSPRLSSRVSSENFGRLSDNYDHSPSPGRKSDFNTPRSHTGFEQHPMVAEAWEALRRSLV 120
            SPR SSR +  +F +  D+ ++  SP R+S   TPRS TGFE HPMVAEAWE LRRSLV
Sbjct: 61  LSPRHSSRNADNSF-KFFDHPEYVFSPSRRSLIGTPRSLTGFEPHPMVAEAWETLRRSLV 120

Query: 121 YFRGQPVGTIAALDSTEENLNYDQVFVRDFVPSAFAFLMNGEPEIVKNFILKTLRLQSWE 180
           +FRGQPVGTIAA D++EE LNYDQVFVRDFVPS  AFLMNGEPEIVKNFILKTLRLQSWE
Sbjct: 121 FFRGQPVGTIAATDTSEEKLNYDQVFVRDFVPSGLAFLMNGEPEIVKNFILKTLRLQSWE 180

Query: 181 KKIDRFQLGEGVMPASFKVLHDPVRNTETLIADFGESAIGRVAPVDSGFWWIILLRAYTK 240
           KKIDRFQLGEGVMPASFKVLHDPVRN+ETLIADFGESAIGRVAPVDSGFWWIILLRAYTK
Sbjct: 181 KKIDRFQLGEGVMPASFKVLHDPVRNSETLIADFGESAIGRVAPVDSGFWWIILLRAYTK 240

Query: 241 STGDSSLAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQAL 300
           STGDSSLAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQAL
Sbjct: 241 STGDSSLAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQAL 300

Query: 301 FFMALRCALILLKQDHEGKDFVERITKRLHAMSYHMRTYFWIDLKQLNDIYRYKTEEYSH 360
           FFMALRCAL+LLK D EGK+FVERI KRLHA+SYHMR+YFW+D KQLNDIYRYKTEEYSH
Sbjct: 301 FFMALRCALLLLKHDDEGKEFVERIVKRLHALSYHMRSYFWLDFKQLNDIYRYKTEEYSH 360

Query: 361 TALNKFNVIPDSLPEWIFDFMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSALATPEQA 420
           TA+NKFNVIPDSLPEW+FDFMPTRGGYFIGN+SPARMDFRWFCLGNCIAILS+LATPEQ+
Sbjct: 361 TAVNKFNVIPDSLPEWVFDFMPTRGGYFIGNISPARMDFRWFCLGNCIAILSSLATPEQS 420

Query: 421 TAIMDLIESRWEELVGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLL 480
            AIMDLIESRWEEL GEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLL
Sbjct: 421 MAIMDLIESRWEELAGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLL 480

Query: 481 TAACIKTGRPQIARRALELAESRLLKDSWPEYYDGTLGRYIGKQARKFQTWSIAGYLVAK 540
           TAACIKTGRPQIARRA+ELAESRLLKD+WPEYYDG LGRYIGKQARKFQTWS+AGYLVAK
Sbjct: 481 TAACIKTGRPQIARRAIELAESRLLKDNWPEYYDGKLGRYIGKQARKFQTWSVAGYLVAK 540

Query: 541 MMLEDPSHSGMVSLEEDKQMKPLMKRSHSWTC 573
           M+LEDPSH GM++LEEDKQMKP MKRS+SWTC
Sbjct: 541 MLLEDPSHLGMIALEEDKQMKPAMKRSNSWTC 571

BLAST of Csa7G009210 vs. NCBI nr
Match: gi|645252036|ref|XP_008231940.1| (PREDICTED: alkaline/neutral invertase CINV2-like [Prunus mume])

HSP 1 Score: 1018.8 bits (2633), Expect = 3.6e-294
Identity = 488/572 (85.31%), Positives = 530/572 (92.66%), Query Frame = 1

Query: 1   MSNSSSNMPQNGNVKNNDTLFTVDEIEESEFSKLLDRPRPLNMERQRSFDERSLGDLAIG 60
           MS  +S+M QNGN+++ D L +V EIEE +FSKLLDRP  LNMER+RSFDERSL +L++ 
Sbjct: 1   MSIPNSDMSQNGNIRHVDALCSVAEIEEIDFSKLLDRPSFLNMERKRSFDERSLSELSVA 60

Query: 61  FSPRLSSRVSSENFGRLSDNYDHSPSPGRKSDFNTPRSHTGFEQHPMVAEAWEALRRSLV 120
            SPR SSR +++N  R  D+ ++  SP R S   TPRS TGFE HPMVAEAWE LRRSLV
Sbjct: 61  LSPRHSSR-NADNSSRFFDHPEYVFSPSRTSFIGTPRSLTGFEPHPMVAEAWETLRRSLV 120

Query: 121 YFRGQPVGTIAALDSTEENLNYDQVFVRDFVPSAFAFLMNGEPEIVKNFILKTLRLQSWE 180
           +FRGQPVGTIAA D++EE LNYDQVFVRDFVPS  AFLMNGEPEIVKNFILKTLRLQSWE
Sbjct: 121 FFRGQPVGTIAATDTSEEKLNYDQVFVRDFVPSGLAFLMNGEPEIVKNFILKTLRLQSWE 180

Query: 181 KKIDRFQLGEGVMPASFKVLHDPVRNTETLIADFGESAIGRVAPVDSGFWWIILLRAYTK 240
           KKIDRF LGEGVMPASFKVLHDPVRN+ETLIADFGESAIGRVAPVDSGFWWIILLRAYTK
Sbjct: 181 KKIDRFHLGEGVMPASFKVLHDPVRNSETLIADFGESAIGRVAPVDSGFWWIILLRAYTK 240

Query: 241 STGDSSLAELPECQKGMRLILSLCLSEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQAL 300
           STGDSSLAELPECQKGMRLILSLCL+EGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQAL
Sbjct: 241 STGDSSLAELPECQKGMRLILSLCLTEGFDTFPTLLCADGCCMIDRRMGVYGYPIEIQAL 300

Query: 301 FFMALRCALILLKQDHEGKDFVERITKRLHAMSYHMRTYFWIDLKQLNDIYRYKTEEYSH 360
           FFMALRCAL+LLKQD EGK+FVERI KRLHA+SYHMR+YFW+D KQLNDIYRYKTEEYSH
Sbjct: 301 FFMALRCALLLLKQDDEGKEFVERIVKRLHALSYHMRSYFWLDFKQLNDIYRYKTEEYSH 360

Query: 361 TALNKFNVIPDSLPEWIFDFMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSALATPEQA 420
           TA+NKFNVIPDSLP+W+FDFMPTRGGYFIGNVSPARMDFRWFCLGNCIAILS+LATPEQ+
Sbjct: 361 TAVNKFNVIPDSLPDWVFDFMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSSLATPEQS 420

Query: 421 TAIMDLIESRWEELVGEMPLKVCYPAIESHEWRIVTGCDPKNTRWSYHNGGSWPVLLWLL 480
            AIMDLIESRWEEL GEMPLKVCYPAIESH+WRIVTGCDPKNTRWSYHNGGSWPVLLWLL
Sbjct: 421 MAIMDLIESRWEELAGEMPLKVCYPAIESHQWRIVTGCDPKNTRWSYHNGGSWPVLLWLL 480

Query: 481 TAACIKTGRPQIARRALELAESRLLKDSWPEYYDGTLGRYIGKQARKFQTWSIAGYLVAK 540
           TAACIKTGRPQIARRA+ELAESRLLKD+WPEYYDG LGRY+GKQARKFQTWS+AGYLVAK
Sbjct: 481 TAACIKTGRPQIARRAIELAESRLLKDNWPEYYDGKLGRYVGKQARKFQTWSVAGYLVAK 540

Query: 541 MMLEDPSHSGMVSLEEDKQMKPLMKRSHSWTC 573
           MMLEDPSH GM++LEED+QMKP+MKRS+SWTC
Sbjct: 541 MMLEDPSHLGMIALEEDRQMKPVMKRSNSWTC 571
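
Each HSP summary line in the alignments above reports identity and positives as a residue count over the alignment length, followed by a percentage. The following is a minimal Python sketch, not part of the database output, that parses such a line and recomputes the percentages from the counts. The function name `parse_hsp_summary` and the returned dictionary keys are illustrative assumptions, and the pattern accepts both the "Positives" spelling and the "Postives" variant that some report renderers emit.

```python
import re

# Pattern for an HSP summary line such as:
#   Identity = 472/572 (82.52%), Positives = 523/572 (91.43%), Query Frame = 1
# "Posi?tives" also matches the misspelled "Postives".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Parse one HSP summary line (hypothetical helper for illustration)."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    id_n, id_len, id_pct, pos_n, pos_len, pos_pct = match.groups()
    return {
        # Raw counts: (matching residues, alignment length).
        "identity": (int(id_n), int(id_len)),
        "positives": (int(pos_n), int(pos_len)),
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * int(id_n) / int(id_len), 2),
        "positives_pct": round(100 * int(pos_n) / int(pos_len), 2),
        # Percentages exactly as printed in the report.
        "reported_identity_pct": float(id_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    # Summary line of the AT4G34860.1 hit from the TAIR10 search.
    line = "Identity = 472/572 (82.52%), Positives = 523/572 (91.43%), Query Frame = 1"
    print(parse_hsp_summary(line))
```

Comparing the recomputed and reported percentages is a cheap consistency check when the alignments are copied out of the page by hand.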

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
INVB_ARATH   1.2e-286  82.52         Probable alkaline/neutral invertase B OS=Arabidopsis thaliana GN=INVB PE=1 SV=1
CINV2_ARATH  8.9e-261  76.49         Alkaline/neutral invertase CINV2 OS=Arabidopsis thaliana GN=CINV2 PE=1 SV=1
CINV1_ORYSJ  3.2e-258  76.28         Cytosolic invertase 1 OS=Oryza sativa subsp. japonica GN=CINV1 PE=1 SV=1
INVD_ARATH   4.7e-254  77.61         Probable alkaline/neutral invertase D OS=Arabidopsis thaliana GN=INVD PE=2 SV=1
CINV1_ARATH  3.0e-248  73.55         Alkaline/neutral invertase CINV1 OS=Arabidopsis thaliana GN=CINV1 PE=1 SV=1

Match Name        E-value   Identity (%)  Description
A0A061DYK5_THECC  1.0e-295  85.86         Plant neutral invertase family protein isoform 1 OS=Theobroma cacao GN=TCM_00669...
M5W112_PRUPE      6.6e-295  85.66         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003483mg PE=4 SV=1
A0A059BCZ8_EUCGR  2.6e-291  85.84         Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_G01751 PE=4 SV=1
V4UC11_9ROSI      2.2e-290  86.37         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10030393mg PE=4 SV=1
A0A0D2NI51_GOSRA  3.7e-290  84.33         Uncharacterized protein OS=Gossypium raimondii GN=B456_005G252100 PE=4 SV=1

Match Name   E-value   Identity (%)  Description
AT4G34860.1  7.0e-288  82.52         Plant neutral invertase family protein
AT4G09510.1  5.0e-262  76.49         cytosolic invertase 2
AT1G22650.1  2.7e-255  77.61         Plant neutral invertase family protein
AT1G35580.1  1.7e-249  73.55         cytosolic invertase 1
AT1G72000.1  2.9e-241  79.55         Plant neutral invertase family protein

Match Name                        E-value   Identity (%)  Description
gi|449454175|ref|XP_004144831.1|  0.0e+00   100.00        PREDICTED: probable alkaline/neutral invertase B [Cucumis sativus]
gi|659094308|ref|XP_008447991.1|  0.0e+00   98.95         PREDICTED: alkaline/neutral invertase CINV2 [Cucumis melo]
gi|590684809|ref|XP_007041939.1|  1.5e-295  85.86         Plant neutral invertase family protein isoform 1 [Theobroma cacao]
gi|595800735|ref|XP_007201719.1|  9.5e-295  85.66         hypothetical protein PRUPE_ppa003483mg [Prunus persica]
gi|645252036|ref|XP_008231940.1|  3.6e-294  85.31         PREDICTED: alkaline/neutral invertase CINV2-like [Prunus mume]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR008928  6-hairpin_glycosidase_sf
IPR024746  Glyco_hydro_100
Vocabulary: Molecular Function
Term        Definition
GO:0003824  catalytic activity
GO:0033926  glycopeptide alpha-N-acetylgalactosaminidase activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005982 starch metabolic process
biological_process GO:0005985 sucrose metabolic process
biological_process GO:0008150 biological_process
cellular_component GO:0009507 chloroplast
cellular_component GO:0005829 cytosol
cellular_component GO:0017177 glucosidase II complex
cellular_component GO:0005575 cellular_component
molecular_function GO:0033926 glycopeptide alpha-N-acetylgalactosaminidase activity
molecular_function GO:0004575 sucrose alpha-glucosidase activity
molecular_function GO:0003824 catalytic activity
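
The GO assignments above are one row per term, with the category, accession, and term name separated by whitespace. As a minimal sketch, assuming the rows are available as plain text exactly as listed, they can be grouped by category in Python. The names `GO_ROWS` and `group_go_terms` are hypothetical and introduced here only for illustration.

```python
from collections import defaultdict

# GO assignment rows copied from the table above:
# category, accession, term name (the name may contain spaces).
GO_ROWS = """\
biological_process GO:0005982 starch metabolic process
biological_process GO:0005985 sucrose metabolic process
biological_process GO:0008150 biological_process
cellular_component GO:0009507 chloroplast
cellular_component GO:0005829 cytosol
cellular_component GO:0017177 glucosidase II complex
cellular_component GO:0005575 cellular_component
molecular_function GO:0033926 glycopeptide alpha-N-acetylgalactosaminidase activity
molecular_function GO:0004575 sucrose alpha-glucosidase activity
molecular_function GO:0003824 catalytic activity
"""


def group_go_terms(text):
    """Group (accession, name) pairs by GO category, preserving row order."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        # Split on the first two whitespace runs only; the rest is the term name.
        category, accession, name = line.split(maxsplit=2)
        grouped[category].append((accession, name))
    return dict(grouped)


if __name__ == "__main__":
    for category, terms in group_go_terms(GO_ROWS).items():
        print(category)
        for accession, name in terms:
            print(f"  {accession}  {name}")
```

Splitting with `maxsplit=2` keeps multi-word term names such as "glucosidase II complex" intact.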
This gene is associated with the following unigenes:
Unigene Name  Analysis Name                        Sequence type in Unigene
CU154156      cucumber EST collection version 3.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Csa7G009210.1  Csa7G009210.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
CU154156      CU154156     transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term   IPR Description                Source   Source Term     Source Description        Alignment
IPR008928  Six-hairpin glycosidase-like   unknown  SSF48208        Six-hairpin glycosidases  coord: 110..353, score: 8.69E-61; coord: 381..539, score: 8.69
IPR024746  Glycosyl hydrolase family 100  PFAM     PF12899         Glyco_hydro_100           coord: 111..546, score: 4.6E
None       No IPR available               PANTHER  PTHR31916       FAMILY NOT NAMED          coord: 1..571, score:
None       No IPR available               PANTHER  PTHR31916:SF15  SUBFAMILY NOT NAMED       coord: 1..571, score:

The following gene(s) are paralogous to this gene:

None