Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
CCCTATTCCACCTTCTTTCCTTCTGTTTTCACTTCTCTCTTCATCCAAACCATGGCCACTGCACCTGGATTGCTGGATCCTGTGCAGTCATCCACCTTTAAGGAGCAGATGAGAAAGAGACATTACAACGACTACTTTGTCTACGGCAGCATTAACTTCAAACATTTGAATGATGACTCAACTAAAATTGACATTCCAATTCCGAATTACATCTCAGCTATTGAGAGCATCATTACCGCTACAAATCGAATTACTGATACCGTTCATCATCGCGTAAAAATCTTTATTATTCTCCGTAAAGTTTGACACTTTCTTTATTTTTATAACGACATAACGAATAATTATTATCATACTCATGCAGGGGAGCGAAGGTTCATTAGATGTTAATGTTGTGATCGAGCCTCCGTTGTGTATTCTTAATCATATATCTAGCGAGGTCATTTTTCAATTTTTTAATAACCCTTTTGAAATCTCTAATCAAAATTTCAAAACATGCATCAAAGGAGATACCTTTTTGTTTCTTACAACTTTGATTTTAATATCATTTTCAGCTTTGGTGCAACCCTCCCGGGATTGGAAGTGCACACGAGAATACACTAACAATCTTCGAAATATTGGCAAATTATCCATGGGAAGCCAAGGCAGCTCTCACTTTGTTAGCCTTTGCAACAGATTATGGAGACTTATGGCATCCCTATCCTTATTCCCATACCGATCCATTGGCTAAATCATTGGCAATTATCAAGCGAGTAGGTATGTTGAAGAAGCACTTGGACTCGCTTCCATACCAACAAGTGCTTCTCAATCCTAACAGTCTCATTCAAATGTGCTTGCAAGCAATCAAACGTATGAATGAGATACGAGAATTCAAAAAATATTATCTCAATGACCCTCTTGAGTTAGCTTCTGCTCTTCGTCAGATTCCATTGGTTACTTATTGGGTTATACACATTATCGTGGCTTCTAGCATTGAGCTATCCAGCTATCTCACCGAAACCAAGTAAGTAATAATTTATCTGTTCACTTTTGTAAATGGTTAGAAATGTTTATTTAATCTGCCGAGTTGGGCTCGGTTTAGACCTGCCTTAAATGAATGAAACGGAGAATCCATGTGCCTTAAATGAATGAAATGGAGAATCCGATTTCTGTTCTTGGTGTGGGGGGAGTAGGAAGAGATTTTTGATCCCAACCTTGTTCAATTCTGGTAGGGGGAAGGGGGTAGGAAGAGATTTCTGATCCCGACCCTGTTCAATTCTGGTAGGGGGAAGGGGAGTGGGGGGGGGGGGGGGGGGGGGGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGAGCAGGAAGAGATTTCTGATCCCGACCCTGTTCAATTCTGGTCTCAATTCTGGTCTCAATTCTGGTCTCATCTTGCTCTTGAAAATAAATGAAGAATTTCGTGAAGATGAAAATTGAAATATGAGTTGTGTAACGACCCAGATGCGCCGCTAGTAGATATTGTCGTTTTTGGGCTTTCCCTTGTGCTTCCCCTCGAGGCTTTAAAAGTTTGCATACCCTTAGGCGATGGTTTCCACACCCTTATAAATGGTGTTTTCTTCTCCTCCCCAACCAATGTGGGACATCACAAGATGAGATGGAGACAGAAAAGACTTCCCCATTTATGCCCTACTCTGTAACATCTCTACAAATAGTTGTGATCTTGCCGTGATCGAAAGAAAATGACAAACCGAGTCGATCGATTTAAGTTTTTATATAGTTTCTTAATTCATTAAAAATGTTAATTTTTTTTTTCAGAAATCAGACACAGAGATATTTGAATGAATTGTCGGAAACTATTGCGTTGGTACTCGCTAGACTTGAAAAGCATCTAGACGCCATCCGAGAACAGCATGGTGGAATATACTTTAAAGCCATCCATGATGTTATTGTTCGTTAATTAACAAGTGACCTGAAACATAACGTTAAATTTGTTGTGTTTCTAATGACAGAGGAAGTTGATCTCTACCGATGGTTGGTTGACCACATTGAGCATTTTCATACTAACATTGCATTAGTTGTTTCCAAGCTACTTAGCGGCAAACCTGAAACCAACCCAATAATTGATGGCTCAACTCAAAAAGAGGTAAAAGTTCTTAAATCAAGTAACACAAATATCAAAGTTGATGAACGAAACTTAAAAATGTATGTTTTTACAGGTTGGTGTTCATGAAAGTTTGTCGGGAAAGAACGTGGTATTGCTCATTTCGAGGTTGGATATCTCCGAGGATGATATCAAAGCTATTCATAATATTTATGATGAATTGAAAACTAGAGACACTAATTATGAGATAGTTTGGATTCCAATTATCCCGGAGCCTTATCATGAAGATGATCGCAAGAGATATGAGTATTTGCGTTCTACAATGAAGTGGTACTCAATCCAGTTTACTACAAGAATATCTGGCATGAGATACATCGAGGAGGAGTGGCAATTTAGAGAAGATCCATTAGTTGTGGTACTCAACCCAAATTCTAAAGTAGAATTCATGAATGCAATTCATCTAATTCGAGTTTGGGAAAATGAAGCTATCCCATTTACACAAGCAAGAACTGAATCTTTACTAAAAAAACATTGGCCCGAGTCAACTCTCCTCAAATTCACTCACCAACCAAGGCTACCAAATTTGGTAGGTATAGTACAATCATTCTCTATTTCTTTTCTTTTCTTTTCTTTTGATATCGTAAATGTTTAAGAACGATTTAGAACTTCGGATGAAAATTAAGACTTCAATAATGAAAATTTGAAGGGTAGTTTTCAAAATTAAGACTTCAATAATGAAAATTTCCAAGTTACTCCTATACCAGCTTAGGTCAACATTTACTCTAAGGGTAGTTTTCAAACATTTATAAGAAGTCAATAAATTTATTTTATTTTATTTTTTTTGATTGTTTAGACAGGTTTAGAGTGAGTTTAAAAGTGTTTAAAATTATTTGAGTTCCAAAGTGTTTAAAATTATTTTTACAAAGACAATCAAAATAACCCTTAAATCTCCTATTTTGAGTTGACCTATGTACTTTTGTCAAACAGATTAAGAGTCAGAAAAGTATTATATTTTATGGAGGAAGGAATCCATCATGGATCCAACAATTTGAAGAAAAAGTAGAAATTTTGAAAAACGATCCTTTGATACTCGATGGGAGTTCATTTGAGATCGTACGCATTGGAAAAGATGCAATAGGAGAGGATGATCCTAAACTCATGGCTCGTTTTTGGAAAGTACAATGGGGTTATTTTATAGTGAAGAGCCAGATAAAAGGTTTAAGTGCAAGCGAGACAAGTGAAGATATTTTAAGATTGATTTCTTACCAAAATGAAGATGGTTGGGCAGTTGTTACGATAGGCTCAGCCCCTTTGTTAGTTGGTAGTGGCATTTTGATTTTGAGATTGTTTGATGATTTCCCAAAATGGAAACAAAATTTGCACTTCAAGGCTTTCCCCAATGCTTTTAGAGACTCCTTCAATGAGCTGGCTATGAAGACTCATGAATGTGATCGAGTTATTTTTCCTGAATTTAGTGGATGGATTCCTATGGTTGTCAACTGTCCCGGATGTCCTCGTTTCATGCAGAAGGACATTACCTTTAAATGTTGTCATGGTGGCTCTCGCATGTGA
mRNA sequence
CCCTATTCCACCTTCTTTCCTTCTGTTTTCACTTCTCTCTTCATCCAAACCATGGCCACTGCACCTGGATTGCTGGATCCTGTGCAGTCATCCACCTTTAAGGAGCAGATGAGAAAGAGACATTACAACGACTACTTTGTCTACGGCAGCATTAACTTCAAACATTTGAATGATGACTCAACTAAAATTGACATTCCAATTCCGAATTACATCTCAGCTATTGAGAGCATCATTACCGCTACAAATCGAATTACTGATACCGTTCATCATCGCGGGAGCGAAGGTTCATTAGATGTTAATGTTGTGATCGAGCCTCCGTTGTGTATTCTTAATCATATATCTAGCGAGCTTTGGTGCAACCCTCCCGGGATTGGAAGTGCACACGAGAATACACTAACAATCTTCGAAATATTGGCAAATTATCCATGGGAAGCCAAGGCAGCTCTCACTTTGTTAGCCTTTGCAACAGATTATGGAGACTTATGGCATCCCTATCCTTATTCCCATACCGATCCATTGGCTAAATCATTGGCAATTATCAAGCGAGTAGGTATGTTGAAGAAGCACTTGGACTCGCTTCCATACCAACAAGTGCTTCTCAATCCTAACAGTCTCATTCAAATGTGCTTGCAAGCAATCAAACGTATGAATGAGATACGAGAATTCAAAAAATATTATCTCAATGACCCTCTTGAGTTAGCTTCTGCTCTTCGTCAGATTCCATTGGTTACTTATTGGGTTATACACATTATCGTGGCTTCTAGCATTGAGCTATCCAGCTATCTCACCGAAACCAAAAATCAGACACAGAGATATTTGAATGAATTGTCGGAAACTATTGCGTTGGTACTCGCTAGACTTGAAAAGCATCTAGACGCCATCCGAGAACAGCATGAGGAAGTTGATCTCTACCGATGGTTGGTTGACCACATTGAGCATTTTCATACTAACATTGCATTAGTTGTTTCCAAGCTACTTAGCGGCAAACCTGAAACCAACCCAATAATTGATGGCTCAACTCAAAAAGAGGTTGGTGTTCATGAAAGTTTGTCGGGAAAGAACGTGGTATTGCTCATTTCGAGGTTGGATATCTCCGAGGATGATATCAAAGCTATTCATAATATTTATGATGAATTGAAAACTAGAGACACTAATTATGAGATAGTTTGGATTCCAATTATCCCGGAGCCTTATCATGAAGATGATCGCAAGAGATATGAGTATTTGCGTTCTACAATGAAGTGGTACTCAATCCAGTTTACTACAAGAATATCTGGCATGAGATACATCGAGGAGGAGTGGCAATTTAGAGAAGATCCATTAGTTGTGATTAAGAGTCAGAAAAGTATTATATTTTATGGAGGAAGGAATCCATCATGGATCCAACAATTTGAAGAAAAAGTAGAAATTTTGAAAAACGATCCTTTGATACTCGATGGGAGTTCATTTGAGATCGTACGCATTGGAAAAGATGCAATAGGAGAGGATGATCCTAAACTCATGGCTCGTTTTTGGAAAGTACAATGGGGTTATTTTATAGTGAAGAGCCAGATAAAAGGTTTAAGTGCAAGCGAGACAAGTGAAGATATTTTAAGATTGATTTCTTACCAAAATGAAGATGGTTGGGCAGTTGTTACGATAGGCTCAGCCCCTTTGTTAGTTGGTAGTGGCATTTTGATTTTGAGATTGTTTGATGATTTCCCAAAATGGAAACAAAATTTGCACTTCAAGGCTTTCCCCAATGCTTTTAGAGACTCCTTCAATGAGCTGGCTATGAAGACTCATGAATGTGATCGAGTTATTTTTCCTGAATTTAGTGGATGGATTCCTATGGTTGTCAACTGTCCCGGATGTCCTCGTTTCATGCAGAAGGACATTACCTTTAAATGTTGTCATGGTGGCTCTCGCATGTGA
Coding sequence (CDS)
ATGGCCACTGCACCTGGATTGCTGGATCCTGTGCAGTCATCCACCTTTAAGGAGCAGATGAGAAAGAGACATTACAACGACTACTTTGTCTACGGCAGCATTAACTTCAAACATTTGAATGATGACTCAACTAAAATTGACATTCCAATTCCGAATTACATCTCAGCTATTGAGAGCATCATTACCGCTACAAATCGAATTACTGATACCGTTCATCATCGCGGGAGCGAAGGTTCATTAGATGTTAATGTTGTGATCGAGCCTCCGTTGTGTATTCTTAATCATATATCTAGCGAGCTTTGGTGCAACCCTCCCGGGATTGGAAGTGCACACGAGAATACACTAACAATCTTCGAAATATTGGCAAATTATCCATGGGAAGCCAAGGCAGCTCTCACTTTGTTAGCCTTTGCAACAGATTATGGAGACTTATGGCATCCCTATCCTTATTCCCATACCGATCCATTGGCTAAATCATTGGCAATTATCAAGCGAGTAGGTATGTTGAAGAAGCACTTGGACTCGCTTCCATACCAACAAGTGCTTCTCAATCCTAACAGTCTCATTCAAATGTGCTTGCAAGCAATCAAACGTATGAATGAGATACGAGAATTCAAAAAATATTATCTCAATGACCCTCTTGAGTTAGCTTCTGCTCTTCGTCAGATTCCATTGGTTACTTATTGGGTTATACACATTATCGTGGCTTCTAGCATTGAGCTATCCAGCTATCTCACCGAAACCAAAAATCAGACACAGAGATATTTGAATGAATTGTCGGAAACTATTGCGTTGGTACTCGCTAGACTTGAAAAGCATCTAGACGCCATCCGAGAACAGCATGAGGAAGTTGATCTCTACCGATGGTTGGTTGACCACATTGAGCATTTTCATACTAACATTGCATTAGTTGTTTCCAAGCTACTTAGCGGCAAACCTGAAACCAACCCAATAATTGATGGCTCAACTCAAAAAGAGGTTGGTGTTCATGAAAGTTTGTCGGGAAAGAACGTGGTATTGCTCATTTCGAGGTTGGATATCTCCGAGGATGATATCAAAGCTATTCATAATATTTATGATGAATTGAAAACTAGAGACACTAATTATGAGATAGTTTGGATTCCAATTATCCCGGAGCCTTATCATGAAGATGATCGCAAGAGATATGAGTATTTGCGTTCTACAATGAAGTGGTACTCAATCCAGTTTACTACAAGAATATCTGGCATGAGATACATCGAGGAGGAGTGGCAATTTAGAGAAGATCCATTAGTTGTGATTAAGAGTCAGAAAAGTATTATATTTTATGGAGGAAGGAATCCATCATGGATCCAACAATTTGAAGAAAAAGTAGAAATTTTGAAAAACGATCCTTTGATACTCGATGGGAGTTCATTTGAGATCGTACGCATTGGAAAAGATGCAATAGGAGAGGATGATCCTAAACTCATGGCTCGTTTTTGGAAAGTACAATGGGGTTATTTTATAGTGAAGAGCCAGATAAAAGGTTTAAGTGCAAGCGAGACAAGTGAAGATATTTTAAGATTGATTTCTTACCAAAATGAAGATGGTTGGGCAGTTGTTACGATAGGCTCAGCCCCTTTGTTAGTTGGTAGTGGCATTTTGATTTTGAGATTGTTTGATGATTTCCCAAAATGGAAACAAAATTTGCACTTCAAGGCTTTCCCCAATGCTTTTAGAGACTCCTTCAATGAGCTGGCTATGAAGACTCATGAATGTGATCGAGTTATTTTTCCTGAATTTAGTGGATGGATTCCTATGGTTGTCAACTGTCCCGGATGTCCTCGTTTCATGCAGAAGGACATTACCTTTAAATGTTGTCATGGTGGCTCTCGCATGTGA
Protein sequence
MATAPGLLDPVQSSTFKEQMRKRHYNDYFVYGSINFKHLNDDSTKIDIPIPNYISAIESIITATNRITDTVHHRGSEGSLDVNVVIEPPLCILNHISSELWCNPPGIGSAHENTLTIFEILANYPWEAKAALTLLAFATDYGDLWHPYPYSHTDPLAKSLAIIKRVGMLKKHLDSLPYQQVLLNPNSLIQMCLQAIKRMNEIREFKKYYLNDPLELASALRQIPLVTYWVIHIIVASSIELSSYLTETKNQTQRYLNELSETIALVLARLEKHLDAIREQHEEVDLYRWLVDHIEHFHTNIALVVSKLLSGKPETNPIIDGSTQKEVGVHESLSGKNVVLLISRLDISEDDIKAIHNIYDELKTRDTNYEIVWIPIIPEPYHEDDRKRYEYLRSTMKWYSIQFTTRISGMRYIEEEWQFREDPLVVIKSQKSIIFYGGRNPSWIQQFEEKVEILKNDPLILDGSSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIVKSQIKGLSASETSEDILRLISYQNEDGWAVVTIGSAPLLVGSGILILRLFDDFPKWKQNLHFKAFPNAFRDSFNELAMKTHECDRVIFPEFSGWIPMVVNCPGCPRFMQKDITFKCCHGGSRM
Homology
BLAST of CmoCh17G001820 vs. ExPASy Swiss-Prot
Match:
Q9SS87 (Protein SIEVE ELEMENT OCCLUSION B OS=Arabidopsis thaliana OX=3702 GN=SEOB PE=1 SV=1)
HSP 1 Score: 127.9 bits (320), Expect = 4.1e-28
Identity = 97/343 (28.28%), Postives = 168/343 (48.98%), Query Frame = 0
Query: 110 AHENTLTIFEILANYPWEAKAALTLLAFATDYGDLWHPYPYSHTDPLAKSLAIIKRVGML 169
+HE T+++FE L+++ W+ K LTL AFA +YG+ W + + LAKSLA++K V +
Sbjct: 127 SHEITMSVFEHLSSFQWDGKLVLTLAAFALNYGEFWLLVQFYSKNQLAKSLAMLKLVPVQ 186
Query: 170 KKHLDSLPYQQVLLNPNSLIQMCLQAIKRMNEIREFKKYYLN-DPLELASALRQIPLVTY 229
+ + + V N LI+ + E+ E Y+ D +L+ L IP+ Y
Sbjct: 187 NR----VTLESVSQGLNDLIREMKSVTACVVELSELPDRYITPDVPQLSRILSTIPIAVY 246
Query: 230 WVIHIIVA--SSIELSSYLTETKNQTQRYLNELSETIALVLARLEKHL-DAIR------E 289
W I ++A S I + + + TQ L E S +A L + HL + +R E
Sbjct: 247 WTIRSVIACISQINMITAMGHEMMNTQMDLWETS-MLANKLKNIHDHLAETLRLCYRHIE 306
Query: 290 QHEEVDLYRWLVDHIEHFHTNIALVVSKLLSGKPETNPIIDGSTQKEVGVHESLSGKNVV 349
+ + + L + H + +++ L+ KP P+ DG T+++V + + L K V+
Sbjct: 307 KQRSSESLKVLHSLFDTTHIDNMKILTALVHPKPHITPLQDGLTKRKVHL-DVLRRKTVL 366
Query: 350 LLISRLDISEDDIKAIHNIYDELKTR--------DTNYEIVWIPIIPEPYHEDDR----- 409
LLIS L+I +D++ IY E + YE+VW+P++ +P + +R
Sbjct: 367 LLISDLNILQDELSIFEQIYTESRRNLVGVDGKSHMPYEVVWVPVV-DPIEDFERSPILQ 426
Query: 410 KRYEYLRSTMKWYSIQFTTRISG--MRYIEEEWQFREDPLVVI 428
K++E LR M WYS+ I + ++ W F P++V+
Sbjct: 427 KKFEDLRDPMPWYSVDSPKLIERHVVEFMRGRWHFMNKPILVV 462
BLAST of CmoCh17G001820 vs. ExPASy Swiss-Prot
Match:
Q93XX2 (Protein SIEVE ELEMENT OCCLUSION A OS=Arabidopsis thaliana OX=3702 GN=SEOA PE=1 SV=1)
HSP 1 Score: 87.4 bits (215), Expect = 6.1e-16
Identity = 84/351 (23.93%), Postives = 163/351 (46.44%), Query Frame = 0
Query: 107 IGSAHENTLTIFEILANYPWEAKAALTLLAFATDYGDLWHPYPYSHTDPLAKSLAIIKRV 166
+ S + T ++ +++ Y W+AK L L A A YG T+ L KSLA+IK++
Sbjct: 231 LDSRNTTTFSVLSLVSKYRWDAKLVLVLSALAVKYGVFLLLAETHATNQLTKSLALIKQL 290
Query: 167 GMLKKHLDSLPYQQVLLNPNSLIQMCLQAIKRMNEIREFKKYYLNDPLELASALRQIPLV 226
+ ++L Q L L+Q + + +I + ++ A+ IP
Sbjct: 291 PSIFSRQNAL--HQRLDKTRILMQDMVDLTTTIIDIYQLPPNHIT-----AAFTDHIPTA 350
Query: 227 TYWVIHIIVASSIELSSYLTETKNQTQRY-----LNELSETIALVLARLEKHLDAIREQH 286
YW++ ++ +S ++Q + ++E SE + + A L + +
Sbjct: 351 VYWIVRCVLICVSHISGASGFKQDQIMSFMEVSEIHENSERLRKINAYLLEQFKKSKMTI 410
Query: 287 EEVDLYRWLVDHIEHFHTNIAL-VVSKLLSGKPETNPIIDGS--TQKEVGVHESLSGKNV 346
EE + + I+ F T I + VV LL + + G+ +++ VG++ L+ K+V
Sbjct: 411 EEGIIEEEYQELIQTFTTIIHVDVVPPLLRLLRPIDFLYHGAGVSKRRVGIN-VLTQKHV 470
Query: 347 VLLISRLDISEDDIKAIHNIYDELKTRDTNYEIVWIPIIPEPYHEDDRKRYEYLRSTMKW 406
+LLIS L+ E ++ + ++Y E ++EI+W+P + + + E D ++E L M+W
Sbjct: 471 LLLISDLENIEKELYILESLYTE--AWQQSFEILWVP-VQDFWTEADDAKFEALHMNMRW 530
Query: 407 YSIQFTTRI--SGMRYIEEEWQFREDPLVVIKSQKSIIFYGGRNPS-WIQQ 447
Y + ++ + +R++ E W F+ P++V K + P WI Q
Sbjct: 531 YVLGEPRKLRRAAIRFVREWWGFKNRPILVALDPKGQVMSTNAFPMVWIWQ 570
BLAST of CmoCh17G001820 vs. ExPASy Swiss-Prot
Match:
Q9FXE2 (Protein SIEVE ELEMENT OCCLUSION C OS=Arabidopsis thaliana OX=3702 GN=SEOC PE=4 SV=2)
HSP 1 Score: 80.1 bits (196), Expect = 9.7e-14
Identity = 88/363 (24.24%), Postives = 161/363 (44.35%), Query Frame = 0
Query: 96 ISSELWCNPPGIGSAHENTLTIFEILANYPWEAKAALTLLAFATDYGDLWHPYPYSHTDP 155
IS ++ C G + T+ +F++L Y W+AKA L L A YG L P + DP
Sbjct: 80 ISVQMLCPCTGENEIRKRTMVLFDLLKEYRWDAKAVLVLGVLAATYGGLLLPVHLAICDP 139
Query: 156 LAKSLAIIKRVGMLKKHLDSLPYQQVLLNP-----NSLIQMCLQAIKRMNEIRE--FKKY 215
+A S+A L+ LP ++ P N LI+ + K + + + FK+
Sbjct: 140 VAASIA----------KLNQLPIERTKFRPWLESLNLLIKAMVDVTKCIIKFEKIPFKQA 199
Query: 216 YLNDPLELASALRQIPLVTYWVIHIIVASSIELSSY------------LTETKNQTQRYL 275
L++ + L L I L TY V+ + ++ + E +++R
Sbjct: 200 KLDNNI-LGETLSNIYLTTYRVVKSALTCMQQIPYFKQTQQAKKSRKTAAELSIESRRAA 259
Query: 276 NELSE---TIALVLARLEKHLDAIREQHEEVDLYRWLVDHIEHFHTNIALVVSKLLSGKP 335
ELS + + RL K ++ Q EE R +IE N V LL
Sbjct: 260 GELSSLGYQLLNIHTRLNKQVEDCSTQIEEEINQRLRNINIETHQDN--QDVLHLLFSLQ 319
Query: 336 ETNPIIDGSTQKEVGVHESLSGKNVVLLISRLDISEDDIKAIHNIYDELKTRDT--NYEI 395
+ P+ S Q + + K +LL+S+ + E + +YD +T NYEI
Sbjct: 320 DDLPLQQYSRQISI---TEVQDKVTLLLLSKPPV-EPLFFLLQQLYDHPSNTNTEQNYEI 379
Query: 396 VWIPI-IPEPYHEDDRKRYEYLRSTMKWYSIQFTTRISG--MRYIEEEWQFREDP--LVV 430
+W+PI + + +++++ +++ +++ W S++ +S + + ++EW ++++ LVV
Sbjct: 380 IWVPIPSSQKWTDEEKEIFDFYSNSLPWISVRQPWLMSSTILNFFKQEWHYKDNEAMLVV 425
BLAST of CmoCh17G001820 vs. ExPASy TrEMBL
Match:
A0A6J1H3S9 (protein SIEVE ELEMENT OCCLUSION B-like OS=Cucurbita moschata OX=3662 GN=LOC111460158 PE=4 SV=1)
HSP 1 Score: 1248.0 bits (3228), Expect = 0.0e+00
Identity = 620/677 (91.58%), Postives = 620/677 (91.58%), Query Frame = 0
Query: 1 MATAPGLLDPVQSSTFKEQMRKRHYNDYFVYGSINFKHLNDDSTKIDIPIPNYISAIESI 60
MATAPGLLDPVQSSTFKEQMRKRHYNDYFVYGSINFKHLNDDSTKIDIPIPNYISAIESI
Sbjct: 1 MATAPGLLDPVQSSTFKEQMRKRHYNDYFVYGSINFKHLNDDSTKIDIPIPNYISAIESI 60
Query: 61 ITATNRITDTVHHRGSEGSLDVNVVIEPPLCILNHISSELWCNPPGIGSAHENTLTIFEI 120
ITATNRITDTVHHRGSEGSLDVNVVIEPPLCILNHISSELWCNPPGIGSAHENTLTIFEI
Sbjct: 61 ITATNRITDTVHHRGSEGSLDVNVVIEPPLCILNHISSELWCNPPGIGSAHENTLTIFEI 120
Query: 121 LANYPWEAKAALTLLAFATDYGDLWHPYPYSHTDPLAKSLAIIKRVGMLKKHLDSLPYQQ 180
LANYPWEAKAALTLLAFATDYGDLWHPYPYSHTDPLAKSLAIIKRVGMLKKHLDSLPYQQ
Sbjct: 121 LANYPWEAKAALTLLAFATDYGDLWHPYPYSHTDPLAKSLAIIKRVGMLKKHLDSLPYQQ 180
Query: 181 VLLNPNSLIQMCLQAIKRMNEIREFKKYYLNDPLELASALRQIPLVTYWVIHIIVASSIE 240
VLLNPNSLIQMCLQAIKRMNEIREFKKYYLNDPLELASALRQIPLVTYWVIHIIVASSIE
Sbjct: 181 VLLNPNSLIQMCLQAIKRMNEIREFKKYYLNDPLELASALRQIPLVTYWVIHIIVASSIE 240
Query: 241 LSSYLTETKNQTQRYLNELSETIALVLARLEKHLDAIREQHEEVDLYRWLVDHIEHFHTN 300
LSSYLTETKNQTQRYLNELSETIALVLARLEKHLDAIREQHEEVDLYRWLVDHIEHFHTN
Sbjct: 241 LSSYLTETKNQTQRYLNELSETIALVLARLEKHLDAIREQHEEVDLYRWLVDHIEHFHTN 300
Query: 301 IALVVSKLLSGKPETNPIIDGSTQKEVGVHESLSGKNVVLLISRLDISEDDIKAIHNIYD 360
IALVVSKLLSGKPETNPIIDGSTQKEVGVHESLSGKNVVLLISRLDISEDDIKAIHNIYD
Sbjct: 301 IALVVSKLLSGKPETNPIIDGSTQKEVGVHESLSGKNVVLLISRLDISEDDIKAIHNIYD 360
Query: 361 ELKTRDTNYEIVWIPIIPEPYHEDDRKRYEYLRSTMKWYSIQFTTRISGMRYIEEEWQFR 420
ELKTRDTNYEIVWIPIIPEPYHEDDRKRYEYLRSTMKWYSIQFTTRISGMRYIEEEWQFR
Sbjct: 361 ELKTRDTNYEIVWIPIIPEPYHEDDRKRYEYLRSTMKWYSIQFTTRISGMRYIEEEWQFR 420
Query: 421 EDPLVV------------------------------------------------------ 480
EDPLVV
Sbjct: 421 EDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFTQARTESLLKKHWPESTLLKFTHQPRL 480
Query: 481 ---IKSQKSIIFYGGRNPSWIQQFEEKVEILKNDPLILDGSSFEIVRIGKDAIGEDDPKL 540
IKSQKSIIFYGGRNPSWIQQFEEKVEILKNDPLILDGSSFEIVRIGKDAIGEDDPKL
Sbjct: 481 PNLIKSQKSIIFYGGRNPSWIQQFEEKVEILKNDPLILDGSSFEIVRIGKDAIGEDDPKL 540
Query: 541 MARFWKVQWGYFIVKSQIKGLSASETSEDILRLISYQNEDGWAVVTIGSAPLLVGSGILI 600
MARFWKVQWGYFIVKSQIKGLSASETSEDILRLISYQNEDGWAVVTIGSAPLLVGSGILI
Sbjct: 541 MARFWKVQWGYFIVKSQIKGLSASETSEDILRLISYQNEDGWAVVTIGSAPLLVGSGILI 600
Query: 601 LRLFDDFPKWKQNLHFKAFPNAFRDSFNELAMKTHECDRVIFPEFSGWIPMVVNCPGCPR 621
LRLFDDFPKWKQNLHFKAFPNAFRDSFNELAMKTHECDRVIFPEFSGWIPMVVNCPGCPR
Sbjct: 601 LRLFDDFPKWKQNLHFKAFPNAFRDSFNELAMKTHECDRVIFPEFSGWIPMVVNCPGCPR 660
BLAST of CmoCh17G001820 vs. ExPASy TrEMBL
Match:
A0A6J1H6V1 (protein SIEVE ELEMENT OCCLUSION B-like OS=Cucurbita moschata OX=3662 GN=LOC111460156 PE=4 SV=1)
HSP 1 Score: 953.0 bits (2462), Expect = 6.3e-274
Identity = 483/682 (70.82%), Postives = 533/682 (78.15%), Query Frame = 0
Query: 3 TAPGLLDPVQSSTFKEQMRKRHYNDYFVYGSINFKHLNDDSTKIDIPIPNYISAIESIIT 62
T+ GLL P QSST KE+M RHY+D V G I +H +DDSTKID +PNYIS IESIIT
Sbjct: 10 TSHGLLHPKQSSTSKEEMSVRHYSDKLVTGHIYAQHRDDDSTKID--LPNYISIIESIIT 69
Query: 63 ATNRITDTVHHRGSEG-------SLDVNVVIEPPLCILNHISSELWCNPPGIGSAHENTL 122
+RIT+TV HRGSEG SL NVVIEPPLC L+HISSEL C PG+ AHE TL
Sbjct: 70 TADRITETV-HRGSEGRVVYSDDSLASNVVIEPPLCTLHHISSELSCKAPGVEKAHETTL 129
Query: 123 TIFEILANYPWEAKAALTLLAFATDYGDLWHPYPYSHTDPLAKSLAIIKRVGMLKKHLDS 182
IFEILANYPWEAKAALTLLAFATDYGDLWH Y YSHTDPLAKSLAIIKRV LKKHLDS
Sbjct: 130 KIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDS 189
Query: 183 LPYQQVLLNPNSLIQMCLQAIKRMNEIREFKKYYLNDPLELASALRQIPLVTYWVIHIIV 242
L Y+QVLLNP SLIQ CLQAIK M+EIREF KY + + L +ALR IPL TYWVIH IV
Sbjct: 190 LRYRQVLLNPKSLIQSCLQAIKYMDEIREFSKYDVKELTALPAALRLIPLFTYWVIHTIV 249
Query: 243 ASSIELSSYLTETKNQTQRYLNELSETIALVLARLEKHLDAIREQHEEVDLYRWLVDHIE 302
AS IELSSYL+ET+NQ Q YLNELS+ I VL +E+HL AIR+ EEVDLYRWLVDHIE
Sbjct: 250 ASRIELSSYLSETENQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLYRWLVDHIE 309
Query: 303 HFHTNIALVVSKLLSGKPETNPIIDGSTQKEVGVHESLSGKNVVLLISRLDISEDDIKAI 362
H+HT I LV+SKL+ G+PETNPIIDGSTQKEVG+HESLS KNV+LLIS LDISEDDI+A+
Sbjct: 310 HYHTGIPLVISKLVDGRPETNPIIDGSTQKEVGIHESLSEKNVILLISGLDISEDDIRAL 369
Query: 363 HNIYDELKTRDTNYEIVWIPIIPEPYHEDDRKRYEYLRSTMKWYSIQFTTRISGMRYIEE 422
H IYDELK R+ NYEIVWIPIIPEPYHEDDRK YEYLRSTMKW SIQFTT+ISGMRYIEE
Sbjct: 370 HKIYDELKARNANYEIVWIPIIPEPYHEDDRKMYEYLRSTMKWLSIQFTTKISGMRYIEE 429
Query: 423 EWQFREDPLVV------------------------------------------------- 482
+WQFREDPLVV
Sbjct: 430 KWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFTQARTESLLKKHWPESTLLKFT 489
Query: 483 --------IKSQKSIIFYGGRNPSWIQQFEEKVEILKNDPLILDGSSFEIVRIGKDAIGE 542
IKSQKSIIFYGG+N +WIQQFEEKVE+LK+DPLI+DG SFEIVRIGKDAIGE
Sbjct: 490 HQPRLPNWIKSQKSIIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGE 549
Query: 543 DDPKLMARFWKVQWGYFIVKSQIKGLSASETSEDILRLISYQNEDGWAVVTIGSAPLLVG 602
DDPKLMARFWKVQWGYFIVKSQIKG SASET+EDILRLISYQNEDGWAV+T+GSAP+L+G
Sbjct: 550 DDPKLMARFWKVQWGYFIVKSQIKGSSASETTEDILRLISYQNEDGWAVLTVGSAPVLIG 609
Query: 603 SGILILRLFDDFPKWKQNLHFKAFPNAFRDSFNELAMKTHECDRVIFPEFSGWIPMVVNC 621
+LILRL DDFPKWKQ L KAFP+AFR+ FN+LAMKTH+CDRVI P FSGWIPMVVNC
Sbjct: 610 RDVLILRLIDDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPMVVNC 669
BLAST of CmoCh17G001820 vs. ExPASy TrEMBL
Match:
A0A6J1L2X0 (protein SIEVE ELEMENT OCCLUSION B-like OS=Cucurbita maxima OX=3661 GN=LOC111499361 PE=4 SV=1)
HSP 1 Score: 934.9 bits (2415), Expect = 1.8e-268
Identity = 476/682 (69.79%), Postives = 525/682 (76.98%), Query Frame = 0
Query: 3 TAPGLLDPVQSSTFKEQMRKRHYNDYFVYGSINFKHLNDDSTKIDIPIPNYISAIESIIT 62
TAPGLL P QSST KE++ RHY+D V G I KH +DDSTKID +PNYIS IESIIT
Sbjct: 10 TAPGLLHPKQSSTSKEELSLRHYSDELVTGHIYAKHSDDDSTKID--LPNYISVIESIIT 69
Query: 63 ATNRITDTVHHRGSEG-------SLDVNVVIEPPLCILNHISSELWCNPPGIGSAHENTL 122
+RIT+TV HRGSEG SL NVVIEPPLC L+ ISSEL C PGI AHE TL
Sbjct: 70 TADRITETV-HRGSEGRLVYSDDSLASNVVIEPPLCTLHRISSELSCKAPGIEKAHETTL 129
Query: 123 TIFEILANYPWEAKAALTLLAFATDYGDLWHPYPYSHTDPLAKSLAIIKRVGMLKKHLDS 182
IFEILANYPWEAKAALTLLAFA DYGDLWH Y YSHTDPLAKSLA+IKRV LKKHLDS
Sbjct: 130 KIFEILANYPWEAKAALTLLAFAADYGDLWHLYHYSHTDPLAKSLAVIKRVATLKKHLDS 189
Query: 183 LPYQQVLLNPNSLIQMCLQAIKRMNEIREFKKYYLNDPLELASALRQIPLVTYWVIHIIV 242
L Y+QVLLNP SLIQ CLQAIK MNEIREF KY + + EL +ALR IPL TYW+IH IV
Sbjct: 190 LRYRQVLLNPKSLIQSCLQAIKYMNEIREFSKYDVKELPELPAALRLIPLFTYWIIHTIV 249
Query: 243 ASSIELSSYLTETKNQTQRYLNELSETIALVLARLEKHLDAIREQHEEVDLYRWLVDHIE 302
AS IELSSYL+ET+NQ Q YLNELS+ IA VLA +E+HL+AIR Q +EVDLYRWLVDH+E
Sbjct: 250 ASRIELSSYLSETENQPQLYLNELSDKIARVLAEIERHLEAIRVQQDEVDLYRWLVDHVE 309
Query: 303 HFHTNIALVVSKLLSGKPETNPIIDGSTQKEVGVHESLSGKNVVLLISRLDISEDDIKAI 362
H+HT I LVVSKL+SG+PETNPIIDGSTQKEVGVHESL KNV+LLIS LDI EDDI+A+
Sbjct: 310 HYHTGIPLVVSKLISGRPETNPIIDGSTQKEVGVHESLLEKNVILLISDLDILEDDIRAL 369
Query: 363 HNIYDELKTRDTNYEIVWIPIIPEPYHEDDRKRYEYLRSTMKWYSIQFTTRISGMRYIEE 422
H IYDELK RD NYEIVWIPI PEPYHEDD +RYEYLRS+MKWYSIQFTT+ISGMRYIEE
Sbjct: 370 HKIYDELKARDANYEIVWIPIFPEPYHEDDLRRYEYLRSSMKWYSIQFTTKISGMRYIEE 429
Query: 423 EWQFREDPLVV------------------------------------------------- 482
+WQFREDPLVV
Sbjct: 430 KWQFREDPLVVVLNPQSKVEFMNAIHLIRVWENEAIPFTHARTEFLLKKHWPESTLLKFT 489
Query: 483 --------IKSQKSIIFYGGRNPSWIQQFEEKVEILKNDPLILDGSSFEIVRIGKDAIGE 542
I SQKSIIFYGG++ SWIQQFEEKVEILK DPLI++G SFEIVRIGKDA E
Sbjct: 490 HQPRLPNWINSQKSIIFYGGKSQSWIQQFEEKVEILKGDPLIINGGSFEIVRIGKDATRE 549
Query: 543 DDPKLMARFWKVQWGYFIVKSQIKGLSASETSEDILRLISYQNEDGWAVVTIGSAPLLVG 602
DDPKLMARFWKVQWGYF+VKSQIKG SASET+EDILRLISYQNEDGWAV+T+G AP+LVG
Sbjct: 550 DDPKLMARFWKVQWGYFVVKSQIKGSSASETTEDILRLISYQNEDGWAVLTVGLAPVLVG 609
Query: 603 SGILILRLFDDFPKWKQNLHFKAFPNAFRDSFNELAMKTHECDRVIFPEFSGWIPMVVNC 621
ILILRL DDFP+WK L KAFP+AFRD FN+LAMK H CD++ P FSG IPMV+NC
Sbjct: 610 RDILILRLLDDFPEWKSTLRLKAFPDAFRDYFNDLAMKIHRCDQISLPGFSGSIPMVINC 669
BLAST of CmoCh17G001820 vs. ExPASy TrEMBL
Match:
A0A6J1L5P1 (protein SIEVE ELEMENT OCCLUSION B-like OS=Cucurbita maxima OX=3661 GN=LOC111499360 PE=4 SV=1)
HSP 1 Score: 917.5 bits (2370), Expect = 2.9e-263
Identity = 460/681 (67.55%), Postives = 525/681 (77.09%), Query Frame = 0
Query: 4 APGLLDPVQSSTFKEQMRKRHYNDYFVYGSINFKHLNDDSTKIDIPIPNYISAIESIITA 63
AP LL + KE++ +H++D V G I KH +DD TKID +PNYIS IE+IIT
Sbjct: 12 APSLLHSKHAFAHKEEVGTKHFSDEIVTGHIYAKHRDDDRTKID--LPNYISVIENIITT 71
Query: 64 TNRITDTVHHRGSEG-------SLDVNVVIEPPLCILNHISSELWCNPPGIGSAHENTLT 123
++I DTV HRG++G SL NVVIEPPLC L+ ISSEL C PGI AHE TL
Sbjct: 72 ADQIIDTV-HRGTDGRLVHSDASLAFNVVIEPPLCTLHRISSELSCKAPGIEKAHETTLE 131
Query: 124 IFEILANYPWEAKAALTLLAFATDYGDLWHPYPYSHTDPLAKSLAIIKRVGMLKKHLDSL 183
IFEILANYPWEAKAALTL+AFA DYGDLWH + YSH DPLAKSLAIIKRV LKKHLDSL
Sbjct: 132 IFEILANYPWEAKAALTLIAFAADYGDLWHLHHYSHADPLAKSLAIIKRVATLKKHLDSL 191
Query: 184 PYQQVLLNPNSLIQMCLQAIKRMNEIREFKKYYLNDPLELASALRQIPLVTYWVIHIIVA 243
Y+QVLLNP SLIQ CLQAIK M+EIREF KY + + EL +ALRQIPLVTYWVIH IVA
Sbjct: 192 RYRQVLLNPKSLIQSCLQAIKYMDEIREFSKYDVKELSELPAALRQIPLVTYWVIHTIVA 251
Query: 244 SSIELSSYLTETKNQTQRYLNELSETIALVLARLEKHLDAIREQHEEVDLYRWLVDHIEH 303
S IELSSYL+ET+NQ QRYLN+LSE +A VL LEKHL+ +REQHEEVDLYRWLVDHIEH
Sbjct: 252 SRIELSSYLSETENQPQRYLNDLSEKMARVLDLLEKHLETLREQHEEVDLYRWLVDHIEH 311
Query: 304 FHTNIALVVSKLLSGKPETNPIIDGSTQKEVGVHESLSGKNVVLLISRLDISEDDIKAIH 363
+ T+I LVV KLLSGK ET P+IDGST +EVGVHESLSGKNV+L+IS LDISEDDIKAIH
Sbjct: 312 YRTDITLVVPKLLSGKTETKPLIDGSTLREVGVHESLSGKNVILVISGLDISEDDIKAIH 371
Query: 364 NIYDELKTRDTNYEIVWIPIIPEPYHEDDRKRYEYLRSTMKWYSIQFTTRISGMRYIEEE 423
N+YDELK R TNYEIVWIPIIPEPYHEDD K+YEYLRSTMKWYSIQFTT+ISGMRY+EE+
Sbjct: 372 NVYDELKNRGTNYEIVWIPIIPEPYHEDDHKKYEYLRSTMKWYSIQFTTKISGMRYLEEK 431
Query: 424 WQFREDPLVV-------------------------------------------------- 483
WQ REDPLVV
Sbjct: 432 WQLREDPLVVVLSPQSEVVFMNAIHLIRVWGTEAIDFKEDRAKFLLRKNWPDSTLVKFTH 491
Query: 484 -------IKSQKSIIFYGGRNPSWIQQFEEKVEILKNDPLILDGSSFEIVRIGKDAIGED 543
IK +KSI+FYGG+ P WIQQFEE+VEILK+DPLI DG SFEIVRIGK+A GED
Sbjct: 492 QPRLQSWIKQEKSILFYGGKEPMWIQQFEERVEILKSDPLIRDGGSFEIVRIGKNAKGED 551
Query: 544 DPKLMARFWKVQWGYFIVKSQIKGLSASETSEDILRLISYQNEDGWAVVTIGSAPLLVGS 603
DP LMARFWK+QWGYFIVKSQ+ G SASET+EDILRLISYQNE+GW V+++GSAP+LVG
Sbjct: 552 DPALMARFWKIQWGYFIVKSQLIGSSASETTEDILRLISYQNEEGWVVLSVGSAPVLVGR 611
Query: 604 GILILRLFDDFPKWKQNLHFKAFPNAFRDSFNELAMKTHECDRVIFPEFSGWIPMVVNCP 621
GILIL+L ++FPKWKQ+L KAFP+AFR+ FNELA+K+H+CDRVI P FSGWIPM+VNCP
Sbjct: 612 GILILKLLEEFPKWKQSLRLKAFPDAFREYFNELALKSHQCDRVILPGFSGWIPMIVNCP 671
BLAST of CmoCh17G001820 vs. ExPASy TrEMBL
Match:
I6V4B3 (Sieve element occlusion protein 1 OS=Cucurbita maxima OX=3661 GN=SEO1 PE=2 SV=1)
HSP 1 Score: 907.9 bits (2345), Expect = 2.3e-260
Identity = 458/681 (67.25%), Postives = 525/681 (77.09%), Query Frame = 0
Query: 4 APGLLDPVQSSTFKEQMRKRHYNDYFVYGSINFKHLNDDSTKIDIPIPNYISAIESIITA 63
AP LL +ST KE++ +H++D V G I KH +DDSTKID +P+YIS IE+IIT
Sbjct: 12 APSLLHSKHASTHKEEVGTKHFSDELVTGHIYAKHRDDDSTKID--LPSYISVIENIITT 71
Query: 64 TNRITDTVHHRGSEG-------SLDVNVVIEPPLCILNHISSELWCNPPGIGSAHENTLT 123
++I DTV HRG++G SL NVVIEPPLC L+ ISSEL C PGI AHE TL
Sbjct: 72 ADQIIDTV-HRGTDGRLVHSDASLAFNVVIEPPLCTLHRISSELSCKAPGIEKAHETTLE 131
Query: 124 IFEILANYPWEAKAALTLLAFATDYGDLWHPYPYSHTDPLAKSLAIIKRVGMLKKHLDSL 183
IFEILANYPWEAKAALTL+AFA DYGDLWH + YSH DPLAKSLAIIKRV LKKHLDSL
Sbjct: 132 IFEILANYPWEAKAALTLIAFAADYGDLWHLHHYSHADPLAKSLAIIKRVATLKKHLDSL 191
Query: 184 PYQQVLLNPNSLIQMCLQAIKRMNEIREFKKYYLNDPLELASALRQIPLVTYWVIHIIVA 243
Y+QVLLNP SLIQ CLQAIK M+EIREF KY + + EL +ALR IPLVTYWVIH IVA
Sbjct: 192 RYRQVLLNPKSLIQSCLQAIKYMDEIREFSKYDVKELSELPAALRLIPLVTYWVIHTIVA 251
Query: 244 SSIELSSYLTETKNQTQRYLNELSETIALVLARLEKHLDAIREQHEEVDLYRWLVDHIEH 303
S IELSSYL+ET+NQ QRYLN+LSE +A VL LEKHL+ +REQHEEVDLYRWLVDHIEH
Sbjct: 252 SRIELSSYLSETENQPQRYLNDLSEKMARVLDVLEKHLETLREQHEEVDLYRWLVDHIEH 311
Query: 304 FHTNIALVVSKLLSGKPETNPIIDGSTQKEVGVHESLSGKNVVLLISRLDISEDDIKAIH 363
+ T+I LVV KLLSGK ET P+IDGST +EVG+HESLSGKNV+L+IS LDISEDDIKAIH
Sbjct: 312 YRTDITLVVPKLLSGKTETKPLIDGSTLREVGIHESLSGKNVILVISGLDISEDDIKAIH 371
Query: 364 NIYDELKTRDTNYEIVWIPIIPEPYHEDDRKRYEYLRSTMKWYSIQFTTRISGMRYIEEE 423
N+YDELK+R TNYEIVWIPII E HEDD K+YEYLRSTMKWYSIQFTT+ISGMRY+EE+
Sbjct: 372 NVYDELKSRGTNYEIVWIPIILESNHEDDHKKYEYLRSTMKWYSIQFTTKISGMRYLEEK 431
Query: 424 WQFREDPLVV-------------------------------------------------- 483
WQ REDPLVV
Sbjct: 432 WQLREDPLVVVLSPQSEVVFMNAIHLIRVWGTEAIDFKEDRAKFLLRKNWPDSTLVKFTH 491
Query: 484 -------IKSQKSIIFYGGRNPSWIQQFEEKVEILKNDPLILDGSSFEIVRIGKDAIGED 543
IK +KSI+FYGG+ P WIQQFEE+VEILK+DPLI DG SFEIVRIGK+A GED
Sbjct: 492 QPRLQSWIKQEKSILFYGGKEPMWIQQFEERVEILKSDPLIRDGGSFEIVRIGKNAKGED 551
Query: 544 DPKLMARFWKVQWGYFIVKSQIKGLSASETSEDILRLISYQNEDGWAVVTIGSAPLLVGS 603
DP LMARFWK+QWGYFIVKSQ+ G SASET+EDILRLISYQNEDGW V+++GSAP+LVG
Sbjct: 552 DPALMARFWKIQWGYFIVKSQLIGSSASETTEDILRLISYQNEDGWVVLSVGSAPVLVGR 611
Query: 604 GILILRLFDDFPKWKQNLHFKAFPNAFRDSFNELAMKTHECDRVIFPEFSGWIPMVVNCP 621
GILIL+L ++FPKWKQ+L KAFP+AFRD FNELA+K+H+CDRVI P FSG+IPM+VNCP
Sbjct: 612 GILILKLLEEFPKWKQSLRLKAFPDAFRDYFNELALKSHQCDRVILPGFSGYIPMIVNCP 671
BLAST of CmoCh17G001820 vs. TAIR 10
Match:
AT3G01680.1 (CONTAINS InterPro DOMAIN/s: Mediator complex subunit Med28 (InterPro:IPR021640); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G01670.1); Has 122 Blast hits to 112 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 122; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 127.9 bits (320), Expect = 2.9e-29
Identity = 97/343 (28.28%), Postives = 168/343 (48.98%), Query Frame = 0
Query: 110 AHENTLTIFEILANYPWEAKAALTLLAFATDYGDLWHPYPYSHTDPLAKSLAIIKRVGML 169
+HE T+++FE L+++ W+ K LTL AFA +YG+ W + + LAKSLA++K V +
Sbjct: 127 SHEITMSVFEHLSSFQWDGKLVLTLAAFALNYGEFWLLVQFYSKNQLAKSLAMLKLVPVQ 186
Query: 170 KKHLDSLPYQQVLLNPNSLIQMCLQAIKRMNEIREFKKYYLN-DPLELASALRQIPLVTY 229
+ + + V N LI+ + E+ E Y+ D +L+ L IP+ Y
Sbjct: 187 NR----VTLESVSQGLNDLIREMKSVTACVVELSELPDRYITPDVPQLSRILSTIPIAVY 246
Query: 230 WVIHIIVA--SSIELSSYLTETKNQTQRYLNELSETIALVLARLEKHL-DAIR------E 289
W I ++A S I + + + TQ L E S +A L + HL + +R E
Sbjct: 247 WTIRSVIACISQINMITAMGHEMMNTQMDLWETS-MLANKLKNIHDHLAETLRLCYRHIE 306
Query: 290 QHEEVDLYRWLVDHIEHFHTNIALVVSKLLSGKPETNPIIDGSTQKEVGVHESLSGKNVV 349
+ + + L + H + +++ L+ KP P+ DG T+++V + + L K V+
Sbjct: 307 KQRSSESLKVLHSLFDTTHIDNMKILTALVHPKPHITPLQDGLTKRKVHL-DVLRRKTVL 366
Query: 350 LLISRLDISEDDIKAIHNIYDELKTR--------DTNYEIVWIPIIPEPYHEDDR----- 409
LLIS L+I +D++ IY E + YE+VW+P++ +P + +R
Sbjct: 367 LLISDLNILQDELSIFEQIYTESRRNLVGVDGKSHMPYEVVWVPVV-DPIEDFERSPILQ 426
Query: 410 KRYEYLRSTMKWYSIQFTTRISG--MRYIEEEWQFREDPLVVI 428
K++E LR M WYS+ I + ++ W F P++V+
Sbjct: 427 KKFEDLRDPMPWYSVDSPKLIERHVVEFMRGRWHFMNKPILVV 462
BLAST of CmoCh17G001820 vs. TAIR 10
Match:
AT3G01670.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G01680.1); Has 121 Blast hits to 111 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 121; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 87.4 bits (215), Expect = 4.3e-17
Identity = 84/351 (23.93%), Postives = 163/351 (46.44%), Query Frame = 0
Query: 107 IGSAHENTLTIFEILANYPWEAKAALTLLAFATDYGDLWHPYPYSHTDPLAKSLAIIKRV 166
+ S + T ++ +++ Y W+AK L L A A YG T+ L KSLA+IK++
Sbjct: 231 LDSRNTTTFSVLSLVSKYRWDAKLVLVLSALAVKYGVFLLLAETHATNQLTKSLALIKQL 290
Query: 167 GMLKKHLDSLPYQQVLLNPNSLIQMCLQAIKRMNEIREFKKYYLNDPLELASALRQIPLV 226
+ ++L Q L L+Q + + +I + ++ A+ IP
Sbjct: 291 PSIFSRQNAL--HQRLDKTRILMQDMVDLTTTIIDIYQLPPNHIT-----AAFTDHIPTA 350
Query: 227 TYWVIHIIVASSIELSSYLTETKNQTQRY-----LNELSETIALVLARLEKHLDAIREQH 286
YW++ ++ +S ++Q + ++E SE + + A L + +
Sbjct: 351 VYWIVRCVLICVSHISGASGFKQDQIMSFMEVSEIHENSERLRKINAYLLEQFKKSKMTI 410
Query: 287 EEVDLYRWLVDHIEHFHTNIAL-VVSKLLSGKPETNPIIDGS--TQKEVGVHESLSGKNV 346
EE + + I+ F T I + VV LL + + G+ +++ VG++ L+ K+V
Sbjct: 411 EEGIIEEEYQELIQTFTTIIHVDVVPPLLRLLRPIDFLYHGAGVSKRRVGIN-VLTQKHV 470
Query: 347 VLLISRLDISEDDIKAIHNIYDELKTRDTNYEIVWIPIIPEPYHEDDRKRYEYLRSTMKW 406
+LLIS L+ E ++ + ++Y E ++EI+W+P + + + E D ++E L M+W
Sbjct: 471 LLLISDLENIEKELYILESLYTE--AWQQSFEILWVP-VQDFWTEADDAKFEALHMNMRW 530
Query: 407 YSIQFTTRI--SGMRYIEEEWQFREDPLVVIKSQKSIIFYGGRNPS-WIQQ 447
Y + ++ + +R++ E W F+ P++V K + P WI Q
Sbjct: 531 YVLGEPRKLRRAAIRFVREWWGFKNRPILVALDPKGQVMSTNAFPMVWIWQ 570
BLAST of CmoCh17G001820 vs. TAIR 10
Match:
AT1G67790.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G01680.1); Has 208 Blast hits to 125 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 208; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 57.0 bits (136), Expect = 6.3e-08
Identity = 49/179 (27.37%), Postives = 80/179 (44.69%), Query Frame = 0
Query: 96 ISSELWCNPPGIGSAHENTLTIFEILANYPWEAKAALTLLAFATDYGDLWHPYPYSHTDP 155
IS ++ C G + T+ +F++L Y W+AKA L L A YG L P + DP
Sbjct: 80 ISVQMLCPCTGENEIRKRTMVLFDLLKEYRWDAKAVLVLGVLAATYGGLLLPVHLAICDP 139
Query: 156 LAKSLAIIKRVGMLKKHLDSLPYQQVLLNP-----NSLIQMCLQAIKRMNEIRE--FKKY 215
+A S+A L+ LP ++ P N LI+ + K + + + FK+
Sbjct: 140 VAASIA----------KLNQLPIERTKFRPWLESLNLLIKAMVDVTKCIIKFEKIPFKQA 199
Query: 216 YLNDPLELASALRQIPLVTYWVIHIIVASSIELSSYLTETKNQTQRYLNELSETIALVL 268
L++ + L L I L TY V V S++ + K Q + E+ + + L+L
Sbjct: 200 KLDNNI-LGETLSNIYLTTYRV----VKSALTCMQQIPYFKQTQQISITEVQDKVTLLL 243
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9SS87 | 4.1e-28 | 28.28 | Protein SIEVE ELEMENT OCCLUSION B OS=Arabidopsis thaliana OX=3702 GN=SEOB PE=1 S... | [more] |
Q93XX2 | 6.1e-16 | 23.93 | Protein SIEVE ELEMENT OCCLUSION A OS=Arabidopsis thaliana OX=3702 GN=SEOA PE=1 S... | [more] |
Q9FXE2 | 9.7e-14 | 24.24 | Protein SIEVE ELEMENT OCCLUSION C OS=Arabidopsis thaliana OX=3702 GN=SEOC PE=4 S... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1H3S9 | 0.0e+00 | 91.58 | protein SIEVE ELEMENT OCCLUSION B-like OS=Cucurbita moschata OX=3662 GN=LOC11146... | [more] |
A0A6J1H6V1 | 6.3e-274 | 70.82 | protein SIEVE ELEMENT OCCLUSION B-like OS=Cucurbita moschata OX=3662 GN=LOC11146... | [more] |
A0A6J1L2X0 | 1.8e-268 | 69.79 | protein SIEVE ELEMENT OCCLUSION B-like OS=Cucurbita maxima OX=3661 GN=LOC1114993... | [more] |
A0A6J1L5P1 | 2.9e-263 | 67.55 | protein SIEVE ELEMENT OCCLUSION B-like OS=Cucurbita maxima OX=3661 GN=LOC1114993... | [more] |
I6V4B3 | 2.3e-260 | 67.25 | Sieve element occlusion protein 1 OS=Cucurbita maxima OX=3661 GN=SEO1 PE=2 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
AT3G01680.1 | 2.9e-29 | 28.28 | CONTAINS InterPro DOMAIN/s: Mediator complex subunit Med28 (InterPro:IPR021640);... | [more] |
AT3G01670.1 | 4.3e-17 | 23.93 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |
AT1G67790.1 | 6.3e-08 | 27.37 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |