Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AGAGTGTCAAAGAAAACCAAAGACGAAAATTTTTTCACATTCGCCGTCTCTCTTCTCCCTCTCCAGACTTCTACACGTTCGCTTCCTCTTCTCTCTCTCTCTCTCCGCCGTTCCGTTTCTCTCTGCCGTTTTCCGTTCTCTCCGCCACCGCACAGCGGCTGAAGCAGACCGAACCAAACATACAGACAAAAACCCTATTCGCTGTTCCGAACAACCCAAGCCTCATTTTCTCTCATTCACAATCCCTCTCCCTCTCTCTTTCCGCCGCCGCTTTCTTTCTTCACAGTCTCAAACTCTCTCTCTCTCTGTCTCTCTCGTTGTTGCCCTCTTCAGACGCATCTCTTTTTGTTTTTGGGAATATACAGCAACCACTGGGGGCAGTTATTGAAACAGTTGCGCCTTCTGGGGTTCGTCAATCAGTCTGAGGGTAGGGTTAGTTCTTCGTCTTTCATCGGATATCTCACAGGTTATATTATTTATCTCTTGTGTATTTTTCTTGCGGTTCGGACCTCCGCGGAAGTCTTTCGGAGGCTGCATATTTGGTTTAGGGTTTCTGATTTTTTTTCTTTGCACATTCAATCGGAGTTCGGATTGTGGTTTTTCCAATTTCTTATTTCGTTAGGGTTTTTTTTTTTCTTGTTCCTGATCGAACTGTTTTGACGGGCACAATCATCTGTTTTGGAATTGCTCCACCTCTTTGTTTGTGAGGTTTGCCTTCTTTATATTTCTGCATCTCAACCGATTCCCTCAAAGACGAGGGTTTTCCGAGCGTGTTTTGCTGACCGAATCTACACTTAATCGAACATGGAGGCCGGGAAAGATAGTGAGGCCTTTCCTCTTACTATTCTTTTCAATTGAGCCGCTTCATCTTCTCATTGCTATTTCCTCTGTTTTCATCCTCTGATATTCCACTCCCGTATGAGAATTAGGTTATATTGATTTTGCACATAAGATAAGACAAATCCCGTTCAGAATGCGTTCAGCTTGTAGCCAAATTGGTAGCTACTTGCTTGAATAGCGGTGGTTGTGAAGTAGGATAATTGTAGAGGATAAACTTCTTTAGTGGCATTTAGGACATAAATTTTGAGAACCATCCTCTGGGTTCACTCTCGTGCATGAGTTTGCAACTTTGAAGCAATTCACTGCGCATGTCAATCGTTGTTGGAACCCTGGATATGCCACTAAGTTGATGACACATATGGTTTGTAGCCGTGACTTGCATAGTCAGGGATGCCCGTGAAAGTTCATCCAGCGCAAAATAGAGGTTAGGCATTTTGAAATAATAAAGTACAGGTTAGGGTTTTATTCGTTCTTCTGTATAAGGAGTACAGTTTTTGGAAATTTGAGGTGGTAAAGAAGGCAAGAAGTGTTTTTTTCTTCTGATATTTTGATCATGTTCAGTATTCCTTATGGAAAGAAGTGAACCCTCATTAGTTCCAGAATGGTTGAGAAGTACTGGAAACGTTACTGGTGGCAGCAATTCAAACCACCATTTTCCGTCATCTTCTTCCCACTCAGGTATTGTTTTTTTCCCTCTAAATATTACATGCTTTCATCAATGATGCTCTCACTGACTGCTGACTTGCCTCGATTTTTATTTTGGGAACACCTTTTAATCTCATATCCTGTGTTGTATAGATGTGCCCTCTCTATCTCAATCGAGAAATAGAACTTCCAAGACCACTGGTGATTTTGATACTTCACGTTCTGCTTTTCTTGATCGGACATCTTCATCAAATTCAAGGAGAAGTTCGAGCAATGGTTCTGCCAAACATGCATATAGTAGTTTTAACAGGGGTCATCGTGATAAGGATCGAGAGAAAGAAAAGGATAGGTTAAGCTTTGGGGATAATTGGGACCGTGACTCTCATGACCCTCTTGGAAAGATTCTTTCCAATAGGATTGATAAAGATGCTTTGCGGCGATCCCATTCAATGGTATCCAGGAAGCAAGGCGAGTTGTTTCACAGAAGAGTTGCAACAGATTTAAAAGCTGGTGTCAACAGCAGTCACAACAACGGGAATGTGATGCCTTCTGGAACTAGTGTTGGCAGTAGCATTCAGAAGGCTGTATTTGAAAAGGATTTCCCCTCACTGGGATCCGAAGATAGGCAAGGATCATCAGAAATTGGAAGAGTTTCATCACCTGGTTTGAGCTCACCAGTTCAAAGCTTGCCTATTGGGAATTCAGCCTTAATTGGTGGTGAGGGATGGACATCTGCTCTTGCCGAGGTGCCCAATATGATTGGAAGCACCACAGGGTCATCATCATTTCAACAAACTGTTCCTGCTACATCAGGGGCTGGGCCTGTGAGTGTGACAGCTGGACTAAATATGGCTGAAGCGTTGGTGCAGGCTCCATCCCGAGCTCGTGCTACACCCCAGGTATCTGAGGTAACTTACTATACCTCATTGTGTTGTATATTATCTACTCCTTTTGATCTAACAGTCTCATCCTCTACCCCCCCACTTTCTTAGTTATCTGTCAAGACCCAGAGGCTTGAGGAATTGGCTATTAAGCAGTCCAGGCAGTTAATACCAGTGACGCCTTCTATGCCAAAAGCTATGGTAATTGGAGCTAATGCAATTCTTGAATCTTTGTTGTATTGATTTGTTGCTACCTCCTTATATTTTAATTAATTTAGGTACAATTATTTTTGTTGATTCAGAAATGAAAATGTTGATTCATAAAATATACAAGAGAACTGAAAACCATAAGCACACAGTGTCTTGACAACTTGTGTCAATCTCTGATATCAGCTAGGACTAATGCTCTCTTCTTATCAAAAGAGAAAAAGAAAAAAAGAACTCTTCTTGTAGCTACTTTCATGCTGTAGTGTATTATGTACTTAGGATGTACATATAATGATAATCGTTTTTGGGTAAATCATACTGAAAACAACGCAACAGAGATACTAGTTATATGTTTTGACATTTTTCTCCCCCTGCTGCTGGTGACTGGTGGTTTAAATTCTGGGATTTAGCTTGTTAAATAAACTATGCTTCTACATTTTAGAAAACTATAGAAATTTTTGTGGAAGTTTGTTGTTATTCAAGGATTCTATCTCTTCAACTTACTTATCATTGACTAAAGAATACTTTTATGGGTTGATAGCGGAAGTTAAAACATGTGCTAGAAGTTTCTGGTTCTAAGTCATTTACGCTATGCCACTTTTCCCCCTTCTATTTGATGAGAAACTCAATCTAAGGATGATGGAAAGAACAAAATACAAGTGAACCCCAGAAAAAGTGGTGCTAGAGGGATTCTATATCATTTATGTATTCTTTGATTTAGGAAATAGTTATTTAAAATTATTTAAAGGGTTTTGTCTTTATTTTCGTGCCTGATCACCTTTTGGCAATTTGTAAGAAGTAGATGGTAGGAAAGAGTTAACAGGGTTTATTTCTGGAACCAATTTCATAGCATGCATGAATGTGCTTTGCCATTTTGACACTTTGCACATTGAATTATCTTTGGTTAACCGCTTTTTATTCTCTCTCAAGACTTGAAATCTTTCTATATTAACGTTCTTCATCTGATTGTTGGGCTGTGGCCAATATATCGCTGGTGGCTAGTATTTGTTCGTTAACGTGTTATATTTGGTAGGCAAGTTTTTTTTATTTGACCTGAAAACTTGTTAAGCTGTTAATTGATTATCAACTCCTCTTCCTCTATAATAGTTATAACTTAACTGAGCATTCCTCCTTTGAAAAGTTCTCATTATTTTGCTGTTTTTTTTTTTGAAGCTGATTTAAATCATTGTTGTGTCGGTGTAGTGATGTAAAAAACTTGCTTGGCGGTATAATTCTTCAAATAGTTTTAGGGAAAAAAGAAGCAGAAGACAAGAATATTATGTTGTTGTGTCTTCTTTTATGTTTTCTGAAGCAATTTATTCATACCTGATCAATATTCTTTCAAAGTATAACTGGATTTAAATATGTTATATCTTTATATATATATATATATATATATATATATTCATAAAAAAATGCTGAAGAAATAAATGTAAATACAAGAACAGGCTGGGCCATATCTTTCTATATAATTTTAAAAAGTTTGTTTGTAAAAACAATTCTGATAATATACATGCAAATAGTTCTCTACTCTTGTATCTGAAATATAGTTCGTTTTTACATTTTGATATTTCATGGTTAATTTCTACAATGAAACATTGATTTTATTTCCTTTTCAGCATTGAACATATCTAGGAATACATAATTCTCATTATATACCTAATCGCAAATCTTCTGTTTTCTTCTTAATCATTAGCATGTTTTTGTTTGATAGGTGCTTAATTCTTCTGATAAATCAAAACCCAAATTAGCATCGAGAACTGGAGAACTTAATGTAACCATCAAGGGTGGACAGCCACAGCCCCTGTCAGTCCATGCCAACCAATCTCGTGGCGGACCTGTCAAGTCCGATCCTCAAAAGAGTTCTCATGGGAAATTTCTTGTTCTAAAACCTGTACGAGAAAATGGTGTCTCCCTTGCAGCAAAGGATGTCTCGAGTCCAACTAGTAATGCAAACAGCATGGCAGCAAACAACCAGTTCGCTCTTGCACCTTCAGTTCCACATGCTCCTTTGAGAAGCCCAAACAATTCAAATGTTTCCTCTGTGGAGCGCAAGATTGCTAGCTTAGATCTCAAATCTGGATCGACTTTGGAAAAAAGACCGTCCTTATCACAAGTCCAAAGTCGGAATGATTTCTTTAACCTCATTAAGAAGAAAACTTCAATGAGTTCTTCTGCTGTTCTCTCAGATTCATGCTCCTCTGTGAAATCTCCTTCAATTGGCCAATCTAATGAACTAACAAGGGAAGAAATCAACATTCCTGCAAGTCCTCGTGTTATTGAAAATGGTGCTATGGAGACTCTAAATGGAGATAGTTCTGAAGAGGTTCGAGCATCTTGTGACAGTGGTGAAAAAACTGAGAGCCACGTTGCTGCAGAGTCTCTAGATGAAGAGGAGGCTGCTTTTCTTCGTTCTCTTGGATGGGATGAGAACTGTGGTGAGGGAGGCCTTACTGAAGAAGAGATCAATTCTTTCTATCAGGAGGTAAACAATTACTTCTTGTCACGTTTCACTACTTCTATTGCTTTCATTAATCCCCGCTGCAATATTTTCCAAAGTATTGTTCTTATCTATATTAGCTGATGATGAAGTTGTAAATATATAATAATGTATGCTCTAATTCTGACTGCAGTTGAATTCCATGAATTTGAAGCCATCTCTAAAAATTGGCCGATTCATTCAGCGAAAGATAGTCGTGCCATCTGAATCTCCCGAGGGCGAGGGTAGTAGCAAGGATGAAGCTGATTCTGAATTGAGCTCATCTGACTCGGAAGCCTGAATTACTTTTTTTCACCCATGAGAAGTTCCTAGTTCACAATTTTTAATGGCAGTGGGGTTAGTTCTTTTCTTTTACAATTTTTTTTCCTTCTTTTTTGTTTTCATGGTCTTAAAGAAATGCTGATGAGAGTTGGGTCGGAAGAAGAGGTGGTTGTTAATTGAACACTGAACAGTGCAAATAGTGCAATAAGTTACAGTTGCCCAAATCATCCTGCCTACCTTTACAGCGGGAGTTTTTGCGGTTTTTAAAGGGCAGTTGCAGAGGTTTGGCAGGCGCATATTTTGTCTGCACTGAAAGATGTAGTGGAAATTGCATCTTTTGGTTTATTTTGGTTCTTTGTTCTCTCACTTTTGAAGAGGTTTTTGATTCTGGAAGTTGCTAGGAAGAAAAAAAAAAAAAGGTTACTGCTGCTTCAAAGAAAATTGGGGAAAAAAGGGAAAATGAAAAATGTTTCTATCTTCGTTCAGCATATATTGATTTATTGTTTGGGATGTCTCTACAGTTGGGTAGCGTAGCTTTGTTTGACATTGAATTGCACTAATTACACATGAATAAGAGGATTAGTTTAGAGATTAATTGTTTGGTGTCACTATAATCGCCCATTTTTTTAATCATTTGCTACGTTTGGGAAACTGTTTTGCAAATTAATCATCTTGTACGTTTGATAAACTATTTTGTAACTCAAAAGTTTTTAATATTTCAGTAAATCGTTTAAGTT
mRNA sequence
AGAGTGTCAAAGAAAACCAAAGACGAAAATTTTTTCACATTCGCCGTCTCTCTTCTCCCTCTCCAGACTTCTACACGTTCGCTTCCTCTTCTCTCTCTCTCTCTCCGCCGTTCCGTTTCTCTCTGCCGTTTTCCGTTCTCTCCGCCACCGCACAGCGGCTGAAGCAGACCGAACCAAACATACAGACAAAAACCCTATTCGCTGTTCCGAACAACCCAAGCCTCATTTTCTCTCATTCACAATCCCTCTCCCTCTCTCTTTCCGCCGCCGCTTTCTTTCTTCACAGTCTCAAACTCTCTCTCTCTCTGTCTCTCTCGTTGTTGCCCTCTTCAGACGCATCTCTTTTTGTTTTTGGGAATATACAGCAACCACTGGGGGCAGTTATTGAAACAGTTGCGCCTTCTGGGGTTCGTCAATCAGTCTGAGGGTAGGGTTAGTTCTTCGTCTTTCATCGGATATCTCACAGGTTATATTATTTATCTCTTGTGTATTTTTCTTGCGGTTCGGACCTCCGCGGAAGTCTTTCGGAGGCTGCATATTTGGTTTAGGGTTTCTGATTTTTTTTCTTTGCACATTCAATCGGAGTTCGGATTGTGGTTTTTCCAATTTCTTATTTCGTTAGGGTTTTTTTTTTTCTTGTTCCTGATCGAACTGTTTTGACGGGCACAATCATCTGTTTTGGAATTGCTCCACCTCTTTGTTTGTGAGGTTTGCCTTCTTTATATTTCTGCATCTCAACCGATTCCCTCAAAGACGAGGGTTTTCCGAGCGTGTTTTGCTGACCGAATCTACACTTAATCGAACATGGAGGCCGGGAAAGATAGTGAGGCCTTTCCTCTTACTATTCTTTTCAATTGAGCCGCTTCATCTTCTCATTGCTATTTCCTCTGTTTTCATCCTCTGATATTCCACTCCCGTATGAGAATTAGGTTATATTGATTTTGCACATAAGATAAGACAAATCCCGTTCAGAATGCGTTCAGCTTGTAGCCAAATTGGTAGCTACTTGCTTGAATAGCGGTGGTTGTGAAGTAGGATAATTGTAGAGGATAAACTTCTTTAGTGGCATTTAGGACATAAATTTTGAGAACCATCCTCTGGGTTCACTCTCGTGCATGAGTTTGCAACTTTGAAGCAATTCACTGCGCATGTCAATCGTTGTTGGAACCCTGGATATGCCACTAAGTTGATGACACATATGGTTTGTAGCCGTGACTTGCATAGTCAGGGATGCCCGTGAAAGTTCATCCAGCGCAAAATAGAGTATTCCTTATGGAAAGAAGTGAACCCTCATTAGTTCCAGAATGGTTGAGAAGTACTGGAAACGTTACTGGTGGCAGCAATTCAAACCACCATTTTCCGTCATCTTCTTCCCACTCAGATGTGCCCTCTCTATCTCAATCGAGAAATAGAACTTCCAAGACCACTGGTGATTTTGATACTTCACGTTCTGCTTTTCTTGATCGGACATCTTCATCAAATTCAAGGAGAAGTTCGAGCAATGGTTCTGCCAAACATGCATATAGTAGTTTTAACAGGGGTCATCGTGATAAGGATCGAGAGAAAGAAAAGGATAGGTTAAGCTTTGGGGATAATTGGGACCGTGACTCTCATGACCCTCTTGGAAAGATTCTTTCCAATAGGATTGATAAAGATGCTTTGCGGCGATCCCATTCAATGGTATCCAGGAAGCAAGGCGAGTTGTTTCACAGAAGAGTTGCAACAGATTTAAAAGCTGGTGTCAACAGCAGTCACAACAACGGGAATGTGATGCCTTCTGGAACTAGTGTTGGCAGTAGCATTCAGAAGGCTGTATTTGAAAAGGATTTCCCCTCACTGGGATCCGAAGATAGGCAAGGATCATCAGAAATTGGAAGAGTTTCATCACCTGGTTTGAGCTCACCAGTTCAAAGCTTGCCTATTGGGAATTCAGCCTTAATTGGTGGTGAGGGATGGACATCTGCTCTTGCCGAGGTGCCCAATATGATTGGAAGCACCACAGGGTCATCATCATTTCAACAAACTGTTCCTGCTACATCAGGGGCTGGGCCTGTGAGTGTGACAGCTGGACTAAATATGGCTGAAGCGTTGGTGCAGGCTCCATCCCGAGCTCGTGCTACACCCCAGGTATCTGAGTTATCTGTCAAGACCCAGAGGCTTGAGGAATTGGCTATTAAGCAGTCCAGGCAGTTAATACCAGTGACGCCTTCTATGCCAAAAGCTATGGTGCTTAATTCTTCTGATAAATCAAAACCCAAATTAGCATCGAGAACTGGAGAACTTAATGTAACCATCAAGGGTGGACAGCCACAGCCCCTGTCAGTCCATGCCAACCAATCTCGTGGCGGACCTGTCAAGTCCGATCCTCAAAAGAGTTCTCATGGGAAATTTCTTGTTCTAAAACCTGTACGAGAAAATGGTGTCTCCCTTGCAGCAAAGGATGTCTCGAGTCCAACTAGTAATGCAAACAGCATGGCAGCAAACAACCAGTTCGCTCTTGCACCTTCAGTTCCACATGCTCCTTTGAGAAGCCCAAACAATTCAAATGTTTCCTCTGTGGAGCGCAAGATTGCTAGCTTAGATCTCAAATCTGGATCGACTTTGGAAAAAAGACCGTCCTTATCACAAGTCCAAAGTCGGAATGATTTCTTTAACCTCATTAAGAAGAAAACTTCAATGAGTTCTTCTGCTGTTCTCTCAGATTCATGCTCCTCTGTGAAATCTCCTTCAATTGGCCAATCTAATGAACTAACAAGGGAAGAAATCAACATTCCTGCAAGTCCTCGTGTTATTGAAAATGGTGCTATGGAGACTCTAAATGGAGATAGTTCTGAAGAGGTTCGAGCATCTTGTGACAGTGGTGAAAAAACTGAGAGCCACGTTGCTGCAGAGTCTCTAGATGAAGAGGAGGCTGCTTTTCTTCGTTCTCTTGGATGGGATGAGAACTGTGGTGAGGGAGGCCTTACTGAAGAAGAGATCAATTCTTTCTATCAGGAGTTGAATTCCATGAATTTGAAGCCATCTCTAAAAATTGGCCGATTCATTCAGCGAAAGATAGTCGTGCCATCTGAATCTCCCGAGGGCGAGGGTAGTAGCAAGGATGAAGCTGATTCTGAATTGAGCTCATCTGACTCGGAAGCCTGAATTACTTTTTTTCACCCATGAGAAGTTCCTAGTTCACAATTTTTAATGGCAGTGGGGTTAGTTCTTTTCTTTTACAATTTTTTTTCCTTCTTTTTTGTTTTCATGGTCTTAAAGAAATGCTGATGAGAGTTGGGTCGGAAGAAGAGGTGGTTGTTAATTGAACACTGAACAGTGCAAATAGTGCAATAAGTTACAGTTGCCCAAATCATCCTGCCTACCTTTACAGCGGGAGTTTTTGCGGTTTTTAAAGGGCAGTTGCAGAGGTTTGGCAGGCGCATATTTTGTCTGCACTGAAAGATGTAGTGGAAATTGCATCTTTTGGTTTATTTTGGTTCTTTGTTCTCTCACTTTTGAAGAGGTTTTTGATTCTGGAAGTTGCTAGGAAGAAAAAAAAAAAAAGGTTACTGCTGCTTCAAAGAAAATTGGGGAAAAAAGGGAAAATGAAAAATGTTTCTATCTTCGTTCAGCATATATTGATTTATTGTTTGGGATGTCTCTACAGTTGGGTAGCGTAGCTTTGTTTGACATTGAATTGCACTAATTACACATGAATAAGAGGATTAGTTTAGAGATTAATTGTTTGGTGTCACTATAATCGCCCATTTTTTTAATCATTTGCTACGTTTGGGAAACTGTTTTGCAAATTAATCATCTTGTACGTTTGATAAACTATTTTGTAACTCAAAAGTTTTTAATATTTCAGTAAATCGTTTAAGTT
Coding sequence (CDS)
ATGCCCGTGAAAGTTCATCCAGCGCAAAATAGAGTATTCCTTATGGAAAGAAGTGAACCCTCATTAGTTCCAGAATGGTTGAGAAGTACTGGAAACGTTACTGGTGGCAGCAATTCAAACCACCATTTTCCGTCATCTTCTTCCCACTCAGATGTGCCCTCTCTATCTCAATCGAGAAATAGAACTTCCAAGACCACTGGTGATTTTGATACTTCACGTTCTGCTTTTCTTGATCGGACATCTTCATCAAATTCAAGGAGAAGTTCGAGCAATGGTTCTGCCAAACATGCATATAGTAGTTTTAACAGGGGTCATCGTGATAAGGATCGAGAGAAAGAAAAGGATAGGTTAAGCTTTGGGGATAATTGGGACCGTGACTCTCATGACCCTCTTGGAAAGATTCTTTCCAATAGGATTGATAAAGATGCTTTGCGGCGATCCCATTCAATGGTATCCAGGAAGCAAGGCGAGTTGTTTCACAGAAGAGTTGCAACAGATTTAAAAGCTGGTGTCAACAGCAGTCACAACAACGGGAATGTGATGCCTTCTGGAACTAGTGTTGGCAGTAGCATTCAGAAGGCTGTATTTGAAAAGGATTTCCCCTCACTGGGATCCGAAGATAGGCAAGGATCATCAGAAATTGGAAGAGTTTCATCACCTGGTTTGAGCTCACCAGTTCAAAGCTTGCCTATTGGGAATTCAGCCTTAATTGGTGGTGAGGGATGGACATCTGCTCTTGCCGAGGTGCCCAATATGATTGGAAGCACCACAGGGTCATCATCATTTCAACAAACTGTTCCTGCTACATCAGGGGCTGGGCCTGTGAGTGTGACAGCTGGACTAAATATGGCTGAAGCGTTGGTGCAGGCTCCATCCCGAGCTCGTGCTACACCCCAGGTATCTGAGTTATCTGTCAAGACCCAGAGGCTTGAGGAATTGGCTATTAAGCAGTCCAGGCAGTTAATACCAGTGACGCCTTCTATGCCAAAAGCTATGGTGCTTAATTCTTCTGATAAATCAAAACCCAAATTAGCATCGAGAACTGGAGAACTTAATGTAACCATCAAGGGTGGACAGCCACAGCCCCTGTCAGTCCATGCCAACCAATCTCGTGGCGGACCTGTCAAGTCCGATCCTCAAAAGAGTTCTCATGGGAAATTTCTTGTTCTAAAACCTGTACGAGAAAATGGTGTCTCCCTTGCAGCAAAGGATGTCTCGAGTCCAACTAGTAATGCAAACAGCATGGCAGCAAACAACCAGTTCGCTCTTGCACCTTCAGTTCCACATGCTCCTTTGAGAAGCCCAAACAATTCAAATGTTTCCTCTGTGGAGCGCAAGATTGCTAGCTTAGATCTCAAATCTGGATCGACTTTGGAAAAAAGACCGTCCTTATCACAAGTCCAAAGTCGGAATGATTTCTTTAACCTCATTAAGAAGAAAACTTCAATGAGTTCTTCTGCTGTTCTCTCAGATTCATGCTCCTCTGTGAAATCTCCTTCAATTGGCCAATCTAATGAACTAACAAGGGAAGAAATCAACATTCCTGCAAGTCCTCGTGTTATTGAAAATGGTGCTATGGAGACTCTAAATGGAGATAGTTCTGAAGAGGTTCGAGCATCTTGTGACAGTGGTGAAAAAACTGAGAGCCACGTTGCTGCAGAGTCTCTAGATGAAGAGGAGGCTGCTTTTCTTCGTTCTCTTGGATGGGATGAGAACTGTGGTGAGGGAGGCCTTACTGAAGAAGAGATCAATTCTTTCTATCAGGAGTTGAATTCCATGAATTTGAAGCCATCTCTAAAAATTGGCCGATTCATTCAGCGAAAGATAGTCGTGCCATCTGAATCTCCCGAGGGCGAGGGTAGTAGCAAGGATGAAGCTGATTCTGAATTGAGCTCATCTGACTCGGAAGCCTGA
Protein sequence
MPVKVHPAQNRVFLMERSEPSLVPEWLRSTGNVTGGSNSNHHFPSSSSHSDVPSLSQSRNRTSKTTGDFDTSRSAFLDRTSSSNSRRSSSNGSAKHAYSSFNRGHRDKDREKEKDRLSFGDNWDRDSHDPLGKILSNRIDKDALRRSHSMVSRKQGELFHRRVATDLKAGVNSSHNNGNVMPSGTSVGSSIQKAVFEKDFPSLGSEDRQGSSEIGRVSSPGLSSPVQSLPIGNSALIGGEGWTSALAEVPNMIGSTTGSSSFQQTVPATSGAGPVSVTAGLNMAEALVQAPSRARATPQVSELSVKTQRLEELAIKQSRQLIPVTPSMPKAMVLNSSDKSKPKLASRTGELNVTIKGGQPQPLSVHANQSRGGPVKSDPQKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAANNQFALAPSVPHAPLRSPNNSNVSSVERKIASLDLKSGSTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDSCSSVKSPSIGQSNELTREEINIPASPRVIENGAMETLNGDSSEEVRASCDSGEKTESHVAAESLDEEEAAFLRSLGWDENCGEGGLTEEEINSFYQELNSMNLKPSLKIGRFIQRKIVVPSESPEGEGSSKDEADSELSSSDSEA
Homology
BLAST of Lcy09g012070 vs. ExPASy TrEMBL
Match:
A0A6J1CKG8 (flocculation protein FLO11 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111011833 PE=4 SV=1)
HSP 1 Score: 1012.7 bits (2617), Expect = 6.8e-292
Identity = 564/639 (88.26%), Postives = 590/639 (92.33%), Query Frame = 0
Query: 1 MPVKVHPAQNRVFLMERSEPSLVPEWLRSTGNVTGGSNSNHHFPSSSSHSDVPSLSQSRN 60
MPVKVHP QNR FLMERSEP+LVPEWLRSTG+VTGG NSNHHFP SSSHSDV SL+QSRN
Sbjct: 1 MPVKVHPTQNRAFLMERSEPTLVPEWLRSTGSVTGGGNSNHHFPLSSSHSDVSSLAQSRN 60
Query: 61 RTSKTTGDFDTSRSAFLDRTSSSNSRRSSSNGSAKHAYSSFNRGHRDKDREKEKDRLSFG 120
RTSKT GDFDTSRSAFLDR+SSSNSRRSSSNGSAKHAYSSFNRGHRDKDREKEKDRLSFG
Sbjct: 61 RTSKTIGDFDTSRSAFLDRSSSSNSRRSSSNGSAKHAYSSFNRGHRDKDREKEKDRLSFG 120
Query: 121 DNWDRDSHDPLGKILSNRIDKDALRRSHSMVSRKQGELFHRRVATDLKAGVNSSHNNGNV 180
D+WDRDS DPLGKILSNRIDKDALRRSHSMVSRKQGELFHRR+ATD K GV+SS NNG
Sbjct: 121 DHWDRDSSDPLGKILSNRIDKDALRRSHSMVSRKQGELFHRRIATDPKIGVSSSLNNGTG 180
Query: 181 MPSGTSVGSSIQKAVFEKDFPSLGSEDRQGSSEIGRVSSPGLSSPVQSLPIGNSALIGGE 240
MPSGTSVGSSIQKAVFEKDFPSLGSE++QG+S+IGRVSSPGLSSPVQSLPIGNSALIGGE
Sbjct: 181 MPSGTSVGSSIQKAVFEKDFPSLGSEEKQGTSDIGRVSSPGLSSPVQSLPIGNSALIGGE 240
Query: 241 GWTSALAEVPNMIGSTTGSSSFQQTVPATSGAGPVSVTAGLNMAEALVQAPSRARATPQV 300
GWTSALAEVPN+IGS+TGSSSFQQTVPA SGAG +SVTAGLNMAEALVQAPSRARA PQV
Sbjct: 241 GWTSALAEVPNIIGSSTGSSSFQQTVPAISGAGLLSVTAGLNMAEALVQAPSRARAVPQV 300
Query: 301 SELSVKTQRLEELAIKQSRQLIPVTPSMPKAMVLNSSDKSKPKLASRTGELNVTIKGGQP 360
SEL VKTQRLEELAIKQSRQLIPVTPSMPKAMVLNSSDKSKPKLASRTGELNVTIKGGQP
Sbjct: 301 SELFVKTQRLEELAIKQSRQLIPVTPSMPKAMVLNSSDKSKPKLASRTGELNVTIKGGQP 360
Query: 361 QPLSV-HANQSRGGPVKSDPQKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAANN 420
QPL V H NQ+RGG VKSD QKSSHGKFLVLKP RENGVSLA KDV SPTSNAN+MAAN+
Sbjct: 361 QPLPVHHTNQTRGGHVKSDAQKSSHGKFLVLKP-RENGVSLAVKDVPSPTSNANNMAANS 420
Query: 421 QFALAPSVPHAPLRSPNNSNVSSVERKIASLDLKSGSTLEKRPSLSQVQSRNDFFNLIKK 480
QFALAPSVPHAPLRSPNNSNVSSVERKIASLDLKSGSTLEKRPSLSQVQSRNDFFNLIKK
Sbjct: 421 QFALAPSVPHAPLRSPNNSNVSSVERKIASLDLKSGSTLEKRPSLSQVQSRNDFFNLIKK 480
Query: 481 KTSMSSSAVLSDSCSSVKSPSIGQSNELTREEINIPASPRVIENGAMETLNGDSSEEVRA 540
KT SSSA+LSDSC +VKSP+IGQSNELTREEINIPASPRV+ENGA+ET NGDSSEEV+A
Sbjct: 481 KTPKSSSAILSDSCPAVKSPTIGQSNELTREEINIPASPRVVENGAVETRNGDSSEEVQA 540
Query: 541 SCDSGEKTESHVAAESLDEEEAAFLRSLGWDENCGEG-GLTEEEINSFYQELNSMNLKPS 600
SCDSGEK SHV AESLDEEEAAFLRSLGWDE+ GE GLTEEEINSFY+EL MNLKP
Sbjct: 541 SCDSGEKLASHVGAESLDEEEAAFLRSLGWDESYGEDEGLTEEEINSFYEELQYMNLKPP 600
Query: 601 LKIGRFIQRKIVVPSESPEGEGSSKDEADSELSSSDSEA 638
K+ R IQ KI VPSES E SKD A SELSSSDSEA
Sbjct: 601 TKMVRCIQPKIFVPSESHE---DSKDGAGSELSSSDSEA 635
BLAST of Lcy09g012070 vs. ExPASy TrEMBL
Match:
A0A6J1CIL7 (uncharacterized protein LOC111011833 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111011833 PE=4 SV=1)
HSP 1 Score: 1002.3 bits (2590), Expect = 9.2e-289
Identity = 561/639 (87.79%), Postives = 587/639 (91.86%), Query Frame = 0
Query: 1 MPVKVHPAQNRVFLMERSEPSLVPEWLRSTGNVTGGSNSNHHFPSSSSHSDVPSLSQSRN 60
MPVKVHP QNR FLMERSEP+LVPEWLRSTG+VTGG NSNHHFP SSSHSDV SL+QSRN
Sbjct: 1 MPVKVHPTQNRAFLMERSEPTLVPEWLRSTGSVTGGGNSNHHFPLSSSHSDVSSLAQSRN 60
Query: 61 RTSKTTGDFDTSRSAFLDRTSSSNSRRSSSNGSAKHAYSSFNRGHRDKDREKEKDRLSFG 120
RTSKT GDFDTSRSAFLDR+SSSNSRRSSSNGSAKHAYSSFNRGHRDKDREKEKDRLSFG
Sbjct: 61 RTSKTIGDFDTSRSAFLDRSSSSNSRRSSSNGSAKHAYSSFNRGHRDKDREKEKDRLSFG 120
Query: 121 DNWDRDSHDPLGKILSNRIDKDALRRSHSMVSRKQGELFHRRVATDLKAGVNSSHNNGNV 180
D+WDRDS DPLGKILSNRIDKDALRRSHSMVSRKQGELFHRR+ATD K GV+SS NNG
Sbjct: 121 DHWDRDSSDPLGKILSNRIDKDALRRSHSMVSRKQGELFHRRIATDPKIGVSSSLNNGTG 180
Query: 181 MPSGTSVGSSIQKAVFEKDFPSLGSEDRQGSSEIGRVSSPGLSSPVQSLPIGNSALIGGE 240
MPSGTSVGSSIQKAVFEKDFPSLGSE++QG+S+IGRVSSPGLSSPVQSLPIGNSALIGGE
Sbjct: 181 MPSGTSVGSSIQKAVFEKDFPSLGSEEKQGTSDIGRVSSPGLSSPVQSLPIGNSALIGGE 240
Query: 241 GWTSALAEVPNMIGSTTGSSSFQQTVPATSGAGPVSVTAGLNMAEALVQAPSRARATPQV 300
GWTSALAEVPN+IGS+TGSSSFQQTVPA SGAG +SVTAGLNMAEALVQAPSRARA PQ
Sbjct: 241 GWTSALAEVPNIIGSSTGSSSFQQTVPAISGAGLLSVTAGLNMAEALVQAPSRARAVPQ- 300
Query: 301 SELSVKTQRLEELAIKQSRQLIPVTPSMPKAMVLNSSDKSKPKLASRTGELNVTIKGGQP 360
L VKTQRLEELAIKQSRQLIPVTPSMPKAMVLNSSDKSKPKLASRTGELNVTIKGGQP
Sbjct: 301 --LFVKTQRLEELAIKQSRQLIPVTPSMPKAMVLNSSDKSKPKLASRTGELNVTIKGGQP 360
Query: 361 QPLSV-HANQSRGGPVKSDPQKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAANN 420
QPL V H NQ+RGG VKSD QKSSHGKFLVLKP RENGVSLA KDV SPTSNAN+MAAN+
Sbjct: 361 QPLPVHHTNQTRGGHVKSDAQKSSHGKFLVLKP-RENGVSLAVKDVPSPTSNANNMAANS 420
Query: 421 QFALAPSVPHAPLRSPNNSNVSSVERKIASLDLKSGSTLEKRPSLSQVQSRNDFFNLIKK 480
QFALAPSVPHAPLRSPNNSNVSSVERKIASLDLKSGSTLEKRPSLSQVQSRNDFFNLIKK
Sbjct: 421 QFALAPSVPHAPLRSPNNSNVSSVERKIASLDLKSGSTLEKRPSLSQVQSRNDFFNLIKK 480
Query: 481 KTSMSSSAVLSDSCSSVKSPSIGQSNELTREEINIPASPRVIENGAMETLNGDSSEEVRA 540
KT SSSA+LSDSC +VKSP+IGQSNELTREEINIPASPRV+ENGA+ET NGDSSEEV+A
Sbjct: 481 KTPKSSSAILSDSCPAVKSPTIGQSNELTREEINIPASPRVVENGAVETRNGDSSEEVQA 540
Query: 541 SCDSGEKTESHVAAESLDEEEAAFLRSLGWDENCGEG-GLTEEEINSFYQELNSMNLKPS 600
SCDSGEK SHV AESLDEEEAAFLRSLGWDE+ GE GLTEEEINSFY+EL MNLKP
Sbjct: 541 SCDSGEKLASHVGAESLDEEEAAFLRSLGWDESYGEDEGLTEEEINSFYEELQYMNLKPP 600
Query: 601 LKIGRFIQRKIVVPSESPEGEGSSKDEADSELSSSDSEA 638
K+ R IQ KI VPSES E SKD A SELSSSDSEA
Sbjct: 601 TKMVRCIQPKIFVPSESHE---DSKDGAGSELSSSDSEA 632
BLAST of Lcy09g012070 vs. ExPASy TrEMBL
Match:
A0A5D3DT29 (Mediator of RNA polymerase II transcription subunit 1 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold313G001070 PE=4 SV=1)
HSP 1 Score: 989.9 bits (2558), Expect = 4.7e-285
Identity = 555/625 (88.80%), Postives = 581/625 (92.96%), Query Frame = 0
Query: 15 MERSEPSLVPEWLRSTGNVTGGSNSNHHFPSSSSHSDVPSLSQSRNRTSKTTGDFDTSRS 74
MERSEP+LVPEWLRSTG+VTGG NSNHHFPSSSSHSDVPSLSQSRNR SKTTGDFDTSRS
Sbjct: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
Query: 75 AFLDRTSSSNSRRSSSNGSAKHAYSSFNRGHRDKDREKEKDRLSFGDNWDRDSHDPLGKI 134
+FLDRTSSSNSRRSSSNGS+KHAYSSFNRGHRDKDREKEKDRL+FGDNWDRD+HDPLGKI
Sbjct: 61 SFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDAHDPLGKI 120
Query: 135 LSNRIDKDALRRSHSMVSRKQGELFHRRVATDLKAGVNSSHNNGNVMPSGTSVGSSIQKA 194
LSNRIDKDALRRSHSMVSRKQGELFHRRV T+LK SHN+ N + SGTSVGSSIQKA
Sbjct: 121 LSNRIDKDALRRSHSMVSRKQGELFHRRVGTELK-----SHNSSNGILSGTSVGSSIQKA 180
Query: 195 VFEKDFPSLGSEDRQGSSEIGRVSSPGLSSPVQSLPIGNSALI-GGEGWTSALAEVPNMI 254
VFEKDFPSLGSE++QG+SEIGRVSSPGLSSPVQSLPIGNSALI GGEGWTSALAEVP+MI
Sbjct: 181 VFEKDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMI 240
Query: 255 GSTTGSSSFQQTVPATSGAGPVSVTAGLNMAEALVQAPSRARATPQVSELSVKTQRLEEL 314
GST GSSSFQQTVPATSGAGP+SVTAGLNMAEALVQ+PSR R PQVSELSVKTQRLEEL
Sbjct: 241 GSTPGSSSFQQTVPATSGAGPLSVTAGLNMAEALVQSPSRTRTAPQVSELSVKTQRLEEL 300
Query: 315 AIKQSRQLIPVTPSMPKAMVLNSSDKSKPKLASRTGELNVTIKGGQPQPLSVHANQSRGG 374
AIKQSRQLIPVTPSMPKAMVL+SSDKSKPKLASRTGELN TIKGGQPQP SVHANQSR G
Sbjct: 301 AIKQSRQLIPVTPSMPKAMVLSSSDKSKPKLASRTGELNATIKGGQPQPSSVHANQSRVG 360
Query: 375 PVKSDPQKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAANNQFALAPSVPHAPLR 434
VK D QKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAAN+QFALAPSVPHAPLR
Sbjct: 361 HVKPDAQKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLR 420
Query: 435 SPNNSNVSSVERKIASLDLKSGSTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDSC 494
SPNN+NVSSVERKIASLDLK+G+TLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDSC
Sbjct: 421 SPNNTNVSSVERKIASLDLKTGTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDSC 480
Query: 495 SSVKSPSIGQSNELTREEINIPASPRVIENGAMETLNGDSSEEVRASCDSGEKTESHVAA 554
SSVKSPSIGQSNELT EE+ IPASPRVIENGA+E NG+SSEEV+ S DSGEKTESHVAA
Sbjct: 481 SSVKSPSIGQSNELTSEEMGIPASPRVIENGAVENRNGNSSEEVQISRDSGEKTESHVAA 540
Query: 555 ESLDEEEAAFLRSLGWDENCGEG-GLTEEEINSFYQELNSMNLKPSLKIGRFIQRKIVVP 614
ESLDEEEAAFLRSLGWDE+CGE GLTEEEINSFY+E MNLKPSLKIGR IQ KI VP
Sbjct: 541 ESLDEEEAAFLRSLGWDESCGEDEGLTEEEINSFYREY--MNLKPSLKIGRCIQPKIFVP 600
Query: 615 SESPEGEGSSKDEADSELSSSDSEA 638
SES E S D A SELSSSDSEA
Sbjct: 601 SES--REDSKDDGAGSELSSSDSEA 616
BLAST of Lcy09g012070 vs. ExPASy TrEMBL
Match:
A0A1S3CDT9 (mediator of RNA polymerase II transcription subunit 1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103499274 PE=4 SV=1)
HSP 1 Score: 989.9 bits (2558), Expect = 4.7e-285
Identity = 555/625 (88.80%), Postives = 581/625 (92.96%), Query Frame = 0
Query: 15 MERSEPSLVPEWLRSTGNVTGGSNSNHHFPSSSSHSDVPSLSQSRNRTSKTTGDFDTSRS 74
MERSEP+LVPEWLRSTG+VTGG NSNHHFPSSSSHSDVPSLSQSRNR SKTTGDFDTSRS
Sbjct: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
Query: 75 AFLDRTSSSNSRRSSSNGSAKHAYSSFNRGHRDKDREKEKDRLSFGDNWDRDSHDPLGKI 134
+FLDRTSSSNSRRSSSNGS+KHAYSSFNRGHRDKDREKEKDRL+FGDNWDRD+HDPLGKI
Sbjct: 61 SFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDAHDPLGKI 120
Query: 135 LSNRIDKDALRRSHSMVSRKQGELFHRRVATDLKAGVNSSHNNGNVMPSGTSVGSSIQKA 194
LSNRIDKDALRRSHSMVSRKQGELFHRRV T+LK SHN+ N + SGTSVGSSIQKA
Sbjct: 121 LSNRIDKDALRRSHSMVSRKQGELFHRRVGTELK-----SHNSSNGILSGTSVGSSIQKA 180
Query: 195 VFEKDFPSLGSEDRQGSSEIGRVSSPGLSSPVQSLPIGNSALI-GGEGWTSALAEVPNMI 254
VFEKDFPSLGSE++QG+SEIGRVSSPGLSSPVQSLPIGNSALI GGEGWTSALAEVP+MI
Sbjct: 181 VFEKDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMI 240
Query: 255 GSTTGSSSFQQTVPATSGAGPVSVTAGLNMAEALVQAPSRARATPQVSELSVKTQRLEEL 314
GST GSSSFQQTVPATSGAGP+SVTAGLNMAEALVQ+PSR R PQVSELSVKTQRLEEL
Sbjct: 241 GSTPGSSSFQQTVPATSGAGPLSVTAGLNMAEALVQSPSRTRTAPQVSELSVKTQRLEEL 300
Query: 315 AIKQSRQLIPVTPSMPKAMVLNSSDKSKPKLASRTGELNVTIKGGQPQPLSVHANQSRGG 374
AIKQSRQLIPVTPSMPKAMVL+SSDKSKPKLASRTGELN TIKGGQPQP SVHANQSR G
Sbjct: 301 AIKQSRQLIPVTPSMPKAMVLSSSDKSKPKLASRTGELNATIKGGQPQPSSVHANQSRVG 360
Query: 375 PVKSDPQKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAANNQFALAPSVPHAPLR 434
VK D QKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAAN+QFALAPSVPHAPLR
Sbjct: 361 HVKPDAQKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLR 420
Query: 435 SPNNSNVSSVERKIASLDLKSGSTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDSC 494
SPNN+NVSSVERKIASLDLK+G+TLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDSC
Sbjct: 421 SPNNTNVSSVERKIASLDLKTGTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDSC 480
Query: 495 SSVKSPSIGQSNELTREEINIPASPRVIENGAMETLNGDSSEEVRASCDSGEKTESHVAA 554
SSVKSPSIGQSNELT EE+ IPASPRVIENGA+E NG+SSEEV+ S DSGEKTESHVAA
Sbjct: 481 SSVKSPSIGQSNELTSEEMGIPASPRVIENGAVENRNGNSSEEVQISRDSGEKTESHVAA 540
Query: 555 ESLDEEEAAFLRSLGWDENCGEG-GLTEEEINSFYQELNSMNLKPSLKIGRFIQRKIVVP 614
ESLDEEEAAFLRSLGWDE+CGE GLTEEEINSFY+E MNLKPSLKIGR IQ KI VP
Sbjct: 541 ESLDEEEAAFLRSLGWDESCGEDEGLTEEEINSFYREY--MNLKPSLKIGRCIQPKIFVP 600
Query: 615 SESPEGEGSSKDEADSELSSSDSEA 638
SES E S D A SELSSSDSEA
Sbjct: 601 SES--REDSKDDGAGSELSSSDSEA 616
BLAST of Lcy09g012070 vs. ExPASy TrEMBL
Match:
A0A6J1CI28 (flocculation protein FLO11 isoform X3 OS=Momordica charantia OX=3673 GN=LOC111011833 PE=4 SV=1)
HSP 1 Score: 986.9 bits (2550), Expect = 4.0e-284
Identity = 552/625 (88.32%), Postives = 578/625 (92.48%), Query Frame = 0
Query: 15 MERSEPSLVPEWLRSTGNVTGGSNSNHHFPSSSSHSDVPSLSQSRNRTSKTTGDFDTSRS 74
MERSEP+LVPEWLRSTG+VTGG NSNHHFP SSSHSDV SL+QSRNRTSKT GDFDTSRS
Sbjct: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPLSSSHSDVSSLAQSRNRTSKTIGDFDTSRS 60
Query: 75 AFLDRTSSSNSRRSSSNGSAKHAYSSFNRGHRDKDREKEKDRLSFGDNWDRDSHDPLGKI 134
AFLDR+SSSNSRRSSSNGSAKHAYSSFNRGHRDKDREKEKDRLSFGD+WDRDS DPLGKI
Sbjct: 61 AFLDRSSSSNSRRSSSNGSAKHAYSSFNRGHRDKDREKEKDRLSFGDHWDRDSSDPLGKI 120
Query: 135 LSNRIDKDALRRSHSMVSRKQGELFHRRVATDLKAGVNSSHNNGNVMPSGTSVGSSIQKA 194
LSNRIDKDALRRSHSMVSRKQGELFHRR+ATD K GV+SS NNG MPSGTSVGSSIQKA
Sbjct: 121 LSNRIDKDALRRSHSMVSRKQGELFHRRIATDPKIGVSSSLNNGTGMPSGTSVGSSIQKA 180
Query: 195 VFEKDFPSLGSEDRQGSSEIGRVSSPGLSSPVQSLPIGNSALIGGEGWTSALAEVPNMIG 254
VFEKDFPSLGSE++QG+S+IGRVSSPGLSSPVQSLPIGNSALIGGEGWTSALAEVPN+IG
Sbjct: 181 VFEKDFPSLGSEEKQGTSDIGRVSSPGLSSPVQSLPIGNSALIGGEGWTSALAEVPNIIG 240
Query: 255 STTGSSSFQQTVPATSGAGPVSVTAGLNMAEALVQAPSRARATPQVSELSVKTQRLEELA 314
S+TGSSSFQQTVPA SGAG +SVTAGLNMAEALVQAPSRARA PQVSEL VKTQRLEELA
Sbjct: 241 SSTGSSSFQQTVPAISGAGLLSVTAGLNMAEALVQAPSRARAVPQVSELFVKTQRLEELA 300
Query: 315 IKQSRQLIPVTPSMPKAMVLNSSDKSKPKLASRTGELNVTIKGGQPQPLSV-HANQSRGG 374
IKQSRQLIPVTPSMPKAMVLNSSDKSKPKLASRTGELNVTIKGGQPQPL V H NQ+RGG
Sbjct: 301 IKQSRQLIPVTPSMPKAMVLNSSDKSKPKLASRTGELNVTIKGGQPQPLPVHHTNQTRGG 360
Query: 375 PVKSDPQKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAANNQFALAPSVPHAPLR 434
VKSD QKSSHGKFLVLKP RENGVSLA KDV SPTSNAN+MAAN+QFALAPSVPHAPLR
Sbjct: 361 HVKSDAQKSSHGKFLVLKP-RENGVSLAVKDVPSPTSNANNMAANSQFALAPSVPHAPLR 420
Query: 435 SPNNSNVSSVERKIASLDLKSGSTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDSC 494
SPNNSNVSSVERKIASLDLKSGSTLEKRPSLSQVQSRNDFFNLIKKKT SSSA+LSDSC
Sbjct: 421 SPNNSNVSSVERKIASLDLKSGSTLEKRPSLSQVQSRNDFFNLIKKKTPKSSSAILSDSC 480
Query: 495 SSVKSPSIGQSNELTREEINIPASPRVIENGAMETLNGDSSEEVRASCDSGEKTESHVAA 554
+VKSP+IGQSNELTREEINIPASPRV+ENGA+ET NGDSSEEV+ASCDSGEK SHV A
Sbjct: 481 PAVKSPTIGQSNELTREEINIPASPRVVENGAVETRNGDSSEEVQASCDSGEKLASHVGA 540
Query: 555 ESLDEEEAAFLRSLGWDENCGEG-GLTEEEINSFYQELNSMNLKPSLKIGRFIQRKIVVP 614
ESLDEEEAAFLRSLGWDE+ GE GLTEEEINSFY+EL MNLKP K+ R IQ KI VP
Sbjct: 541 ESLDEEEAAFLRSLGWDESYGEDEGLTEEEINSFYEELQYMNLKPPTKMVRCIQPKIFVP 600
Query: 615 SESPEGEGSSKDEADSELSSSDSEA 638
SES E SKD A SELSSSDSEA
Sbjct: 601 SESHE---DSKDGAGSELSSSDSEA 621
BLAST of Lcy09g012070 vs. NCBI nr
Match:
XP_022141428.1 (flocculation protein FLO11 isoform X1 [Momordica charantia])
HSP 1 Score: 1012.7 bits (2617), Expect = 1.4e-291
Identity = 564/639 (88.26%), Postives = 590/639 (92.33%), Query Frame = 0
Query: 1 MPVKVHPAQNRVFLMERSEPSLVPEWLRSTGNVTGGSNSNHHFPSSSSHSDVPSLSQSRN 60
MPVKVHP QNR FLMERSEP+LVPEWLRSTG+VTGG NSNHHFP SSSHSDV SL+QSRN
Sbjct: 1 MPVKVHPTQNRAFLMERSEPTLVPEWLRSTGSVTGGGNSNHHFPLSSSHSDVSSLAQSRN 60
Query: 61 RTSKTTGDFDTSRSAFLDRTSSSNSRRSSSNGSAKHAYSSFNRGHRDKDREKEKDRLSFG 120
RTSKT GDFDTSRSAFLDR+SSSNSRRSSSNGSAKHAYSSFNRGHRDKDREKEKDRLSFG
Sbjct: 61 RTSKTIGDFDTSRSAFLDRSSSSNSRRSSSNGSAKHAYSSFNRGHRDKDREKEKDRLSFG 120
Query: 121 DNWDRDSHDPLGKILSNRIDKDALRRSHSMVSRKQGELFHRRVATDLKAGVNSSHNNGNV 180
D+WDRDS DPLGKILSNRIDKDALRRSHSMVSRKQGELFHRR+ATD K GV+SS NNG
Sbjct: 121 DHWDRDSSDPLGKILSNRIDKDALRRSHSMVSRKQGELFHRRIATDPKIGVSSSLNNGTG 180
Query: 181 MPSGTSVGSSIQKAVFEKDFPSLGSEDRQGSSEIGRVSSPGLSSPVQSLPIGNSALIGGE 240
MPSGTSVGSSIQKAVFEKDFPSLGSE++QG+S+IGRVSSPGLSSPVQSLPIGNSALIGGE
Sbjct: 181 MPSGTSVGSSIQKAVFEKDFPSLGSEEKQGTSDIGRVSSPGLSSPVQSLPIGNSALIGGE 240
Query: 241 GWTSALAEVPNMIGSTTGSSSFQQTVPATSGAGPVSVTAGLNMAEALVQAPSRARATPQV 300
GWTSALAEVPN+IGS+TGSSSFQQTVPA SGAG +SVTAGLNMAEALVQAPSRARA PQV
Sbjct: 241 GWTSALAEVPNIIGSSTGSSSFQQTVPAISGAGLLSVTAGLNMAEALVQAPSRARAVPQV 300
Query: 301 SELSVKTQRLEELAIKQSRQLIPVTPSMPKAMVLNSSDKSKPKLASRTGELNVTIKGGQP 360
SEL VKTQRLEELAIKQSRQLIPVTPSMPKAMVLNSSDKSKPKLASRTGELNVTIKGGQP
Sbjct: 301 SELFVKTQRLEELAIKQSRQLIPVTPSMPKAMVLNSSDKSKPKLASRTGELNVTIKGGQP 360
Query: 361 QPLSV-HANQSRGGPVKSDPQKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAANN 420
QPL V H NQ+RGG VKSD QKSSHGKFLVLKP RENGVSLA KDV SPTSNAN+MAAN+
Sbjct: 361 QPLPVHHTNQTRGGHVKSDAQKSSHGKFLVLKP-RENGVSLAVKDVPSPTSNANNMAANS 420
Query: 421 QFALAPSVPHAPLRSPNNSNVSSVERKIASLDLKSGSTLEKRPSLSQVQSRNDFFNLIKK 480
QFALAPSVPHAPLRSPNNSNVSSVERKIASLDLKSGSTLEKRPSLSQVQSRNDFFNLIKK
Sbjct: 421 QFALAPSVPHAPLRSPNNSNVSSVERKIASLDLKSGSTLEKRPSLSQVQSRNDFFNLIKK 480
Query: 481 KTSMSSSAVLSDSCSSVKSPSIGQSNELTREEINIPASPRVIENGAMETLNGDSSEEVRA 540
KT SSSA+LSDSC +VKSP+IGQSNELTREEINIPASPRV+ENGA+ET NGDSSEEV+A
Sbjct: 481 KTPKSSSAILSDSCPAVKSPTIGQSNELTREEINIPASPRVVENGAVETRNGDSSEEVQA 540
Query: 541 SCDSGEKTESHVAAESLDEEEAAFLRSLGWDENCGEG-GLTEEEINSFYQELNSMNLKPS 600
SCDSGEK SHV AESLDEEEAAFLRSLGWDE+ GE GLTEEEINSFY+EL MNLKP
Sbjct: 541 SCDSGEKLASHVGAESLDEEEAAFLRSLGWDESYGEDEGLTEEEINSFYEELQYMNLKPP 600
Query: 601 LKIGRFIQRKIVVPSESPEGEGSSKDEADSELSSSDSEA 638
K+ R IQ KI VPSES E SKD A SELSSSDSEA
Sbjct: 601 TKMVRCIQPKIFVPSESHE---DSKDGAGSELSSSDSEA 635
BLAST of Lcy09g012070 vs. NCBI nr
Match:
XP_022141429.1 (uncharacterized protein LOC111011833 isoform X2 [Momordica charantia])
HSP 1 Score: 1002.3 bits (2590), Expect = 1.9e-288
Identity = 561/639 (87.79%), Postives = 587/639 (91.86%), Query Frame = 0
Query: 1 MPVKVHPAQNRVFLMERSEPSLVPEWLRSTGNVTGGSNSNHHFPSSSSHSDVPSLSQSRN 60
MPVKVHP QNR FLMERSEP+LVPEWLRSTG+VTGG NSNHHFP SSSHSDV SL+QSRN
Sbjct: 1 MPVKVHPTQNRAFLMERSEPTLVPEWLRSTGSVTGGGNSNHHFPLSSSHSDVSSLAQSRN 60
Query: 61 RTSKTTGDFDTSRSAFLDRTSSSNSRRSSSNGSAKHAYSSFNRGHRDKDREKEKDRLSFG 120
RTSKT GDFDTSRSAFLDR+SSSNSRRSSSNGSAKHAYSSFNRGHRDKDREKEKDRLSFG
Sbjct: 61 RTSKTIGDFDTSRSAFLDRSSSSNSRRSSSNGSAKHAYSSFNRGHRDKDREKEKDRLSFG 120
Query: 121 DNWDRDSHDPLGKILSNRIDKDALRRSHSMVSRKQGELFHRRVATDLKAGVNSSHNNGNV 180
D+WDRDS DPLGKILSNRIDKDALRRSHSMVSRKQGELFHRR+ATD K GV+SS NNG
Sbjct: 121 DHWDRDSSDPLGKILSNRIDKDALRRSHSMVSRKQGELFHRRIATDPKIGVSSSLNNGTG 180
Query: 181 MPSGTSVGSSIQKAVFEKDFPSLGSEDRQGSSEIGRVSSPGLSSPVQSLPIGNSALIGGE 240
MPSGTSVGSSIQKAVFEKDFPSLGSE++QG+S+IGRVSSPGLSSPVQSLPIGNSALIGGE
Sbjct: 181 MPSGTSVGSSIQKAVFEKDFPSLGSEEKQGTSDIGRVSSPGLSSPVQSLPIGNSALIGGE 240
Query: 241 GWTSALAEVPNMIGSTTGSSSFQQTVPATSGAGPVSVTAGLNMAEALVQAPSRARATPQV 300
GWTSALAEVPN+IGS+TGSSSFQQTVPA SGAG +SVTAGLNMAEALVQAPSRARA PQ
Sbjct: 241 GWTSALAEVPNIIGSSTGSSSFQQTVPAISGAGLLSVTAGLNMAEALVQAPSRARAVPQ- 300
Query: 301 SELSVKTQRLEELAIKQSRQLIPVTPSMPKAMVLNSSDKSKPKLASRTGELNVTIKGGQP 360
L VKTQRLEELAIKQSRQLIPVTPSMPKAMVLNSSDKSKPKLASRTGELNVTIKGGQP
Sbjct: 301 --LFVKTQRLEELAIKQSRQLIPVTPSMPKAMVLNSSDKSKPKLASRTGELNVTIKGGQP 360
Query: 361 QPLSV-HANQSRGGPVKSDPQKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAANN 420
QPL V H NQ+RGG VKSD QKSSHGKFLVLKP RENGVSLA KDV SPTSNAN+MAAN+
Sbjct: 361 QPLPVHHTNQTRGGHVKSDAQKSSHGKFLVLKP-RENGVSLAVKDVPSPTSNANNMAANS 420
Query: 421 QFALAPSVPHAPLRSPNNSNVSSVERKIASLDLKSGSTLEKRPSLSQVQSRNDFFNLIKK 480
QFALAPSVPHAPLRSPNNSNVSSVERKIASLDLKSGSTLEKRPSLSQVQSRNDFFNLIKK
Sbjct: 421 QFALAPSVPHAPLRSPNNSNVSSVERKIASLDLKSGSTLEKRPSLSQVQSRNDFFNLIKK 480
Query: 481 KTSMSSSAVLSDSCSSVKSPSIGQSNELTREEINIPASPRVIENGAMETLNGDSSEEVRA 540
KT SSSA+LSDSC +VKSP+IGQSNELTREEINIPASPRV+ENGA+ET NGDSSEEV+A
Sbjct: 481 KTPKSSSAILSDSCPAVKSPTIGQSNELTREEINIPASPRVVENGAVETRNGDSSEEVQA 540
Query: 541 SCDSGEKTESHVAAESLDEEEAAFLRSLGWDENCGEG-GLTEEEINSFYQELNSMNLKPS 600
SCDSGEK SHV AESLDEEEAAFLRSLGWDE+ GE GLTEEEINSFY+EL MNLKP
Sbjct: 541 SCDSGEKLASHVGAESLDEEEAAFLRSLGWDESYGEDEGLTEEEINSFYEELQYMNLKPP 600
Query: 601 LKIGRFIQRKIVVPSESPEGEGSSKDEADSELSSSDSEA 638
K+ R IQ KI VPSES E SKD A SELSSSDSEA
Sbjct: 601 TKMVRCIQPKIFVPSESHE---DSKDGAGSELSSSDSEA 632
BLAST of Lcy09g012070 vs. NCBI nr
Match:
XP_008460469.1 (PREDICTED: mediator of RNA polymerase II transcription subunit 1 isoform X1 [Cucumis melo] >KAA0067384.1 mediator of RNA polymerase II transcription subunit 1 isoform X1 [Cucumis melo var. makuwa] >TYK26525.1 mediator of RNA polymerase II transcription subunit 1 isoform X1 [Cucumis melo var. makuwa])
HSP 1 Score: 989.9 bits (2558), Expect = 9.8e-285
Identity = 555/625 (88.80%), Postives = 581/625 (92.96%), Query Frame = 0
Query: 15 MERSEPSLVPEWLRSTGNVTGGSNSNHHFPSSSSHSDVPSLSQSRNRTSKTTGDFDTSRS 74
MERSEP+LVPEWLRSTG+VTGG NSNHHFPSSSSHSDVPSLSQSRNR SKTTGDFDTSRS
Sbjct: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
Query: 75 AFLDRTSSSNSRRSSSNGSAKHAYSSFNRGHRDKDREKEKDRLSFGDNWDRDSHDPLGKI 134
+FLDRTSSSNSRRSSSNGS+KHAYSSFNRGHRDKDREKEKDRL+FGDNWDRD+HDPLGKI
Sbjct: 61 SFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDAHDPLGKI 120
Query: 135 LSNRIDKDALRRSHSMVSRKQGELFHRRVATDLKAGVNSSHNNGNVMPSGTSVGSSIQKA 194
LSNRIDKDALRRSHSMVSRKQGELFHRRV T+LK SHN+ N + SGTSVGSSIQKA
Sbjct: 121 LSNRIDKDALRRSHSMVSRKQGELFHRRVGTELK-----SHNSSNGILSGTSVGSSIQKA 180
Query: 195 VFEKDFPSLGSEDRQGSSEIGRVSSPGLSSPVQSLPIGNSALI-GGEGWTSALAEVPNMI 254
VFEKDFPSLGSE++QG+SEIGRVSSPGLSSPVQSLPIGNSALI GGEGWTSALAEVP+MI
Sbjct: 181 VFEKDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMI 240
Query: 255 GSTTGSSSFQQTVPATSGAGPVSVTAGLNMAEALVQAPSRARATPQVSELSVKTQRLEEL 314
GST GSSSFQQTVPATSGAGP+SVTAGLNMAEALVQ+PSR R PQVSELSVKTQRLEEL
Sbjct: 241 GSTPGSSSFQQTVPATSGAGPLSVTAGLNMAEALVQSPSRTRTAPQVSELSVKTQRLEEL 300
Query: 315 AIKQSRQLIPVTPSMPKAMVLNSSDKSKPKLASRTGELNVTIKGGQPQPLSVHANQSRGG 374
AIKQSRQLIPVTPSMPKAMVL+SSDKSKPKLASRTGELN TIKGGQPQP SVHANQSR G
Sbjct: 301 AIKQSRQLIPVTPSMPKAMVLSSSDKSKPKLASRTGELNATIKGGQPQPSSVHANQSRVG 360
Query: 375 PVKSDPQKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAANNQFALAPSVPHAPLR 434
VK D QKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAAN+QFALAPSVPHAPLR
Sbjct: 361 HVKPDAQKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLR 420
Query: 435 SPNNSNVSSVERKIASLDLKSGSTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDSC 494
SPNN+NVSSVERKIASLDLK+G+TLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDSC
Sbjct: 421 SPNNTNVSSVERKIASLDLKTGTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDSC 480
Query: 495 SSVKSPSIGQSNELTREEINIPASPRVIENGAMETLNGDSSEEVRASCDSGEKTESHVAA 554
SSVKSPSIGQSNELT EE+ IPASPRVIENGA+E NG+SSEEV+ S DSGEKTESHVAA
Sbjct: 481 SSVKSPSIGQSNELTSEEMGIPASPRVIENGAVENRNGNSSEEVQISRDSGEKTESHVAA 540
Query: 555 ESLDEEEAAFLRSLGWDENCGEG-GLTEEEINSFYQELNSMNLKPSLKIGRFIQRKIVVP 614
ESLDEEEAAFLRSLGWDE+CGE GLTEEEINSFY+E MNLKPSLKIGR IQ KI VP
Sbjct: 541 ESLDEEEAAFLRSLGWDESCGEDEGLTEEEINSFYREY--MNLKPSLKIGRCIQPKIFVP 600
Query: 615 SESPEGEGSSKDEADSELSSSDSEA 638
SES E S D A SELSSSDSEA
Sbjct: 601 SES--REDSKDDGAGSELSSSDSEA 616
BLAST of Lcy09g012070 vs. NCBI nr
Match:
XP_022141430.1 (flocculation protein FLO11 isoform X3 [Momordica charantia] >XP_022141431.1 flocculation protein FLO11 isoform X3 [Momordica charantia])
HSP 1 Score: 986.9 bits (2550), Expect = 8.3e-284
Identity = 552/625 (88.32%), Postives = 578/625 (92.48%), Query Frame = 0
Query: 15 MERSEPSLVPEWLRSTGNVTGGSNSNHHFPSSSSHSDVPSLSQSRNRTSKTTGDFDTSRS 74
MERSEP+LVPEWLRSTG+VTGG NSNHHFP SSSHSDV SL+QSRNRTSKT GDFDTSRS
Sbjct: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPLSSSHSDVSSLAQSRNRTSKTIGDFDTSRS 60
Query: 75 AFLDRTSSSNSRRSSSNGSAKHAYSSFNRGHRDKDREKEKDRLSFGDNWDRDSHDPLGKI 134
AFLDR+SSSNSRRSSSNGSAKHAYSSFNRGHRDKDREKEKDRLSFGD+WDRDS DPLGKI
Sbjct: 61 AFLDRSSSSNSRRSSSNGSAKHAYSSFNRGHRDKDREKEKDRLSFGDHWDRDSSDPLGKI 120
Query: 135 LSNRIDKDALRRSHSMVSRKQGELFHRRVATDLKAGVNSSHNNGNVMPSGTSVGSSIQKA 194
LSNRIDKDALRRSHSMVSRKQGELFHRR+ATD K GV+SS NNG MPSGTSVGSSIQKA
Sbjct: 121 LSNRIDKDALRRSHSMVSRKQGELFHRRIATDPKIGVSSSLNNGTGMPSGTSVGSSIQKA 180
Query: 195 VFEKDFPSLGSEDRQGSSEIGRVSSPGLSSPVQSLPIGNSALIGGEGWTSALAEVPNMIG 254
VFEKDFPSLGSE++QG+S+IGRVSSPGLSSPVQSLPIGNSALIGGEGWTSALAEVPN+IG
Sbjct: 181 VFEKDFPSLGSEEKQGTSDIGRVSSPGLSSPVQSLPIGNSALIGGEGWTSALAEVPNIIG 240
Query: 255 STTGSSSFQQTVPATSGAGPVSVTAGLNMAEALVQAPSRARATPQVSELSVKTQRLEELA 314
S+TGSSSFQQTVPA SGAG +SVTAGLNMAEALVQAPSRARA PQVSEL VKTQRLEELA
Sbjct: 241 SSTGSSSFQQTVPAISGAGLLSVTAGLNMAEALVQAPSRARAVPQVSELFVKTQRLEELA 300
Query: 315 IKQSRQLIPVTPSMPKAMVLNSSDKSKPKLASRTGELNVTIKGGQPQPLSV-HANQSRGG 374
IKQSRQLIPVTPSMPKAMVLNSSDKSKPKLASRTGELNVTIKGGQPQPL V H NQ+RGG
Sbjct: 301 IKQSRQLIPVTPSMPKAMVLNSSDKSKPKLASRTGELNVTIKGGQPQPLPVHHTNQTRGG 360
Query: 375 PVKSDPQKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAANNQFALAPSVPHAPLR 434
VKSD QKSSHGKFLVLKP RENGVSLA KDV SPTSNAN+MAAN+QFALAPSVPHAPLR
Sbjct: 361 HVKSDAQKSSHGKFLVLKP-RENGVSLAVKDVPSPTSNANNMAANSQFALAPSVPHAPLR 420
Query: 435 SPNNSNVSSVERKIASLDLKSGSTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDSC 494
SPNNSNVSSVERKIASLDLKSGSTLEKRPSLSQVQSRNDFFNLIKKKT SSSA+LSDSC
Sbjct: 421 SPNNSNVSSVERKIASLDLKSGSTLEKRPSLSQVQSRNDFFNLIKKKTPKSSSAILSDSC 480
Query: 495 SSVKSPSIGQSNELTREEINIPASPRVIENGAMETLNGDSSEEVRASCDSGEKTESHVAA 554
+VKSP+IGQSNELTREEINIPASPRV+ENGA+ET NGDSSEEV+ASCDSGEK SHV A
Sbjct: 481 PAVKSPTIGQSNELTREEINIPASPRVVENGAVETRNGDSSEEVQASCDSGEKLASHVGA 540
Query: 555 ESLDEEEAAFLRSLGWDENCGEG-GLTEEEINSFYQELNSMNLKPSLKIGRFIQRKIVVP 614
ESLDEEEAAFLRSLGWDE+ GE GLTEEEINSFY+EL MNLKP K+ R IQ KI VP
Sbjct: 541 ESLDEEEAAFLRSLGWDESYGEDEGLTEEEINSFYEELQYMNLKPPTKMVRCIQPKIFVP 600
Query: 615 SESPEGEGSSKDEADSELSSSDSEA 638
SES E SKD A SELSSSDSEA
Sbjct: 601 SESHE---DSKDGAGSELSSSDSEA 621
BLAST of Lcy09g012070 vs. NCBI nr
Match:
XP_008460470.1 (PREDICTED: mediator of RNA polymerase II transcription subunit 1 isoform X2 [Cucumis melo])
HSP 1 Score: 979.5 bits (2531), Expect = 1.3e-281
Identity = 552/625 (88.32%), Postives = 578/625 (92.48%), Query Frame = 0
Query: 15 MERSEPSLVPEWLRSTGNVTGGSNSNHHFPSSSSHSDVPSLSQSRNRTSKTTGDFDTSRS 74
MERSEP+LVPEWLRSTG+VTGG NSNHHFPSSSSHSDVPSLSQSRNR SKTTGDFDTSRS
Sbjct: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
Query: 75 AFLDRTSSSNSRRSSSNGSAKHAYSSFNRGHRDKDREKEKDRLSFGDNWDRDSHDPLGKI 134
+FLDRTSSSNSRRSSSNGS+KHAYSSFNRGHRDKDREKEKDRL+FGDNWDRD+HDPLGKI
Sbjct: 61 SFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDAHDPLGKI 120
Query: 135 LSNRIDKDALRRSHSMVSRKQGELFHRRVATDLKAGVNSSHNNGNVMPSGTSVGSSIQKA 194
LSNRIDKDALRRSHSMVSRKQGELFHRRV T+LK SHN+ N + SGTSVGSSIQKA
Sbjct: 121 LSNRIDKDALRRSHSMVSRKQGELFHRRVGTELK-----SHNSSNGILSGTSVGSSIQKA 180
Query: 195 VFEKDFPSLGSEDRQGSSEIGRVSSPGLSSPVQSLPIGNSALI-GGEGWTSALAEVPNMI 254
VFEKDFPSLGSE++QG+SEIGRVSSPGLSSPVQSLPIGNSALI GGEGWTSALAEVP+MI
Sbjct: 181 VFEKDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMI 240
Query: 255 GSTTGSSSFQQTVPATSGAGPVSVTAGLNMAEALVQAPSRARATPQVSELSVKTQRLEEL 314
GST GSSSFQQTVPATSGAGP+SVTAGLNMAEALVQ+PSR R PQ LSVKTQRLEEL
Sbjct: 241 GSTPGSSSFQQTVPATSGAGPLSVTAGLNMAEALVQSPSRTRTAPQ---LSVKTQRLEEL 300
Query: 315 AIKQSRQLIPVTPSMPKAMVLNSSDKSKPKLASRTGELNVTIKGGQPQPLSVHANQSRGG 374
AIKQSRQLIPVTPSMPKAMVL+SSDKSKPKLASRTGELN TIKGGQPQP SVHANQSR G
Sbjct: 301 AIKQSRQLIPVTPSMPKAMVLSSSDKSKPKLASRTGELNATIKGGQPQPSSVHANQSRVG 360
Query: 375 PVKSDPQKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAANNQFALAPSVPHAPLR 434
VK D QKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAAN+QFALAPSVPHAPLR
Sbjct: 361 HVKPDAQKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLR 420
Query: 435 SPNNSNVSSVERKIASLDLKSGSTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDSC 494
SPNN+NVSSVERKIASLDLK+G+TLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDSC
Sbjct: 421 SPNNTNVSSVERKIASLDLKTGTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDSC 480
Query: 495 SSVKSPSIGQSNELTREEINIPASPRVIENGAMETLNGDSSEEVRASCDSGEKTESHVAA 554
SSVKSPSIGQSNELT EE+ IPASPRVIENGA+E NG+SSEEV+ S DSGEKTESHVAA
Sbjct: 481 SSVKSPSIGQSNELTSEEMGIPASPRVIENGAVENRNGNSSEEVQISRDSGEKTESHVAA 540
Query: 555 ESLDEEEAAFLRSLGWDENCGEG-GLTEEEINSFYQELNSMNLKPSLKIGRFIQRKIVVP 614
ESLDEEEAAFLRSLGWDE+CGE GLTEEEINSFY+E MNLKPSLKIGR IQ KI VP
Sbjct: 541 ESLDEEEAAFLRSLGWDESCGEDEGLTEEEINSFYREY--MNLKPSLKIGRCIQPKIFVP 600
Query: 615 SESPEGEGSSKDEADSELSSSDSEA 638
SES E S D A SELSSSDSEA
Sbjct: 601 SES--REDSKDDGAGSELSSSDSEA 613
BLAST of Lcy09g012070 vs. TAIR 10
Match:
AT1G36990.1 (unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 14 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G08510.1); Has 5029 Blast hits to 1779 proteins in 339 species: Archae - 2; Bacteria - 1372; Metazoa - 990; Fungi - 933; Plants - 111; Viruses - 28; Other Eukaryotes - 1593 (source: NCBI BLink). )
HSP 1 Score: 426.0 bits (1094), Expect = 5.3e-119
Identity = 297/587 (50.60%), Postives = 373/587 (63.54%), Query Frame = 0
Query: 15 MERSEPSLVPEWLRSTGNVTGGSNSNHHFPSSSSHSDVPSLS-QSRNRTSKTTGDFDTSR 74
M++ E SL PEWLRS+G+ +GG +SNH SSSSHSD SL SRNR S++ D D+
Sbjct: 1 MDKGEHSLAPEWLRSSGHASGGGSSNHLLVSSSSHSDSASLQYNSRNRNSRSKSDVDSIH 60
Query: 75 SAFLDRTSSSNSRRSSSNGSAKHAYSS--FNRGHRDKDREKEKDRLSFGDNWDRDSHDPL 134
S FLDR+SS+NSRR SSNGSAKHAYSS FNR RDKDR ++KDR+S+ D WD D+ PL
Sbjct: 61 SPFLDRSSSTNSRRGSSNGSAKHAYSSFNFNRSQRDKDRSRDKDRVSYVDPWDLDTSIPL 120
Query: 135 GKILSNRIDKDALRRSHSMVSRKQGELFHRRVATDLKAGVNSSHNNGNVMPSGTSVGSSI 194
IL+ R D D LRRSHSMV+RKQGE R + L G +S+ NGN + SG S+G+S
Sbjct: 121 RTILTGR-DPDPLRRSHSMVTRKQGEHLSRGLTVGLNNGGSSNSYNGNGLLSGPSIGNSF 180
Query: 195 QKAVFEKDFPSLGSEDRQGSSEIGRVSSPGLSSPVQSLPIGNSALIGGEGWTSALAEVPN 254
Q+ F+KDFPSLG+E++Q ++ RVSSPG+SS VQ+LP+GNSALIGGEGWTSALAEVPN
Sbjct: 181 QRTGFDKDFPSLGAEEKQNGQDVVRVSSPGISSVVQNLPVGNSALIGGEGWTSALAEVPN 240
Query: 255 MIGSTTGSSSFQQTVPATSGAGPVSVTAGLNMAEALVQAPSRARATPQVSELSVKTQRLE 314
+I S A S AG ++ +GLNMAEALVQAP+R PQ SVKTQRLE
Sbjct: 241 VIEKACTGSLTSPKANAVS-AGTLTGPSGLNMAEALVQAPARTHTPPQG---SVKTQRLE 300
Query: 315 ELAIKQSRQLIPVTPSMPKAMVLNSSDKSKPKLASRTGELNVTIKGG---QPQPLSVHAN 374
+LAIKQSRQLIPV PS PK + LNSSDKSK K RTGE + QP L
Sbjct: 301 DLAIKQSRQLIPVVPSAPKGLSLNSSDKSKTKQVVRTGETCLAPSRNALQQPAVLLGSFQ 360
Query: 375 QSRGGPVKSDPQKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAANNQ-FALAPSV 434
+ G +K P+K K LVLKP RENGVS A K+ SP++N N+ AA++Q + S
Sbjct: 361 SNPSGQIK--PEK----KLLVLKPARENGVS-AVKESGSPSANTNTRAASSQLMSNTQST 420
Query: 435 PHAPLRSPNNSNVSSVERKIAS-LDLKSGSTLEKRPSLSQVQSRNDFFNLIKKKTSMSSS 494
AP+RS N S E K AS + SG T+EK+PS +Q QSR+ F++ +K+K + S+S
Sbjct: 421 QSAPVRSTN----SPKELKGASAFSMISGQTIEKKPSAAQAQSRSAFYSALKQKQTASTS 480
Query: 495 AVLSDSCSSVKSPSIGQSNELTREEINIPASPRVIENGAMETLNGDSSEEVRASCDSGEK 554
+ +D SS S S +L + I + P + S EV S
Sbjct: 481 -ITTDPVSSSTSASSSVEVKLNSSKDLIASDP--------SSSQATSGVEVTDSVQVASH 540
Query: 555 TESHVAAESLDEEEAAFLRSLGWDENCGEGGLTEEEINSFYQELNSM 594
T A ++ DEEEA FLRSLGW EN GE LTEEEI+SF ++ +
Sbjct: 541 TSGFEATDTPDEEEAQFLRSLGWVENNGEEYLTEEEIDSFLEQYKEL 562
BLAST of Lcy09g012070 vs. TAIR 10
Match:
AT4G08510.1 (unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G36990.1); Has 888 Blast hits to 321 proteins in 121 species: Archae - 0; Bacteria - 120; Metazoa - 86; Fungi - 24; Plants - 79; Viruses - 0; Other Eukaryotes - 579 (source: NCBI BLink). )
HSP 1 Score: 313.9 bits (803), Expect = 2.9e-85
Identity = 251/592 (42.40%), Postives = 329/592 (55.57%), Query Frame = 0
Query: 15 MERSEPSLVPEWLRSTGNVTGGSNSNHHFPSSSSHSDVPSLSQSRNRTSKTTGDFDTSRS 74
ME+ EPSLVPEWLRS+G+ +G +SN S SD SL S+NR +++ D D+ S
Sbjct: 1 MEKREPSLVPEWLRSSGHGSGVGSSN-------SLSD--SLRNSKNRNARSRSDADSVGS 60
Query: 75 AFLDRTSSSNSRRSSSNGSAKHAYSS--FNRGHRDKDREKEKDRLSFGDNWDRDSHDPLG 134
FLDR+SS+N+RR SSNGS KHAYSS FNR +RDKDR +EKDR+S+ D WD DS P G
Sbjct: 61 PFLDRSSSTNTRRGSSNGSTKHAYSSFNFNRSNRDKDRSREKDRMSYMDPWDNDSSMPFG 120
Query: 135 KILSNRIDKDALRRSHSMVSRKQGELFHRRVATDLKAGVNSSHNNGNVMPSGTSVGSSIQ 194
L R ++ LRRSHSM +RKQG + K G N + NG+ + GTS S +
Sbjct: 121 TFLIGR-GEEPLRRSHSMTTRKQGNHLAQGFTVGYKNGGNINTFNGHGILPGTSPVKSSK 180
Query: 195 KAVFEKDFPSLGSEDRQGSSEIGRVSSPGLSSPVQSLPIGNSALIGGEGWTSALAEVPNM 254
+ F KDFP L E+R G ++ R+SSPG S QSL + N ALI GEGWTSALAEVPN+
Sbjct: 181 RMGFNKDFPLLRGEERNGGPDVVRISSPGRSPTAQSLSVANPALIIGEGWTSALAEVPNV 240
Query: 255 IGSTTGSSSFQQTVPATSGAGPVSVTAGLNMAEALVQAPSRARATPQVSELSVKTQRLEE 314
I + G+ S + + +GP A NMAEALVQAP R PQ Q LE+
Sbjct: 241 IEKSGGAESHANVGNSATLSGP----ACRNMAEALVQAPGRTGTPPQ-------AQTLED 300
Query: 315 LAIKQSRQLIPVTPSMPKAMVLNSSDKSKPKLASRTGELNVTIKGGQPQPLSV---HANQ 374
AI+QSRQLIPV PS PK V NSSDKSK K R+GE + Q SV +
Sbjct: 301 RAIRQSRQLIPVVPSAPKGSVHNSSDKSKTKPMFRSGETGLASSRNTQQQSSVMLGNMQS 360
Query: 375 SRGGPVKSDPQKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAANNQFALAPSVPH 434
+ G +K D K K ++LKP RENGV S NS A +Q APS
Sbjct: 361 NPGSQIKPDTTK----KLVILKPARENGVVAGG-------SPPNSRVAASQPTTAPSTQF 420
Query: 435 -APLRSPNNSNVSSVERKIASLDLKSGSTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAV 494
A +RS N + + AS+++ +G EK+ SL+Q QSR+ F++ +K+KT + S
Sbjct: 421 TASVRSTNGPR----DLRGASVNMLAGKAAEKKLSLAQTQSRHAFYSALKQKTCTNISTD 480
Query: 495 LSDSCSSVKSPSIGQSNELTREEINIPASPRVIENGAMETLNGDSSEEVRASCDSGEKTE 554
S + S + S Q+N + P+SP+ E + E V + E+
Sbjct: 481 PSKTSSCILSSVEEQANSSKELVASDPSSPQAAE-------RDEIMESVEKVSNVAERIS 540
Query: 555 SHVAAESLDEEEAAFLRSLGWDEN-CGEGGLTEEEINSFYQELNSMNLKPSL 600
+A D +EAAFL+SLGWDEN E T EE+ + ++ KPSL
Sbjct: 541 RFESAVRPDPKEAAFLKSLGWDENDSDEYTHTMEEMREWCKK-----FKPSL 544
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1CKG8 | 6.8e-292 | 88.26 | flocculation protein FLO11 isoform X1 OS=Momordica charantia OX=3673 GN=LOC11101... | [more] |
A0A6J1CIL7 | 9.2e-289 | 87.79 | uncharacterized protein LOC111011833 isoform X2 OS=Momordica charantia OX=3673 G... | [more] |
A0A5D3DT29 | 4.7e-285 | 88.80 | Mediator of RNA polymerase II transcription subunit 1 isoform X1 OS=Cucumis melo... | [more] |
A0A1S3CDT9 | 4.7e-285 | 88.80 | mediator of RNA polymerase II transcription subunit 1 isoform X1 OS=Cucumis melo... | [more] |
A0A6J1CI28 | 4.0e-284 | 88.32 | flocculation protein FLO11 isoform X3 OS=Momordica charantia OX=3673 GN=LOC11101... | [more] |
Match Name | E-value | Identity | Description | |
XP_022141428.1 | 1.4e-291 | 88.26 | flocculation protein FLO11 isoform X1 [Momordica charantia] | [more] |
XP_022141429.1 | 1.9e-288 | 87.79 | uncharacterized protein LOC111011833 isoform X2 [Momordica charantia] | [more] |
XP_008460469.1 | 9.8e-285 | 88.80 | PREDICTED: mediator of RNA polymerase II transcription subunit 1 isoform X1 [Cuc... | [more] |
XP_022141430.1 | 8.3e-284 | 88.32 | flocculation protein FLO11 isoform X3 [Momordica charantia] >XP_022141431.1 floc... | [more] |
XP_008460470.1 | 1.3e-281 | 88.32 | PREDICTED: mediator of RNA polymerase II transcription subunit 1 isoform X2 [Cuc... | [more] |
Match Name | E-value | Identity | Description | |
AT1G36990.1 | 5.3e-119 | 50.60 | unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 24 plant structures; EXP... | [more] |
AT4G08510.1 | 2.9e-85 | 42.40 | unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein matc... | [more] |