|
Sequences
The following sequences are available for this feature:
Gene sequence (with intron) Legend: exonfive_prime_UTRpolypeptideCDS Hold the cursor over a type above to highlight its positions in the sequence below. TTAGTCTGCAGTTTGTCAACTCAGACGCTCAAGCTATCCATTTGTCCAGAGCTCCCCTGCAATGGCGTCCGAGAGAAAGAAGGAGAAAAATGAGAAGTGGCTATGGTGATGAGCTTGATTGAACGTTGGCAACAGCGAAGAAGAAAAACAGAGTGCAATGGCCATGGCAATGGCAGCCCAAGCTTTTCTTCCATAGAAAGCTCGCTTGCTGTGTGGGTGGTGGGCCAACCACAACTGTCATCTATTCCTTACACTTTCACCCAACACCATCACCATCTTTTCTCTCTCTCTTTCCCTTTCCTTTTCTATCTCTCTTTCTCTTACCATTGTCAAATCTTTCCCCTATTCTGGTTCTTCTTTGCTTTCTTTTTCCTTCTTTTTCCTCCATGTCTTTAATGACGGCCGATATGCGAATTCTATGCGGAAAGATAGAGAGTCGGTGATGGATGAGCATCTGGTCCTCTTTTTTACTGGTTGTCGAGAGAGATTACTACATTTTAAAAGTCTTCTAGTTGTTGCTGTTGGTTTTGTTCTTCGTGTGGTCGGTGGAATTGCTGAAAATTTTGGACATCGAGAGCGGGGGTGGGGTGTGGTGTGTTCGTTCTAAAGTGGAGTGTGATGTTTTGGACTTTCTGTGATGGATTTGGAGTAAATTTAGTTAAACTTCGATCGATTTCATTTCGATTTGGAGAGTTTAATCGCATTTCTGAAGTTTTCGTTAAGGTTTTTGAATCGGTTCGAGATAATTGTGAAAGTGTACTGAATCAGAAGATACTAGAATTTGAGTTCTAATGCTTCTGATCAACCTCTGATAAGAAGAAATTGAGGAAAATATATTATGAAATCTTCGAGTTCATGAATTTTCATTTTGGAGATAATGTTCCCTCAGTTATCTTTTCTTCCAAGAAAACTACGAATCTCTCGATATTCTAAGATTTTGCTGAATTTCTGGGCACGTTACAACGCTTCGATCCGATTCCTTGCCATTTCAAGAAGCCGTGACTAAATTTTGCTGTGTCCGCCCGATTGGTCATCTAGATTCCATGCCTGTTTTTCGTCACACTGGATTTCTCAGGGTCAGCTCTCGAGGTTAGCTTGTCGGTGAATTTTCCGGCGAAACGAAAGCAAGAAATTCCGTTTGTAATCGTTTTCTAAGCTGTTTTATTCTTCGTTCCTGTTATCAGAGTTCATGCAACCATATTAGAGAAGAAAAATTTGGACGGAATCTTTGAAGAAAATGGGGAAGAATTCCAAAAGAATTATGGTTCGTTATGAGAAGGATCAGAGGGGCTGTATGTGGGGTTTGATTAGTTTATTTGATTTCCGCCATGGCCGCGCCTCCAGGAAGCTATTAGCAGACAAGAAGCGTCCTGGCAGGCAATCTGTTGGTAAGAATACTATTTTTCTTCTTCAGATTTTTCGTAGATCGAAAGATTTTAGGCTTTGTTGACGTTGAATTTCAACTTTTGACCTGCAAATTTCTTGTTGATCAAATTTCCTTGCGGATTGTTCTTTGGAGAATCATGATATCTTGAGTTTCAATTCTTTGGTAGTTTTAGCTTTCTTTTTCCCCTGTTTTTTCACCTGCTGCCTAGAATCTTTGTGTATATGCGAAAAACTGCAATAAATTCCTCCTTGAACTGCGTTGTTGCATGCTCCCTGAACCTTCCATTTGAACAGCTACATTTAGAGTTATGAAATTGTTGTGATTAAACACTATCATCATAATTTTGCTAGTGGCTGGGTTTAATTGCTAAGATTTCATTCGTTTGAAAGTATAATTGCAACAATCTTTATAATGTGTGGGTGATTAAGGGTTTAGGGTTTAATAGTTGTAGAACTTACTTGTATAGTTCATGTGCATTTTTCAAATTGCAAGTAATGGAGTTATAACGATAGAGTTTAATTGTAATCATATGTTTTGGTTAATTGCTTAAATTGATGTGTAACAACCCAAGCTCACCGCTAATAGATATTGTCCGCTTTGGCCCGTTACGTATCGCCGTTAGCCTCACGGTTTTTAAAACCCGTCTTCTAGGGAGAGGTTTCCACACCCTTATAAAGAATGTTTTGTTCCTCTCTCTAATCGAGGTAGGATCATACAATCCACCCCCTTCGGGGCCCAACGTCCTTGTTGGCACACCGTCCGGTGTGTAGCTCTAATACCATTTGTAATAGCCCAAACCCACCGCTAGCAAGTATTGTCCTCTTTGGCGGTGGGCTTGGGCTTGGGCTGTTACAAATGTTATCGAAGTCAAGTCACCGAACGGTGTGAGCCACTAGTCGGAGTAGGACTAGACCCTCTCCAGAGTAGATGCGTTTTAAAAACCTTGAGGAGACAATACCTGCTAGCGGTGGGCTTGGGCTGTTACATGATGATTCGAAGCTCAAGAGTGATTTGTGCTTTCTCAATGTGCATGGTATTTTGTTGGAAATGCAGCGTCACTTGGTGCTGAAGCTTCTCTTTCTTCAGCAGGTACTGGAAATTCAAGAAATAAGTTTGAGATTTTGGCTAATCTGGATGAAGATTGTAGTGTTACTTTGGTAAGTTTTCTCTATGCAGAACATTTTCTAGCAGAGTGAATATTTGGTTTATGATCTAATTGATTGGATGCTGTGATTTTCTCAGAACAGGGAAGAGTGTAAGACAGTAGACATTGGAAAGCCCAGTGTCAAGAAACTCATAGAAGAAGAGATGTTCAATGAGCTGGACGCGAAGAGAATCGAGCATGAACACTCAGGCCACTTAAAGATGAATGACCCGAAGAAAGTGAAGAAGAGTCGGAAGAAAAGCCGTGATATCGATGCTGATTCGTTTAATGCTGCGGAGTATTTGAAGGAACAAAGTGTTAATAATCTTCCTGTTGATGTGATGCTAAAGGAGATCTACAGTCAAATCCATCGGAAGAACACGAGTGACATGAAGTTTGACCCGAACGACAAAGCAGATATGCAATCAAATGAGTTCCTTGCTGATTTAGAACAAAAAATGGTTGATGCACTTAAGGAATATTTTGGCCAGAAGTTCGATATTGGGAAAGATTTCACTAAAATTCAGAAAGTTCAACACTCGAGAGAGATCATGGATGCACTTCAGATTCCCCATTCTGATGACAAGCTAGTTCACGAACTCGCACAAAACCCGAACTCTGTGTTACTCAAGTACATTAGAAATTTGCACGATTTGTCGATAGAAAAAGACATGGAACCCAAGTCACACGAGTTCAGTGAAGTTAGCCAGTCTGAGGAGATTGTTGATCATAAGCAACGTTTATTTTTCAAGAGAAAGGTCAAGAATCCGGGAAGGAACGTATCCAAAGGGAACGAAAATTCTGATGCGTCGAGTAAGATTGTAATCTTGAAGCCAGGGCCAAAAGGTTTGGTGAATTCTGAGGCTGATAGCATTCATCCATCAGTCCAAAATTCTACTGCTAACGATAAAACGAAAGTGCAGAACGAGAGGGCCGGTTCAAACTTTTTTCTTTCAGGAATCAAGAGGAAATTTAGATATGCTATGGGAAAAGATCACCATGAACACTCTGCTTATGGTTCAGATAGATTTTCTTGTGATCATCACAGTACAAAGGAGAGTGAAGAGGGTGTTCATAAGGAAGATAGTGCAAGAAATTCTACCAGCAAGGACCACTTTTTTATTGAAAGAATTTCCAGGCCTTCCACTGATAGCAAGAGAAGGGAGAAGGCTGGAAAACTCAAAAGCTTGGAAATTAATCAAGATTTGGGAAATGTTTATAACGGTAGGAGAAGCTCGTCTAATATTTATGTCGAGGCAAAGAAACATCTTTCCGAGATGCTTAGCAGTGGGGATGAAAGTGTAGACTTTTTAAGGGGACATGTGCCGAAGACCCTCGGACGGATTCTATCTCTTCCTGAGTATAGTTTTTCTCCCATTAGCAGTCCTAAAATGGACTGCAAGCTCAGTCCAGTAACTTCAGAGAAGAGAGTTTCTGTTGGTAGCCGACTTTTAAATGTTGATGAAATTACCCCGTTGCTCAAGGGCGAAAATGACGATACCCCAATCAGTGTGCAGCCTCCAACTGATGATAACCACGATATGAAGAGCGATATCGTTGATCAAAGTATAAGGGAAGAAGCTGTGAGTTCTAGTACTAATGGGAAGATTTCTGAAGGTATGTATCTCTGAATGGTCCAATGCATTGGATTGGCCAACCTTTTAGTGTTGTTTATCAATCCTTGATGTTCACTAATAAACGTGCAGGTGATATTGAGATTCTGAAGGTAAATGAAATTGCAGTCCATGAGGAGAGTAGTGTTTTGGAAGCTCCCTCTGGCTCAAGTGAATATTCTCTTTTGAGAGAAGATCAGAATGGTGAAATGGCCGACGATGCATGCGATGAGAGAACTGTTTCTTATGTTCCCTCCGACCCGATTGCTTCTTCCCCTATCAGAGAAGGCCACGATAATGACACACCAGACATGGGCGACGACAAACCTTCAATGAGCTTGTCCCAAGTAATATATCTTCTAACTTTCGTGTTTATATCTCGTCGTTTGTCTTACTATAAGTCTATCACATAACCGTAAGGTGAATTAGTTTATGATACATCTCATGCTGATGAAATTCCAAGGTGTAATGATAAGCCAGCTTCAGTGTTTATTGAAGCATTTTGCTTAAACTGTTTATTCTGTTTAGGATTCGTCTGAAGAAAATCAACTGTCTCCATCTCGATCGGCATCGTCTTCTAGCTCACCTACACAAGGGAAGGCTGTTGGAGATTTAGATGGTGTTTCCGATGTTCCAGAGCGGCCGAGCCCTGTTTCAGTTCTTGAGCCATTATTTGTAGATGATAACCTGAGCCCTCTGCATGCCATGGCTTTGCCTGGTAGGCACTAAGAACAGCGCGGAGTTTATCCGTGTTTTTCGTATTTCATTTTTTATCTATTCCAGTAGTTACCCTGGCTAATATCAGTGTATTCAATCAGCAGGGTTGCCAGTCCAACCAGTGCATATTGAATTTGAAGAACGCGAACCTGCAGAAACCGATAAGGCTAACGTCACGAAATCGTTAAAGGAAGATAAGGAAGTGATTTTTGACTATGTAAAGGCAGTGCTGTTGGCATCTGGACTTAGCTGGAACCAAATCTGTGGAAAGTGGCTCTCTTCAGAACAGCTTCTTTACTTGTTACTACTCGACGAGATAGAACTTTTCCCAAACCAGCTGTGCTCCGACCAGCAGCTCCTCTTTGACTGTATTAATGAAGTACTTGCTGACTTTTGTCAAAGCTATCCGCCATGGTTCTCGTTCATACAACCTTGTCTACGGTCCGAATATCTGCTCGAGATTTGTGAAGGAGTATATTGGCATCTTCTTCCAATGCCACAGCCCCTTACATTGGACCACCTTGTCAGGAAGGACATGAGCAGAACCAGAACATGGATGAACCTTCATTCTGATGCTGAAACCGTCGGTACCGAGACGTGTCAAGCCATATTCGAAGATTTATTGGACGATACCGTATTCAGCTGCGTGTATGATAGTTCAGAATTCGACGTCGGATTTGGCATGGAGACTGAAAACATGAGTGATGACTTTTAA mRNA sequence TTAGTCTGCAGTTTGTCAACTCAGACGCTCAAGCTATCCATTTGTCCAGAGCTCCCCTGCAATGGCGTCCGAGAGAAAGAAGGAGAAAAATGAGAAGTGGCTATGGTGATGAGCTTGATTGAACGTTGGCAACAGCGAAGAAGAAAAACAGAGTGCAATGGCCATGGCAATGGCAGCCCAAGCTTTTCTTCCATAGAAAGCTCGCTTGCTGTGTGGGTGGTGGGCCAACCACAACTGTCATCTATTCCTTACACTTTCACCCAACACCATCACCATCTTTTCTCTCTCTCTTTCCCTTTCCTTTTCTATCTCTCTTTCTCTTACCATTGTCAAATCTTTCCCCTATTCTGGTTCTTCTTTGCTTTCTTTTTCCTTCTTTTTCCTCCATGTCTTTAATGACGGCCGATATGCGAATTCTATGCGGAAAGATAGAGAGTCGGTGATGGATGAGCATCTGGTCCTCTTTTTTACTGGTTGTCGAGAGAGATTACTACATTTTAAAAGTCTTCTAGTTGTTGCTGTTGGTTTTGTTCTTCGTGTGGTCGGTGGAATTGCTGAAAATTTTGGACATCGAGAGCGGGGGTGGGGTGTGGTGTGTTCGTTCTAAAGTGGAGTGTGATGTTTTGGACTTTCTGTGATGGATTTGGAGTAAATTTAGTTAAACTTCGATCGATTTCATTTCGATTTGGAGAGTTTAATCGCATTTCTGAAGTTTTCGTTAAGGTTTTTGAATCGGTTCGAGATAATTGTGAAAGTGTACTGAATCAGAAGATACTAGAATTTGAGTTCTAATGCTTCTGATCAACCTCTGATAAGAAGAAATTGAGGAAAATATATTATGAAATCTTCGAGTTCATGAATTTTCATTTTGGAGATAATGTTCCCTCAGTTATCTTTTCTTCCAAGAAAACTACGAATCTCTCGATATTCTAAGATTTTGCTGAATTTCTGGGCACGTTACAACGCTTCGATCCGATTCCTTGCCATTTCAAGAAGCCGTGACTAAATTTTGCTGTGTCCGCCCGATTGGTCATCTAGATTCCATGCCTGTTTTTCGTCACACTGGATTTCTCAGGGTCAGCTCTCGAGAGTTCATGCAACCATATTAGAGAAGAAAAATTTGGACGGAATCTTTGAAGAAAATGGGGAAGAATTCCAAAAGAATTATGGTTCGTTATGAGAAGGATCAGAGGGGCTGTATGTGGGGTTTGATTAGTTTATTTGATTTCCGCCATGGCCGCGCCTCCAGGAAGCTATTAGCAGACAAGAAGCGTCCTGGCAGGCAATCTGTTGCAGGTACTGGAAATTCAAGAAATAAGTTTGAGATTTTGGCTAATCTGGATGAAGATTGTAGTGTTACTTTGAACAGGGAAGAGTGTAAGACAGTAGACATTGGAAAGCCCAGTGTCAAGAAACTCATAGAAGAAGAGATGTTCAATGAGCTGGACGCGAAGAGAATCGAGCATGAACACTCAGGCCACTTAAAGATGAATGACCCGAAGAAAGTGAAGAAGAGTCGGAAGAAAAGCCGTGATATCGATGCTGATTCGTTTAATGCTGCGGAGTATTTGAAGGAACAAAGTGTTAATAATCTTCCTGTTGATGTGATGCTAAAGGAGATCTACAGTCAAATCCATCGGAAGAACACGAGTGACATGAAGTTTGACCCGAACGACAAAGCAGATATGCAATCAAATGAGTTCCTTGCTGATTTAGAACAAAAAATGGTTGATGCACTTAAGGAATATTTTGGCCAGAAGTTCGATATTGGGAAAGATTTCACTAAAATTCAGAAAGTTCAACACTCGAGAGAGATCATGGATGCACTTCAGATTCCCCATTCTGATGACAAGCTAGTTCACGAACTCGCACAAAACCCGAACTCTGTGTTACTCAAGTACATTAGAAATTTGCACGATTTGTCGATAGAAAAAGACATGGAACCCAAGTCACACGAGTTCAGTGAAGTTAGCCAGTCTGAGGAGATTGTTGATCATAAGCAACGTTTATTTTTCAAGAGAAAGGTCAAGAATCCGGGAAGGAACGTATCCAAAGGGAACGAAAATTCTGATGCGTCGAGTAAGATTGTAATCTTGAAGCCAGGGCCAAAAGGTTTGGTGAATTCTGAGGCTGATAGCATTCATCCATCAGTCCAAAATTCTACTGCTAACGATAAAACGAAAGTGCAGAACGAGAGGGCCGGTTCAAACTTTTTTCTTTCAGGAATCAAGAGGAAATTTAGATATGCTATGGGAAAAGATCACCATGAACACTCTGCTTATGGTTCAGATAGATTTTCTTGTGATCATCACAGTACAAAGGAGAGTGAAGAGGGTGTTCATAAGGAAGATAGTGCAAGAAATTCTACCAGCAAGGACCACTTTTTTATTGAAAGAATTTCCAGGCCTTCCACTGATAGCAAGAGAAGGGAGAAGGCTGGAAAACTCAAAAGCTTGGAAATTAATCAAGATTTGGGAAATGTTTATAACGGTAGGAGAAGCTCGTCTAATATTTATGTCGAGGCAAAGAAACATCTTTCCGAGATGCTTAGCAGTGGGGATGAAAGTGTAGACTTTTTAAGGGGACATGTGCCGAAGACCCTCGGACGGATTCTATCTCTTCCTGAGTATAGTTTTTCTCCCATTAGCAGTCCTAAAATGGACTGCAAGCTCAGTCCAGTAACTTCAGAGAAGAGAGTTTCTGTTGGTAGCCGACTTTTAAATGTTGATGAAATTACCCCGTTGCTCAAGGGCGAAAATGACGATACCCCAATCAGTGTGCAGCCTCCAACTGATGATAACCACGATATGAAGAGCGATATCGTTGATCAAAGTATAAGGGAAGAAGCTGTGAGTTCTAGTACTAATGGGAAGATTTCTGAAGGTGATATTGAGATTCTGAAGGTAAATGAAATTGCAGTCCATGAGGAGAGTAGTGTTTTGGAAGCTCCCTCTGGCTCAAGTGAATATTCTCTTTTGAGAGAAGATCAGAATGGTGAAATGGCCGACGATGCATGCGATGAGAGAACTGTTTCTTATGTTCCCTCCGACCCGATTGCTTCTTCCCCTATCAGAGAAGGCCACGATAATGACACACCAGACATGGGCGACGACAAACCTTCAATGAGCTTGTCCCAAGATTCGTCTGAAGAAAATCAACTGTCTCCATCTCGATCGGCATCGTCTTCTAGCTCACCTACACAAGGGAAGGCTGTTGGAGATTTAGATGGTGTTTCCGATGTTCCAGAGCGGCCGAGCCCTGTTTCAGTTCTTGAGCCATTATTTGTAGATGATAACCTGAGCCCTCTGCATGCCATGGCTTTGCCTGCAGGGTTGCCAGTCCAACCAGTGCATATTGAATTTGAAGAACGCGAACCTGCAGAAACCGATAAGGCTAACGTCACGAAATCGTTAAAGGAAGATAAGGAAGTGATTTTTGACTATGTAAAGGCAGTGCTGTTGGCATCTGGACTTAGCTGGAACCAAATCTGTGGAAAGTGGCTCTCTTCAGAACAGCTTCTTTACTTGTTACTACTCGACGAGATAGAACTTTTCCCAAACCAGCTGTGCTCCGACCAGCAGCTCCTCTTTGACTGTATTAATGAAGTACTTGCTGACTTTTGTCAAAGCTATCCGCCATGGTTCTCGTTCATACAACCTTGTCTACGGTCCGAATATCTGCTCGAGATTTGTGAAGGAGTATATTGGCATCTTCTTCCAATGCCACAGCCCCTTACATTGGACCACCTTGTCAGGAAGGACATGAGCAGAACCAGAACATGGATGAACCTTCATTCTGATGCTGAAACCGTCGGTACCGAGACGTGTCAAGCCATATTCGAAGATTTATTGGACGATACCGTATTCAGCTGCGTGTATGATAGTTCAGAATTCGACGTCGGATTTGGCATGGAGACTGAAAACATGAGTGATGACTTTTAA Coding sequence (CDS) ATGGGGAAGAATTCCAAAAGAATTATGGTTCGTTATGAGAAGGATCAGAGGGGCTGTATGTGGGGTTTGATTAGTTTATTTGATTTCCGCCATGGCCGCGCCTCCAGGAAGCTATTAGCAGACAAGAAGCGTCCTGGCAGGCAATCTGTTGCAGGTACTGGAAATTCAAGAAATAAGTTTGAGATTTTGGCTAATCTGGATGAAGATTGTAGTGTTACTTTGAACAGGGAAGAGTGTAAGACAGTAGACATTGGAAAGCCCAGTGTCAAGAAACTCATAGAAGAAGAGATGTTCAATGAGCTGGACGCGAAGAGAATCGAGCATGAACACTCAGGCCACTTAAAGATGAATGACCCGAAGAAAGTGAAGAAGAGTCGGAAGAAAAGCCGTGATATCGATGCTGATTCGTTTAATGCTGCGGAGTATTTGAAGGAACAAAGTGTTAATAATCTTCCTGTTGATGTGATGCTAAAGGAGATCTACAGTCAAATCCATCGGAAGAACACGAGTGACATGAAGTTTGACCCGAACGACAAAGCAGATATGCAATCAAATGAGTTCCTTGCTGATTTAGAACAAAAAATGGTTGATGCACTTAAGGAATATTTTGGCCAGAAGTTCGATATTGGGAAAGATTTCACTAAAATTCAGAAAGTTCAACACTCGAGAGAGATCATGGATGCACTTCAGATTCCCCATTCTGATGACAAGCTAGTTCACGAACTCGCACAAAACCCGAACTCTGTGTTACTCAAGTACATTAGAAATTTGCACGATTTGTCGATAGAAAAAGACATGGAACCCAAGTCACACGAGTTCAGTGAAGTTAGCCAGTCTGAGGAGATTGTTGATCATAAGCAACGTTTATTTTTCAAGAGAAAGGTCAAGAATCCGGGAAGGAACGTATCCAAAGGGAACGAAAATTCTGATGCGTCGAGTAAGATTGTAATCTTGAAGCCAGGGCCAAAAGGTTTGGTGAATTCTGAGGCTGATAGCATTCATCCATCAGTCCAAAATTCTACTGCTAACGATAAAACGAAAGTGCAGAACGAGAGGGCCGGTTCAAACTTTTTTCTTTCAGGAATCAAGAGGAAATTTAGATATGCTATGGGAAAAGATCACCATGAACACTCTGCTTATGGTTCAGATAGATTTTCTTGTGATCATCACAGTACAAAGGAGAGTGAAGAGGGTGTTCATAAGGAAGATAGTGCAAGAAATTCTACCAGCAAGGACCACTTTTTTATTGAAAGAATTTCCAGGCCTTCCACTGATAGCAAGAGAAGGGAGAAGGCTGGAAAACTCAAAAGCTTGGAAATTAATCAAGATTTGGGAAATGTTTATAACGGTAGGAGAAGCTCGTCTAATATTTATGTCGAGGCAAAGAAACATCTTTCCGAGATGCTTAGCAGTGGGGATGAAAGTGTAGACTTTTTAAGGGGACATGTGCCGAAGACCCTCGGACGGATTCTATCTCTTCCTGAGTATAGTTTTTCTCCCATTAGCAGTCCTAAAATGGACTGCAAGCTCAGTCCAGTAACTTCAGAGAAGAGAGTTTCTGTTGGTAGCCGACTTTTAAATGTTGATGAAATTACCCCGTTGCTCAAGGGCGAAAATGACGATACCCCAATCAGTGTGCAGCCTCCAACTGATGATAACCACGATATGAAGAGCGATATCGTTGATCAAAGTATAAGGGAAGAAGCTGTGAGTTCTAGTACTAATGGGAAGATTTCTGAAGGTGATATTGAGATTCTGAAGGTAAATGAAATTGCAGTCCATGAGGAGAGTAGTGTTTTGGAAGCTCCCTCTGGCTCAAGTGAATATTCTCTTTTGAGAGAAGATCAGAATGGTGAAATGGCCGACGATGCATGCGATGAGAGAACTGTTTCTTATGTTCCCTCCGACCCGATTGCTTCTTCCCCTATCAGAGAAGGCCACGATAATGACACACCAGACATGGGCGACGACAAACCTTCAATGAGCTTGTCCCAAGATTCGTCTGAAGAAAATCAACTGTCTCCATCTCGATCGGCATCGTCTTCTAGCTCACCTACACAAGGGAAGGCTGTTGGAGATTTAGATGGTGTTTCCGATGTTCCAGAGCGGCCGAGCCCTGTTTCAGTTCTTGAGCCATTATTTGTAGATGATAACCTGAGCCCTCTGCATGCCATGGCTTTGCCTGCAGGGTTGCCAGTCCAACCAGTGCATATTGAATTTGAAGAACGCGAACCTGCAGAAACCGATAAGGCTAACGTCACGAAATCGTTAAAGGAAGATAAGGAAGTGATTTTTGACTATGTAAAGGCAGTGCTGTTGGCATCTGGACTTAGCTGGAACCAAATCTGTGGAAAGTGGCTCTCTTCAGAACAGCTTCTTTACTTGTTACTACTCGACGAGATAGAACTTTTCCCAAACCAGCTGTGCTCCGACCAGCAGCTCCTCTTTGACTGTATTAATGAAGTACTTGCTGACTTTTGTCAAAGCTATCCGCCATGGTTCTCGTTCATACAACCTTGTCTACGGTCCGAATATCTGCTCGAGATTTGTGAAGGAGTATATTGGCATCTTCTTCCAATGCCACAGCCCCTTACATTGGACCACCTTGTCAGGAAGGACATGAGCAGAACCAGAACATGGATGAACCTTCATTCTGATGCTGAAACCGTCGGTACCGAGACGTGTCAAGCCATATTCGAAGATTTATTGGACGATACCGTATTCAGCTGCGTGTATGATAGTTCAGAATTCGACGTCGGATTTGGCATGGAGACTGAAAACATGAGTGATGACTTTTAA Protein sequence MGKNSKRIMVRYEKDQRGCMWGLISLFDFRHGRASRKLLADKKRPGRQSVAGTGNSRNKFEILANLDEDCSVTLNREECKTVDIGKPSVKKLIEEEMFNELDAKRIEHEHSGHLKMNDPKKVKKSRKKSRDIDADSFNAAEYLKEQSVNNLPVDVMLKEIYSQIHRKNTSDMKFDPNDKADMQSNEFLADLEQKMVDALKEYFGQKFDIGKDFTKIQKVQHSREIMDALQIPHSDDKLVHELAQNPNSVLLKYIRNLHDLSIEKDMEPKSHEFSEVSQSEEIVDHKQRLFFKRKVKNPGRNVSKGNENSDASSKIVILKPGPKGLVNSEADSIHPSVQNSTANDKTKVQNERAGSNFFLSGIKRKFRYAMGKDHHEHSAYGSDRFSCDHHSTKESEEGVHKEDSARNSTSKDHFFIERISRPSTDSKRREKAGKLKSLEINQDLGNVYNGRRSSSNIYVEAKKHLSEMLSSGDESVDFLRGHVPKTLGRILSLPEYSFSPISSPKMDCKLSPVTSEKRVSVGSRLLNVDEITPLLKGENDDTPISVQPPTDDNHDMKSDIVDQSIREEAVSSSTNGKISEGDIEILKVNEIAVHEESSVLEAPSGSSEYSLLREDQNGEMADDACDERTVSYVPSDPIASSPIREGHDNDTPDMGDDKPSMSLSQDSSEENQLSPSRSASSSSSPTQGKAVGDLDGVSDVPERPSPVSVLEPLFVDDNLSPLHAMALPAGLPVQPVHIEFEEREPAETDKANVTKSLKEDKEVIFDYVKAVLLASGLSWNQICGKWLSSEQLLYLLLLDEIELFPNQLCSDQQLLFDCINEVLADFCQSYPPWFSFIQPCLRSEYLLEICEGVYWHLLPMPQPLTLDHLVRKDMSRTRTWMNLHSDAETVGTETCQAIFEDLLDDTVFSCVYDSSEFDVGFGMETENMSDDF
Homology
BLAST of CmaCh06G010670 vs. ExPASy Swiss-Prot
Match: F4HSD5 (Protein TRM32 OS=Arabidopsis thaliana OX=3702 GN=TRM32 PE=2 SV=1) HSP 1 Score: 72.8 bits (177), Expect = 2.3e-11 Identity = 54/165 (32.73%), Postives = 87/165 (52.73%), Query Frame = 0 Query: 753 VTKSLKEDKEVIFDYVKAVLLASGLSWNQIC-GKWLSSEQLLYLLLLDEIELFPNQLCSD 812 + K +ED++ F YVK VL SG N+ KW S EQ L LL E+++ + +D Sbjct: 391 IYKHEEEDEDAYFCYVKKVLKVSGFLENKDNEEKWYSEEQPLNPSLLYELDIQEEETVND 450
Query: 813 QQLLFDCINEVLADFCQSYPPWFSFIQPCLRSEYLLEICEGVYWHLLPM---PQPLTLDH 872 ++LLFD +NE + + +F P YL E+ V W L + + +LD Sbjct: 451 KELLFDLVNEAIVETQNHSQIYFPKTFP-YGKRYLDEVWGRVEWSLSGLGAENRDRSLDD 510
Query: 873 LVRKD-MSRTRTWMNLHSDAETVGTETCQAIFEDLLDDTVFSCVY 913 +V +D ++++ WMNL ++E + E IF+D+LD+ + CVY Sbjct: 511 IVGRDLLTKSDGWMNLQGESEWLTLELEDLIFDDVLDELL--CVY 552
BLAST of CmaCh06G010670 vs. TAIR 10
Match: AT4G00440.1 (Protein of unknown function (DUF3741) ) HSP 1 Score: 329.3 bits (843), Expect = 9.8e-90 Identity = 296/947 (31.26%), Postives = 447/947 (47.20%), Query Frame = 0 Query: 3 KNSKRIMVRYEKDQRGCMWGLISLFDFRHGRASRKLLADKKRPGRQSVAGTGNSR-NKFE 62 K +R R E++Q GCMWG +++F FRHG + KLL D+K +GN + N+ Sbjct: 4 KTQRRAAARQEREQLGCMWGFMNMFSFRHGPLAHKLLLDQKH-------ASGNPKNNELN 63
Query: 63 ILANLDEDCSVT-LNREECKTVDIGKPSVKKLIEEEMFNELDAKR-IEHEHSGHL----- 122 ++D T + E T+ I KPSVKKLI EE+ + + K+ E+ +G L Sbjct: 64 KSKFREKDAQETHVGEERNITITIIKPSVKKLIAEELSIDKEIKKQRENAEAGQLSDSEL 123
Query: 123 ---KMNDPKKVKKSRKKSRD-------IDADSFNAAEYLKEQSVNNLPVDVMLKEIYSQI 182 + + ++ K+RKKS D +D++ + +S ++ +D M++E YS+I Sbjct: 124 EGRRRKNQRRKNKTRKKSCDNFSHMNLVDSEE-PLVQRRNRRSARSVDIDNMIEEFYSEI 183
Query: 183 HRKNTSDMKFDPNDKADMQSNEFLADLEQKMVDALKEYFGQKFDIGKDFTKIQKVQHSRE 242 HR++TS K D D ++K+ + +K QK G ++ S++ Sbjct: 184 HRRSTSHAKND-------------EDYKEKLRELVKFLISQKLLHGNRPRGNSEILTSKD 243
Query: 243 IMDALQIPHSDDKLVHELAQNPNSVLLKYIRNLHDLSIEKDMEPKSHEFSEVSQSEEIVD 302 +M+ QI SD++L +L Q+P +L+ + LS+ + S+ + D Sbjct: 244 LMEVFQILGSDEELFLKLLQDP-EILVPREKGAESLSLVE------------SEQSSLAD 303
Query: 303 HKQRLFFKRKVKNPGRNVSKGNENSDASSKIVILKPGPKGLVNSEADSIHPSVQNSTAND 362 K FF+RK + E +AS +I ILKP A P + NS + Sbjct: 304 KKWSSFFRRK--------DEPQEECEASDRIFILKP-------RSASFSSPDIGNSRGSS 363
Query: 363 K------TKVQNERAGSNFFLSGIKRKFRYAMGKDHHEHSAYGSDRFSCDHHSTKESEEG 422 K+QNER S++FLS IKRK ++A+ K E A G EG Sbjct: 364 PDSHLMGNKLQNERNSSHYFLSEIKRKLKHAIKK---EQPAGGFG-------------EG 423
Query: 423 VHKEDSARNSTSKDHFFIERISRPSTDSKRREKAGKLKSLEINQDLGNVYNGRRSSSNIY 482 K +KDHFF+ER+++PST K+ K + SNIY Sbjct: 424 FPK--------TKDHFFLERMAKPSTSQKKSHSEDDKK---------------QRVSNIY 483
Query: 483 VEAKKHLSEMLSSGDESVDFLRGHVPKTLGRILSLPEYSFSPISSPKMDCKLSPVTSEKR 542 EAKKHLSEML++GD V ++LGRILS PEY SP++SP + S +K Sbjct: 484 TEAKKHLSEMLNNGDLDSKSTSRQVQRSLGRILSFPEY-LSPLNSPGRRWEKSSTAHKKS 543
Query: 543 VSVG-SRLLNVDEITPLLK-GENDDTPI---------SVQPPTDDNHDMKSDIVDQSIRE 602 S L+N+ + T + EN D + S+QP + ++ DI D++ E Sbjct: 544 ASADFINLVNIKKETHASQPEENTDIQVCNLSKEPDDSIQPIASEPNEKSVDIEDETANE 603
Query: 603 EAVSSSTNGKISEGDIEILKVNEIAVHEESSVLEAPSGSSEYSLLREDQNGEMADDACDE 662 + + S +GS ADD Sbjct: 604 DNMFS-------------------------------AGS--------------ADDVMIP 663
Query: 663 RTVSYVPSDPIASSPIREGHDNDTPDMGDDKPSMSLSQDSSEENQLSPSRSASSSSSPTQ 722 + VP + ASS + G + + + S++ Q S EE+Q P S+ +S S Sbjct: 664 NELDEVPEE--ASSTL-IGDLSKVEAQDEQRDSINSKQTSLEESQ-PPLSSSVASPSHCL 723
Query: 723 GKAVGDLDGVSDVPERPSPVSVLEPLFVDDNLSPLHAMALPAGLPVQPVHIEFEEREPAE 782 + ++D PE SP+SVLEPLFV+D++SP + VQP I F+E++PA Sbjct: 724 AQTEECKSAITDFPEWSSPISVLEPLFVEDDISPAKMRSQSGEAEVQPWCIHFDEKDPAP 783
Query: 783 TDKANVTKSLKEDKEVIFDYVKAVLLASGLSWNQICGKWLSSEQLLYLLLLDEIELFPNQ 842 T + N S+ DKE++F YVKAVL A ++ K S+QLL L+ I PNQ Sbjct: 784 TYREN---SVTSDKELVFKYVKAVLDAVDSDIEELYLKAQFSDQLLEPALISNIPFCPNQ 808
Query: 843 LCSDQQLLFDCINEVLADFCQSYPPWFSFIQPCLR-----SEYLLEICEGVYWHLLPMPQ 902 LC D +LLFDCINE L + C PPW SF+ P R + E+ E VYWHLLP+P Sbjct: 844 LCPDHELLFDCINEALMELC-CCPPWASFVTPRTRVFSTVKSIIHEVQEAVYWHLLPLPL 808
Query: 903 PLTLDHLVRKDMSRTRTWMNLHSDAETVGTETCQAIFEDLLDDTVFS 910 P LD +VRKDM+R W+++ D + +G ET + I +LL++ + + Sbjct: 904 PHALDQIVRKDMARAGNWLDIRCDIDCIGFETSELILNELLEELILN 808
BLAST of CmaCh06G010670 vs. TAIR 10
Match: AT4G00440.2 (Protein of unknown function (DUF3741) ) HSP 1 Score: 329.3 bits (843), Expect = 9.8e-90 Identity = 296/947 (31.26%), Postives = 447/947 (47.20%), Query Frame = 0 Query: 3 KNSKRIMVRYEKDQRGCMWGLISLFDFRHGRASRKLLADKKRPGRQSVAGTGNSR-NKFE 62 K +R R E++Q GCMWG +++F FRHG + KLL D+K +GN + N+ Sbjct: 4 KTQRRAAARQEREQLGCMWGFMNMFSFRHGPLAHKLLLDQKH-------ASGNPKNNELN 63
Query: 63 ILANLDEDCSVT-LNREECKTVDIGKPSVKKLIEEEMFNELDAKR-IEHEHSGHL----- 122 ++D T + E T+ I KPSVKKLI EE+ + + K+ E+ +G L Sbjct: 64 KSKFREKDAQETHVGEERNITITIIKPSVKKLIAEELSIDKEIKKQRENAEAGQLSDSEL 123
Query: 123 ---KMNDPKKVKKSRKKSRD-------IDADSFNAAEYLKEQSVNNLPVDVMLKEIYSQI 182 + + ++ K+RKKS D +D++ + +S ++ +D M++E YS+I Sbjct: 124 EGRRRKNQRRKNKTRKKSCDNFSHMNLVDSEE-PLVQRRNRRSARSVDIDNMIEEFYSEI 183
Query: 183 HRKNTSDMKFDPNDKADMQSNEFLADLEQKMVDALKEYFGQKFDIGKDFTKIQKVQHSRE 242 HR++TS K D D ++K+ + +K QK G ++ S++ Sbjct: 184 HRRSTSHAKND-------------EDYKEKLRELVKFLISQKLLHGNRPRGNSEILTSKD 243
Query: 243 IMDALQIPHSDDKLVHELAQNPNSVLLKYIRNLHDLSIEKDMEPKSHEFSEVSQSEEIVD 302 +M+ QI SD++L +L Q+P +L+ + LS+ + S+ + D Sbjct: 244 LMEVFQILGSDEELFLKLLQDP-EILVPREKGAESLSLVE------------SEQSSLAD 303
Query: 303 HKQRLFFKRKVKNPGRNVSKGNENSDASSKIVILKPGPKGLVNSEADSIHPSVQNSTAND 362 K FF+RK + E +AS +I ILKP A P + NS + Sbjct: 304 KKWSSFFRRK--------DEPQEECEASDRIFILKP-------RSASFSSPDIGNSRGSS 363
Query: 363 K------TKVQNERAGSNFFLSGIKRKFRYAMGKDHHEHSAYGSDRFSCDHHSTKESEEG 422 K+QNER S++FLS IKRK ++A+ K E A G EG Sbjct: 364 PDSHLMGNKLQNERNSSHYFLSEIKRKLKHAIKK---EQPAGGFG-------------EG 423
Query: 423 VHKEDSARNSTSKDHFFIERISRPSTDSKRREKAGKLKSLEINQDLGNVYNGRRSSSNIY 482 K +KDHFF+ER+++PST K+ K + SNIY Sbjct: 424 FPK--------TKDHFFLERMAKPSTSQKKSHSEDDKK---------------QRVSNIY 483
Query: 483 VEAKKHLSEMLSSGDESVDFLRGHVPKTLGRILSLPEYSFSPISSPKMDCKLSPVTSEKR 542 EAKKHLSEML++GD V ++LGRILS PEY SP++SP + S +K Sbjct: 484 TEAKKHLSEMLNNGDLDSKSTSRQVQRSLGRILSFPEY-LSPLNSPGRRWEKSSTAHKKS 543
Query: 543 VSVG-SRLLNVDEITPLLK-GENDDTPI---------SVQPPTDDNHDMKSDIVDQSIRE 602 S L+N+ + T + EN D + S+QP + ++ DI D++ E Sbjct: 544 ASADFINLVNIKKETHASQPEENTDIQVCNLSKEPDDSIQPIASEPNEKSVDIEDETANE 603
Query: 603 EAVSSSTNGKISEGDIEILKVNEIAVHEESSVLEAPSGSSEYSLLREDQNGEMADDACDE 662 + + S +GS ADD Sbjct: 604 DNMFS-------------------------------AGS--------------ADDVMIP 663
Query: 663 RTVSYVPSDPIASSPIREGHDNDTPDMGDDKPSMSLSQDSSEENQLSPSRSASSSSSPTQ 722 + VP + ASS + G + + + S++ Q S EE+Q P S+ +S S Sbjct: 664 NELDEVPEE--ASSTL-IGDLSKVEAQDEQRDSINSKQTSLEESQ-PPLSSSVASPSHCL 723
Query: 723 GKAVGDLDGVSDVPERPSPVSVLEPLFVDDNLSPLHAMALPAGLPVQPVHIEFEEREPAE 782 + ++D PE SP+SVLEPLFV+D++SP + VQP I F+E++PA Sbjct: 724 AQTEECKSAITDFPEWSSPISVLEPLFVEDDISPAKMRSQSGEAEVQPWCIHFDEKDPAP 783
Query: 783 TDKANVTKSLKEDKEVIFDYVKAVLLASGLSWNQICGKWLSSEQLLYLLLLDEIELFPNQ 842 T + N S+ DKE++F YVKAVL A ++ K S+QLL L+ I PNQ Sbjct: 784 TYREN---SVTSDKELVFKYVKAVLDAVDSDIEELYLKAQFSDQLLEPALISNIPFCPNQ 808
Query: 843 LCSDQQLLFDCINEVLADFCQSYPPWFSFIQPCLR-----SEYLLEICEGVYWHLLPMPQ 902 LC D +LLFDCINE L + C PPW SF+ P R + E+ E VYWHLLP+P Sbjct: 844 LCPDHELLFDCINEALMELC-CCPPWASFVTPRTRVFSTVKSIIHEVQEAVYWHLLPLPL 808
Query: 903 PLTLDHLVRKDMSRTRTWMNLHSDAETVGTETCQAIFEDLLDDTVFS 910 P LD +VRKDM+R W+++ D + +G ET + I +LL++ + + Sbjct: 904 PHALDQIVRKDMARAGNWLDIRCDIDCIGFETSELILNELLEELILN 808
BLAST of CmaCh06G010670 vs. TAIR 10
Match: AT4G00440.3 (Protein of unknown function (DUF3741) ) HSP 1 Score: 329.3 bits (843), Expect = 9.8e-90 Identity = 296/947 (31.26%), Postives = 447/947 (47.20%), Query Frame = 0 Query: 3 KNSKRIMVRYEKDQRGCMWGLISLFDFRHGRASRKLLADKKRPGRQSVAGTGNSR-NKFE 62 K +R R E++Q GCMWG +++F FRHG + KLL D+K +GN + N+ Sbjct: 4 KTQRRAAARQEREQLGCMWGFMNMFSFRHGPLAHKLLLDQKH-------ASGNPKNNELN 63
Query: 63 ILANLDEDCSVT-LNREECKTVDIGKPSVKKLIEEEMFNELDAKR-IEHEHSGHL----- 122 ++D T + E T+ I KPSVKKLI EE+ + + K+ E+ +G L Sbjct: 64 KSKFREKDAQETHVGEERNITITIIKPSVKKLIAEELSIDKEIKKQRENAEAGQLSDSEL 123
Query: 123 ---KMNDPKKVKKSRKKSRD-------IDADSFNAAEYLKEQSVNNLPVDVMLKEIYSQI 182 + + ++ K+RKKS D +D++ + +S ++ +D M++E YS+I Sbjct: 124 EGRRRKNQRRKNKTRKKSCDNFSHMNLVDSEE-PLVQRRNRRSARSVDIDNMIEEFYSEI 183
Query: 183 HRKNTSDMKFDPNDKADMQSNEFLADLEQKMVDALKEYFGQKFDIGKDFTKIQKVQHSRE 242 HR++TS K D D ++K+ + +K QK G ++ S++ Sbjct: 184 HRRSTSHAKND-------------EDYKEKLRELVKFLISQKLLHGNRPRGNSEILTSKD 243
Query: 243 IMDALQIPHSDDKLVHELAQNPNSVLLKYIRNLHDLSIEKDMEPKSHEFSEVSQSEEIVD 302 +M+ QI SD++L +L Q+P +L+ + LS+ + S+ + D Sbjct: 244 LMEVFQILGSDEELFLKLLQDP-EILVPREKGAESLSLVE------------SEQSSLAD 303
Query: 303 HKQRLFFKRKVKNPGRNVSKGNENSDASSKIVILKPGPKGLVNSEADSIHPSVQNSTAND 362 K FF+RK + E +AS +I ILKP A P + NS + Sbjct: 304 KKWSSFFRRK--------DEPQEECEASDRIFILKP-------RSASFSSPDIGNSRGSS 363
Query: 363 K------TKVQNERAGSNFFLSGIKRKFRYAMGKDHHEHSAYGSDRFSCDHHSTKESEEG 422 K+QNER S++FLS IKRK ++A+ K E A G EG Sbjct: 364 PDSHLMGNKLQNERNSSHYFLSEIKRKLKHAIKK---EQPAGGFG-------------EG 423
Query: 423 VHKEDSARNSTSKDHFFIERISRPSTDSKRREKAGKLKSLEINQDLGNVYNGRRSSSNIY 482 K +KDHFF+ER+++PST K+ K + SNIY Sbjct: 424 FPK--------TKDHFFLERMAKPSTSQKKSHSEDDKK---------------QRVSNIY 483
Query: 483 VEAKKHLSEMLSSGDESVDFLRGHVPKTLGRILSLPEYSFSPISSPKMDCKLSPVTSEKR 542 EAKKHLSEML++GD V ++LGRILS PEY SP++SP + S +K Sbjct: 484 TEAKKHLSEMLNNGDLDSKSTSRQVQRSLGRILSFPEY-LSPLNSPGRRWEKSSTAHKKS 543
Query: 543 VSVG-SRLLNVDEITPLLK-GENDDTPI---------SVQPPTDDNHDMKSDIVDQSIRE 602 S L+N+ + T + EN D + S+QP + ++ DI D++ E Sbjct: 544 ASADFINLVNIKKETHASQPEENTDIQVCNLSKEPDDSIQPIASEPNEKSVDIEDETANE 603
Query: 603 EAVSSSTNGKISEGDIEILKVNEIAVHEESSVLEAPSGSSEYSLLREDQNGEMADDACDE 662 + + S +GS ADD Sbjct: 604 DNMFS-------------------------------AGS--------------ADDVMIP 663
Query: 663 RTVSYVPSDPIASSPIREGHDNDTPDMGDDKPSMSLSQDSSEENQLSPSRSASSSSSPTQ 722 + VP + ASS + G + + + S++ Q S EE+Q P S+ +S S Sbjct: 664 NELDEVPEE--ASSTL-IGDLSKVEAQDEQRDSINSKQTSLEESQ-PPLSSSVASPSHCL 723
Query: 723 GKAVGDLDGVSDVPERPSPVSVLEPLFVDDNLSPLHAMALPAGLPVQPVHIEFEEREPAE 782 + ++D PE SP+SVLEPLFV+D++SP + VQP I F+E++PA Sbjct: 724 AQTEECKSAITDFPEWSSPISVLEPLFVEDDISPAKMRSQSGEAEVQPWCIHFDEKDPAP 783
Query: 783 TDKANVTKSLKEDKEVIFDYVKAVLLASGLSWNQICGKWLSSEQLLYLLLLDEIELFPNQ 842 T + N S+ DKE++F YVKAVL A ++ K S+QLL L+ I PNQ Sbjct: 784 TYREN---SVTSDKELVFKYVKAVLDAVDSDIEELYLKAQFSDQLLEPALISNIPFCPNQ 808
Query: 843 LCSDQQLLFDCINEVLADFCQSYPPWFSFIQPCLR-----SEYLLEICEGVYWHLLPMPQ 902 LC D +LLFDCINE L + C PPW SF+ P R + E+ E VYWHLLP+P Sbjct: 844 LCPDHELLFDCINEALMELC-CCPPWASFVTPRTRVFSTVKSIIHEVQEAVYWHLLPLPL 808
Query: 903 PLTLDHLVRKDMSRTRTWMNLHSDAETVGTETCQAIFEDLLDDTVFS 910 P LD +VRKDM+R W+++ D + +G ET + I +LL++ + + Sbjct: 904 PHALDQIVRKDMARAGNWLDIRCDIDCIGFETSELILNELLEELILN 808
BLAST of CmaCh06G010670 vs. TAIR 10
Match: AT2G45900.1 (Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-related ) HSP 1 Score: 223.4 bits (568), Expect = 7.6e-58 Identity = 248/902 (27.49%), Postives = 400/902 (44.35%), Query Frame = 0 Query: 18 GCMWGLISLFDFRHGRASRKLLADKKRPGRQSVAG-TGNSRNKFEILANLDEDCSVTLNR 77 GC+W +S+FDFRHG +++KLL DKKR ++ + G G K E D DC Sbjct: 16 GCVWVFMSMFDFRHGGSNQKLLMDKKRRSKRIIIGNNGVFETKVEKQLTCDCDCDEESEA 75
Query: 78 EECKTVDIGKPSVKKLIEEEMFNELDAKRIEHEHSGHLKMNDPKKVKKSRKKSRDIDADS 137 E SVKKLIE E+ D K +K ++R Sbjct: 76 E--------MQSVKKLIEAEI--------------------DEKTTQKCEARNRK----- 135
Query: 138 FNAAEYLKEQSVNNLPVDVMLKEIYSQIHRKNTSDMKFDPNDKADMQSNEFLADLEQKMV 197 + ++ + + D+ N D +D A+ NE Sbjct: 136 ------RRSRTCSKISEDI------------NVLIAGDDDDDHAEKSDNE---------- 195
Query: 198 DALKEYFGQKFDIGKDFTKIQKVQHSREIMDALQIPHSDDKLVHELAQNPNSVLLKYIRN 257 + H++ +V++ ++ S L+K Sbjct: 196 -------------------------------CPIVSHNEVDMVNDDSEEKFSELIK---- 255
Query: 258 LHDLSIEKDMEPKSHEFSEVSQSEEIVDHKQRLFFKRKVKNPGRNVSKGNENSDASSKIV 317 L +K+ E +S + + +VD Q L K + N G S ++ + IV Sbjct: 256 --RLIAQKESEVESCK-------KNLVDAFQVLDSKEESLNIGTPTSGDSQRIKETQTIV 315
Query: 318 ILKPGPKGL-VNSEADSIHPSVQNSTANDKTKVQNERAGSNFFLSGIKRKFRYAMGKDHH 377 ILKP P L V S + PS N K +NE+ S F LS I+R+ ++A+GK Sbjct: 316 ILKPEPNTLDVGSSPGT--PSTDN-------KAKNEKFSSRFSLSRIRRRLKFAVGK--- 375
Query: 378 EHSAYGSDRFSCDHHSTKESEEGVHKEDSARNSTSKDHFFIERI-SRPSTDSK-RREKAG 437 + + H S + + D+ +S S++ E I + P +D + + A Sbjct: 376 -------NPCNAQHDSDPDPD-----ADALSSSMSQNCCLGEEIETNPGSDGEILPDIAS 435
Query: 438 KLKSLEINQDLGNVYNGRRSSSNIYVEAKKHLSEMLSSGDESVDFLRGHVPKTLGRILSL 497 K ++ + + + + ++S IY+ AKKHLSEML+ GD D VP+ LG+IL+L Sbjct: 436 KGEANKEDTFHESEKDSKKSMCGIYIAAKKHLSEMLAEGDIDADLPDKEVPRILGKILAL 495
Query: 498 PEYSFSPISSPKMDCKLSPVTSEKRVSVGSRLLNVDEITPLLKGENDDTPISVQPPTDDN 557 PE+ F+P +SP++ L EK N+ + + Sbjct: 496 PEF-FTPENSPRVTLALDHQIIEKP--------NIQQCS--------------------- 555
Query: 558 HDMKSDIVDQSIREEAVSSSTNGKISEGDIEILKVNEIAVHEESSVLEAPSGSSEYSLLR 617 D + +R + S N HEE+ + P Sbjct: 556 ---SKDYYYEPLRLD----SNN------------------HEETEFMPVP---------- 615
Query: 618 EDQNGEMADDACDERTVSYVPSDPIASSPIREGHDNDTPDMGDDKPSMSLSQDSSEENQL 677 ED E +E+TV S+ I+SS I++ D +D+ L ++ +E Q Sbjct: 616 EDTRMEE-----EEQTVMDSLSEAISSSIIQQDAYID-----EDEHKQLLEKEVLKEGQ- 675
Query: 678 SPSRSASSSSSPTQGKAVGDLDGVSDVPERPSPVSVLEPLFVDDNLSPLHAMALPAGLPV 737 SP +SS ++ + + +DV + SPVSVLEP F DD+ SP + A + + Sbjct: 676 SPCSPPNSSVRMSECQ-----ENTTDVLGKSSPVSVLEPFFTDDDTSPNSSRFSSAEMRM 706
Query: 738 QPVHIEFEEREPAETDKANVTKSLKEDKEVIFDYVKAVLLASGLSWNQICGKWLSSEQLL 797 QP+ I F+E + +K N K+ +DKE+ +Y++AV+ +S L+W ++ + SE++L Sbjct: 736 QPLCIRFDEPDFPGPEKDNDVKTRMDDKELALEYIQAVVKSSELNWEELLARSFYSEKIL 706
Query: 798 YLLLLDEIELFPNQLCSDQQLLFDCINEVLADFCQSYPPWFSFIQPCLR----SEYLLEI 857 L+D+I+ LCSD++LLFDCINEVL +FC + PW SF++P + E +E+ Sbjct: 796 EQALMDDIDFCSTNLCSDKKLLFDCINEVLMEFC-GHGPWISFVKPAMHFFPDMENAVEV 706
Query: 858 C-EGVYWHLLPMPQPLTLDHLVRKDMSRTRTWMNLHSDAETVGTETCQAIFEDLLDDTVF 911 E VYWHLLP+P P TLD +VRKD++RT WM+L D + +ET + I ++LL++ + Sbjct: 856 VQEEVYWHLLPLPSPHTLDQIVRKDLARTGNWMDLRFDIGCIVSETGEIILDELLEEIIS 706
BLAST of CmaCh06G010670 vs. TAIR 10
Match: AT3G61380.1 (Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-related ) HSP 1 Score: 215.3 bits (547), Expect = 2.1e-55 Identity = 206/732 (28.14%), Postives = 344/732 (46.99%), Query Frame = 0 Query: 192 EQKMVDALKEYFGQKFDIGKDFTKIQKVQHSREIMDALQIPHSDDKLVHELAQNPNSVLL 251 E+K + +K QK ++Q +++++A + S+++ S L Sbjct: 92 EEKFKEIIKRLIAQKEG---------EIQTCKDLLEAFHVLGSEEE---------ESFLK 151
Query: 252 KYIRNLHDLSIEKDMEPKSHEFSEVSQSEEIVDHKQRL-FFKRKVKNPGRNVSKGNENSD 311 K + N ++ + + VS+ E +V K++ FF RK K+ R S Sbjct: 152 KKLSNEDAQTLGDSKRVEEEKPEVVSKQEAVVIPKRKANFFSRKWKSEER-----RNRSQ 211
Query: 312 ASSKIVILKPGPKGLVNSEADSIHPSVQNSTANDKTKVQNERAGSNFFLSGIKRKFRYAM 371 + IV+LKPGP L + +H STAN K + R S F + +KR+ + A+ Sbjct: 212 VAKTIVVLKPGPNTLDVDSSTGLH-----STAN---KSKTGRTFSRFLIGLVKRRLQSAV 271
Query: 372 GKDHHEHSAYGSDRFSCDHHSTKESEEGVHKEDSARNSTSKDHFFIERISRPSTDSKRRE 431 GK SCD K S+ +E+ S S++ + P D + E Sbjct: 272 GKK------------SCDVSVDKRSQNCSTQEEI--QSKSEEKHDVSDKEEPFCDERTTE 331
Query: 432 KAGKLKSLEINQDLGNVYNGRRSSSNIYVEAKKHLSEMLSSGDESVDFLRGHVPKTLGRI 491 GK +++ ++D ++ S +Y+ AKKHLSEML++GD V+ VP+ LG+I Sbjct: 332 D-GKEETIYSSED------SKKIMSGLYIAAKKHLSEMLANGDIDVNLPDKEVPRILGKI 391
Query: 492 LSLPEYSFSPISSPKMDCKLSPVTSEKRVSVGSRLLNVDEITPLLKGENDDTPISVQPPT 551 LSLPE+ SP SP++ + + VS S+ EI + +T + Sbjct: 392 LSLPEFC-SPADSPRL------IPAHDLVSTLSQTTEQPEIL-----QTPETSSATNDLI 451
Query: 552 DDNHDMKSD---IVDQSIREEAVSSSTNGKISEGDIEILKVNEIAVHEESSVLEAPSGSS 611 D++ D D +D S+ + + + N ++E S ++ S + Sbjct: 452 DEDSDKDDDTLFTIDVSVPRDYGNETEN----------------IDNDEESEIDPLSETC 511
Query: 612 EYSLLREDQNGEMADDACDERTVSYVPSDPIASSPIREGHDNDTPDMGDDKPSMSLSQDS 671 S+ RE +N D+ ++ +++ + H +P S+ L++ Sbjct: 512 SSSVSREVEN---VDEDVEKEMLNH------------QAHSPASPLESPTTCSVRLTE-- 571
Query: 672 SEENQLSPSRSASSSSSPTQGKAVGDLDGVSDVPERPSPVSVLEPLFVDDNLSPLHAMAL 731 +++ QGK SPVSVLE LF DD+ SP + + Sbjct: 572 -----------CKETATDAQGKL--------------SPVSVLETLFTDDDSSPTTSTSF 631
Query: 732 -PAGLPVQPVHIEFEEREPAETDKANVTKSLKEDKEVIFDYVKAVLLASGLSWNQICGKW 791 AG+ +QP+ I F+E + + +K N ++ +DKE+ Y++AV+ ++ LSW ++ + Sbjct: 632 SSAGMRMQPLCIRFDEVDSPKPEKDNNVEATIDDKELTLAYIEAVVKSAALSWEELLTRP 691
Query: 792 LSSEQLLYLLLLDEIELFPNQLCSDQQLLFDCINEVLADFCQSY---PPWFSFIQPCLRS 851 SEQLL L L D I QLC D+ LL+DCINEVL DFC + PW SF++P ++ Sbjct: 692 FYSEQLLELELTDGIVFSSTQLCDDKNLLYDCINEVLMDFCWNEFNPGPWISFVKPEVQL 701
Query: 852 EYLLEIC-----EGVYWHLLPMPQPLTLDHLVRKDMSRTRTWMNLHSDAETVGTETCQAI 911 +EI EGVYWHL P+P P TLD +V+KDM+RT +WM+L + +G+ T + I Sbjct: 752 ISDMEIAAKVAQEGVYWHLQPLPFPHTLDQIVKKDMARTGSWMDLRFEIGCLGSYTSEMI 701
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
F4HSD5 | 2.3e-11 | 32.73 | Protein TRM32 OS=Arabidopsis thaliana OX=3702 GN=TRM32 PE=2 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
AT4G00440.1 | 9.8e-90 | 31.26 | Protein of unknown function (DUF3741) | [more] |
AT4G00440.2 | 9.8e-90 | 31.26 | Protein of unknown function (DUF3741) | [more] |
AT4G00440.3 | 9.8e-90 | 31.26 | Protein of unknown function (DUF3741) | [more] |
AT2G45900.1 | 7.6e-58 | 27.49 | Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-related | [more] |
AT3G61380.1 | 2.1e-55 | 28.14 | Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-related | [more] |
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment |
IPR022212 | Domain of unknown function DUF3741 | PFAM | PF12552 | DUF3741 | coord: 205..248 e-value: 2.9E-6 score: 27.1 |
IPR025486 | Domain of unknown function DUF4378 | PFAM | PF14309 | DUF4378 | coord: 765..905 e-value: 8.7E-27 score: 94.5 |
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 658..689 |
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 377..434 |
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 328..349 |
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 632..703 |
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 536..555 |
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 331..349 |
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 109..130 |
None | No IPR available | PANTHER | PTHR47212:SF4 | ADHESIN-LIKE PROTEIN, PUTATIVE (DUF3741)-RELATED | coord: 1..913 |
None | No IPR available | PANTHER | PTHR47212 | ADHESIN-LIKE PROTEIN, PUTATIVE (DUF3741)-RELATED | coord: 1..913 |
Relationships
The following mRNA feature(s) are a part of this gene:
|