Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: CDSpolypeptideutr3
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAGTTAATGAGTTATATCTTGATCTCCTTGCACTGAGGGAATTATACATCCTCCTCTTAAAGAGCTGTTTGCGAGATGCAAATTCAGAACTTGTAAGCCTTTCTGATATTATCATTCTCATTCTTCTTGTTTTATCAACCTTACTAAAGAATGGATAAGTGTCTCCCATTTTTCCTTTTTTTTTTTTTTGGGTAAATGGTATGAATCACCTTTAAAGTATCACATTTATTGCAATTACACCCTCAAACTTTTTAGGGTAAAAATTGAACCCTTTCACTTTTAAAAGTGAAAAATTGAACCCATCAATTTCAATTTTAAAAAATTAAACTAAACTTTCTCATAAATAAAAAAATAAAATTTATTTCCTCTCTCTCTCCTTGTTGTCGTCCTCTCTACTTCCCCTCCTAGAAAACTGGTTGGTTTCTTCTTTCACTCTCTGCAATATTTATCAGCCTCCACCACCTATCCCTCAACGAAGACGACAAGAGAGCGTAAACTCTAGCTCCTCCGATTTTTCTATATTGTTTAGAACTTTCTTCCTCTCCAGCTTCTTGAACACGTCCCCAGAGCTCTCGTAGAAAATTTCGGCTATAAACCATTTCTTCTTCCAACAGAGCGGTTGTACTTGCTCACCCTTTAAAAGAAATTGAAAAACATGACCGATTTATCGGTCTCTTGAAACAGGGAAAAAAAAGTTTCTATAACTCGTGTTCTTAGGGGCCATGGTAATGATGGGGAGAGAATTTTGGTTAACGGCATGCTGGAATTTGGAATTGGAGAATTTGTTGCGGGAATTGAGAAGATCATACAGGACAACAATTTTCTATATACTGCAAGACAGGGGAAGGAGAAACACAAGTTCTCTATGAGAGAAAGTGGAGAGGAAAATCCAGGAAGAGAGAGTAGAAAAAAGAAAAGAGAATAAAAACAAATTTATCTTTTCTTTTGTTTAAAAAAAGTTTAAGAGTTTAATTTTTTAAAATTGAAAGTTTAAGGTACAATTTTTTTCACTTTATTAAAGTTTGAGGGTTTAAATTTTAGGTCTGAAAGTTTATGGGTATAATTTTTTTATTTGTAAAGGTTTAAGGGTTTAAGTTTTAAATTGAAAGTTTGAGAGTGTAATTGTAACGATCATCGTACTTTAGGGTGGTTTGTGCAATTTACCCTTTTTTTTTATGCTCTCTCTGATGTCACATCAACTTAAATTTGATTCTCATTTGAAAGAATACTAAACAATATCAATTGCAATCTATTTCAATCCGAGTGTAGTTTTAATCTATATAGTTTTATATCAGTTTAGGGTCCTATTAGACATAACATTAAAAAATTGAATAACCTATTAAATACAATATTAAAAGTTGAGTGACTTACGAGACACTATAATGTTTATTGGCCTATCAGATATTTTTTAAAGTGAAATAACTTATTAGACATAACTTTGAAAGTTTACGACTAAATTTGTGATTTAACCCTAAGAAATTTTATTCCAATTCCATAAATGCTATCGTGAGAAATGGAGATGTTCAATCATTGTACTCATAGGTCTGATTGTCTAAGTGACTGTAGTATGATTTGTCTCCCTCCTTATCTTATGAAGCATGAAACTATTGTAGCAAATTTTCTGCAGATATTTAATAGTTTTTGGCTGCCCTGTGATTTTTATGTGGATTGATTCAAATAGAGACTGAGACATTAGAACAGTCGGAGTGAACTTTGTACTTGAATATTTCACTATTGTAGTTTGGCTTTATTGTCAAAGCAGCTGGATGAAAGGGCACAAATTTTGTTGAAGCATTTGCTCGATGATGCTACTGCTGAAATTGTCCAGTTTCACTCGAAGGTTTGCATGAAAACTTCTTTCATTCTTTCTTTTTTAACAAATGAACCATTTGGCGTCGATAGATTGCTAAATATATGTCTTAGTCCTCAATAGGGGTGTTCAAAAACCCCGACAATCCGACGAACCCGACGAATCCCAACCGACCAGGTCCAATCCGGTCTCGGGTTGACCTTTCAAATTTTTCGGGTTGATTCAATCCAACCCGGTATGCTTTACTTTTAATTTTTTAATATATATAATGTATATAAATTATATATATATATATATACAGTAAACCTAATTTTTCTAACCCAGTTTTTCTAACCCTAGCAGGCGCCTCCCAATTCCCTCCGTCCCTCACCCCCCTTCCCTCTCTGTCTCCCTCCCTCCCTTCATCTTGTCTCTGTTTTCCTCCCTCCCTCCTTTCATCTCCCCCTCCCTCTTCTTCCTCTATCATTCTTTTCTTCCCCTTCTCCAGTCGACCCCCTCTATTCTTCATCCTCTTCTGATGATGGCTGAGTATGAAGTTTTCTCTCTTATGTGACAACGAACACGACGAACAATCGATTTCTGCAATTTAAAATGTTTAGGATTTCGACTGACGTTCGATTCCCTTCCTTCTCTCGTCTCAACCCGACGACCCAACCTAAAAATATAAGGGTTGGGTTGGGTTGGATTGGCCTTGTTCGATTCCATCAGTTCTCTGTTCTGCCCAATCCGAAAATTCGGGTTGGTCCAAAAAACTCCTCTAACCCGATCCAACCCAACCCGTGAACACCCCTGGACTCCTTAATCTATCAGTACTTTTTCTCATTTAGTCCAAACGTTTAAAAATTTTCAATTTAGTCTCTTATGTTTTAACCTTTTTCACTTACACTCTGGTGAATGTTGAAATTGCCTGATGGTTAACTTACATGGCATGATATTTAGTAGGTCAGCAGAAATTTGGAGATTGAACTAGATGACATAAAAGATGAAAAGTTACAATTTCTAATAAATTTTAAGAAGTTAGAAACTTTCTTCTCTTAAGCGGCTTACTTTCTCCTATATTAGACCTTAAGGATTAGCTTGAAAAAAAAATACCTAAACATTAAGAACTAAAGCATATATTTAGCCTATGTTAGATATTCAGAAAAATTGTTCGAGTTTACACTTATGGATGTTATGTCTAACATACATAGCCCTTTTGAGTTGTTACAGACAAAGCCAGTGGAAGAGAAAGTTGCTGAATGGATGGAATACAATCAAAGTACAAGAAAAACGGGAAATGTTGCTGCTAATGACTTATCAAATGGTATTGGTTTAGCACTCAGAAGAATTGAATTCCACATTTTATCTCTGCAACATTATACAAGTCAAAGTAGGAACACAAGAAGCCATATCAATGGAGCTAAATTATCCAACTCTCCATTAGATCAGCAGAAAGTTCAGTCAAGAATGGATCACTCAAATTTGAAGGCCAGAGTTGCTGAGCCAATCAATGGCCATTGTTCCGAGTTCGTTCACGGGTTTAGAGTACCTCTGTCTCAAGACAATGTTGAGGCCATGAAACCTCCAAACGTTGGAACCCAGGTATCTAAACAAAACAAAGTTATAAATCCAGTGATTCTGATAGATAAATCTCGATGTTCAGTGGGATCCAAGGCTACTGTACGGTCCGTGAATCGAACTCAGATACACGAAAGGCGGTGCCAAAATTTGCCTGGTCATATGATCATGAGGCCAACTTTGCTGAATCATATGAAGACTCGAATGCCCACTCAGCAAGAATCAGAATTTACAAACTCAGAATCAGAATCAGTTTCTTCTTCAAGTTGGGCAACTCAGCAGACAAGTGAAACTGAAACCACTGATTACCCTTCTTCCTCAAGTCACCAAGAGGATCAACCGGCAACCGGATCGGAGGTGAGTAGCCGGTACAGAAGCAGCAGAATTTCCTCAAAAGCATTTAGAATCAGCCATGGGAAAAAGGGGTCGAAGAAAGCGATCGGACGTTTCAAGAGACTGAGAAACAAACTAGGCCTTATCTTCCACCACCATCACCACCACCATCACCATCACCACCACAACAGCCATAATAACTTCTTCATGTGGAAGCAGCTAAGAAAGATCTTCCATGGCACAGACAAGAAAAGAGTAACAAGTAAAGGACGACACGAAACGCTAAAGAAAACAGCAATCCGAAGCGTATCTCGGAAGAATCAAGTTGGAAGGTTTCAGGCTCTGGCTGAGGGGCTGAGGAGCCATGTTTGGAAACCGACAGCCATGAAGAAGAAAGAGCTTAGGAAGCCGAGGTTGGGGAAGAAGGGTGTGAAGAAGTTGCACTGGTGGAGGATGTTTTGTCGCCGCCGTGGAGTGAAGTTGCCAAATAAAGGGCGTGTGAAAATAGGGTATGTAAATAGAAAACCACAGCATAAGATAGTTTAGTTTGGTGGAACAATTTTCAATCTTCTGCAGCCAATTTGGGCTTTTTATCCTCTCCTATGTTGAGCTGCTATGAGAGTTGAACATTTTGGTAGACATATGTTGTACGTCTCTAGAGAATGTAAAAGGTTTGTATTTTTAATGTTAGTGTATTCCTATTGATTACAAACAATGTTTCTTAAGCATGAGTTCAAAGCGCTAAACACTATGAGAAGAGCGGGCCTTTTTTTAGACCCGCTTTCCCAAACATTGATTATAAATTTGATTTAAATGAGTCGACTCCATGTAGGTCAACCACGAACTAGATTATGCGCCAAGTGAGATGCAAAAGGGCGAACGATAAAATTAAACATGCCATCCTCAAAGTCAAAAAAATATATACACCCTCAACGTTCATAAGCTAAGCTACCACCAAGGGTCAAAGAAACTACACAAGCTATCATGGTCTTTTTTTTTTTAATATAAGAAGTTAAGCAACCTATAAATTTTAAATCCATGCTACATAACAGTAAGGAAAATAATTACAAATAAAGGGTGAAAGGATACTGCTCTAAATGTTGTGACTCCAATAATCAAGTTCTTCCTCATCTTCTCCCTCTTCTTCGTGTCGAGGATCGTTTGAAAATTCGGGTGGTTTGGGGAATTGAATTGGATTTGCTCCTTGATTGAATCCACACTGTGCAAACATCCGAACGAGGCTACGGTTGAACTCCACCTGATACGACATAAATCTCCTCCACTCTTGATGTTGGTGGTTGAGTCTTCGCTCTAGTCGATCTAATCGCTCCTCCATAGATCCTGATTCTTGTGCACGAGGTCTCGACGCTCGAGCTTGCTCATTCGAGCAAGCCTGCAATGGCTCTATAGCCATTGCGTCTTCAGCAGGGACCCAGCCCTTCAATCCAAGAATAAAACTTTTGTCCATGACACTCCTTGGGTTAAGCAACTCCTCATGAGGTCCCCATACAACCCCAGCCTGTCTACATAGAGCAGTGATCAACGAAGGATGGGCTAGACCGCCCGTGGTTGACGCTCGTCTACAATGTCTAATGGACTGTCCGATGAGCCGCCCTACGTCTATTGATTTACCAGTGACGATAGTATAGACCAAGATCGCCCTTTCCCTAGTAACATCACTGATATGTGTCACTGGCAATAACTTTGCACAGATAAAGCTGTGCCAGACCTTATTGGTAACGCTCAACTCCGAGGTTTTGAAGCTCAATGCTTCATGTCCTCGGAATTTCCATTGAGTTCCTGGTCTACATAGTTTCAAAATCACTTGTTCTAAGTCCAATTCCTCCCTAGCATAGGTGCTATACTTGTCGTGTTCGATGTCGGGCAAATGGTACAACCTATTGATCGTAGTACGATCGAACGGCACCATTTTCCCTCTCACAAACACGCTTATGTCGTCTTCTTTCATATTCGCATAGAACTCCCTCACCACCGAGACGACTGCAGCCTCGGGTTGTTTAACCAGCTCTTCCCAACCCCTTTGCTCAATGTTCAAATGGATGTCCCTTTGGTCCCCACTACATGGTTGTAGCCCCCTTTCTGTAATAACACTACGTTTTGCTACGGAACTGATAAATTTGTCTGAGGATTCTAAGTCTATGAACTTCCTTCTATCAAAAGATGCCAAAGACGATCCACTAGTCTTGGCACGTTTTGGAGCCATAAACAAATTGATGCACAGATTAAGAGAAGTTTACAGCAGAAAATCTATCCCAAAACTTGATTGTCTTTTTGGTTTTTTGTTTCTCAAATCAGCACCAAATCAATCGAAGTTCAAGTACATAGTTCACTTCCACAAGAAACAGCAAGTTCCAGCAAAATCAAGCTCCAAAATAGAGATTTAGAGGAAAAACTTACTTGGAGAGGAATCTAGCGAAAAACCCACGAAAAATCTGGGTAAATTCGCCGGAGTGGAAAGAGGAATCTGCAGATAATAATAGATTTAGGGAAGAACTCGTGATTTGAGGTATGGGAAAGAAGAAATTTGAGAGAAGAGAGAAGTTATTGGAGACGGGGAAATTAGGGCAAGAAGTGACTGAAGCTGTTGTGAGCGTCTTTTTAATTTTTCTGCTATAATAACAGCGCCTAGGCTCGGGATTCCCGCGCGGGGGCGCTATCGCTCTGTGCTTGCGTTTCATTCTGTCTCCCTATGGGAAGAAATTATAGAGTTGAA
mRNA sequence
ATGGAAGTTAATGAGTTATATCTTGATCTCCTTGCACTGAGGGAATTATACATCCTCCTCTTAAAGAGCTGTTTGCGAGATGCAAATTCAGAACTTCTGGATGAAAGGGCACAAATTTTGTTGAAGCATTTGCTCGATGATGCTACTGCTGAAATTGTCCAGTTTCACTCGAAGGTTTGCATGAAAACTTCTTTCATTCTTTCTTTTTTAACAAATGAACCATTTGGCACAAAGCCAGTGGAAGAGAAAGTTGCTGAATGGATGGAATACAATCAAAGTACAAGAAAAACGGGAAATGTTGCTGCTAATGACTTATCAAATGGTATTGGTTTAGCACTCAGAAGAATTGAATTCCACATTTTATCTCTGCAACATTATACAAGTCAAAGTAGGAACACAAGAAGCCATATCAATGGAGCTAAATTATCCAACTCTCCATTAGATCAGCAGAAAGTTCAGTCAAGAATGGATCACTCAAATTTGAAGGCCAGAGTTGCTGAGCCAATCAATGGCCATTGTTCCGAGTTCGTTCACGGGTTTAGAGTACCTCTGTCTCAAGACAATGTTGAGGCCATGAAACCTCCAAACGTTGGAACCCAGGTATCTAAACAAAACAAAGTTATAAATCCAGTGATTCTGATAGATAAATCTCGATGTTCAGTGGGATCCAAGGCTACTGTACGGTCCGTGAATCGAACTCAGATACACGAAAGGCGGTGCCAAAATTTGCCTGGTCATATGATCATGAGGCCAACTTTGCTGAATCATATGAAGACTCGAATGCCCACTCAGCAAGAATCAGAATTTACAAACTCAGAATCAGAATCAGTTTCTTCTTCAAGTTGGGCAACTCAGCAGACAAGTGAAACTGAAACCACTGATTACCCTTCTTCCTCAAGTCACCAAGAGGATCAACCGGCAACCGGATCGGAGGTGAGTAGCCGGTACAGAAGCAGCAGAATTTCCTCAAAAGCATTTAGAATCAGCCATGGGAAAAAGGGGTCGAAGAAAGCGATCGGACGTTTCAAGAGACTGAGAAACAAACTAGGCCTTATCTTCCACCACCATCACCACCACCATCACCATCACCACCACAACAGCCATAATAACTTCTTCATGTGGAAGCAGCTAAGAAAGATCTTCCATGGCACAGACAAGAAAAGAGTAACAAGTAAAGGACGACACGAAACGCTAAAGAAAACAGCAATCCGAAGCGTATCTCGGAAGAATCAAGTTGGAAGGTTTCAGGCTCTGGCTGAGGGGCTGAGGAGCCATGTTTGGAAACCGACAGCCATGAAGAAGAAAGAGCTTAGGAAGCCGAGGTTGGGGAAGAAGGGTGTGAAGAAGTTGCACTGGTGGAGGATGTTTTGTCGCCGCCGTGGAGTGAAGTTGCCAAATAAAGGGCGTGTGAAAATAGGGTATGTAAATAGAAAACCACAGCATAAGATAGTTTAGTTTGGTGGAACAATTTTCAATCTTCTGCAGCCAATTTGGGCTTTTTATCCTCTCCTATGTTGAGCTGCTATGAGAGTTGAACATTTTGGTAGACATATGTTGTACGTCTCTAGAGAATGTAAAAGGTTTGTATTTTTAATGTTAGTGTATTCCTATTGATTACAAACAATGTTTCTTAAGCATGAGTTCAAAGCGCTAAACACTATGAGAAGAGCGGGCCTTTTTTTAGACCCGCTTTCCCAAACATTGATTATAAATTTGATTTAAATGAGTCGACTCCATGTAGGTCAACCACGAACTAGATTATGCGCCAAGTGAGATGCAAAAGGGCGAACGATAAAATTAAACATGCCATCCTCAAAGTCAAAAAAATATATACACCCTCAACGTTCATAAGCTAAGCTACCACCAAGGGTCAAAGAAACTACACAAGCTATCATGGTCTTTTTTTTTTTAATATAAGAAGTTAAGCAACCTATAAATTTTAAATCCATGCTACATAACAGTAAGGAAAATAATTACAAATAAAGGGTGAAAGGATACTGCTCTAAATGTTGTGACTCCAATAATCAAGTTCTTCCTCATCTTCTCCCTCTTCTTCGTGTCGAGGATCGTTTGAAAATTCGGGTGGTTTGGGGAATTGAATTGGATTTGCTCCTTGATTGAATCCACACTGTGCAAACATCCGAACGAGGCTACGGTTGAACTCCACCTGATACGACATAAATCTCCTCCACTCTTGATGTTGGTGGTTGAGTCTTCGCTCTAGTCGATCTAATCGCTCCTCCATAGATCCTGATTCTTGTGCACGAGGTCTCGACGCTCGAGCTTGCTCATTCGAGCAAGCCTGCAATGGCTCTATAGCCATTGCGTCTTCAGCAGGGACCCAGCCCTTCAATCCAAGAATAAAACTTTTGTCCATGACACTCCTTGGGTTAAGCAACTCCTCATGAGGTCCCCATACAACCCCAGCCTGTCTACATAGAGCAGTGATCAACGAAGGATGGGCTAGACCGCCCGTGGTTGACGCTCGTCTACAATGTCTAATGGACTGTCCGATGAGCCGCCCTACGTCTATTGATTTACCAGTGACGATAGTATAGACCAAGATCGCCCTTTCCCTAGTAACATCACTGATATGTGTCACTGGCAATAACTTTGCACAGATAAAGCTGTGCCAGACCTTATTGGTAACGCTCAACTCCGAGGTTTTGAAGCTCAATGCTTCATGTCCTCGGAATTTCCATTGAGTTCCTGGTCTACATAGTTTCAAAATCACTTGTTCTAAGTCCAATTCCTCCCTAGCATAGGTGCTATACTTGTCGTGTTCGATGTCGGGCAAATGGTACAACCTATTGATCGTAGTACGATCGAACGGCACCATTTTCCCTCTCACAAACACGCTTATGTCGTCTTCTTTCATATTCGCATAGAACTCCCTCACCACCGAGACGACTGCAGCCTCGGGTTGTTTAACCAGCTCTTCCCAACCCCTTTGCTCAATGTTCAAATGGATGTCCCTTTGGTCCCCACTACATGGTTGTAGCCCCCTTTCTGTAATAACACTACGTTTTGCTACGGAACTGATAAATTTGTCTGAGGATTCTAAGTCTATGAACTTCCTTCTATCAAAAGATGCCAAAGACGATCCACTAGTCTTGGCACGTTTTGGAGCCATAAACAAATTGATGCACAGATTAAGAGAAGTTTACAGCAGAAAATCTATCCCAAAACTTGATTGTCTTTTTGGTTTTTTGTTTCTCAAATCAGCACCAAATCAATCGAAGTTCAAGTACATAGTTCACTTCCACAAGAAACAGCAAGTTCCAGCAAAATCAAGCTCCAAAATAGAGATTTAGAGGAAAAACTTACTTGGAGAGGAATCTAGCGAAAAACCCACGAAAAATCTGGGTAAATTCGCCGGAGTGGAAAGAGGAATCTGCAGATAATAATAGATTTAGGGAAGAACTCGTGATTTGAGGTATGGGAAAGAAGAAATTTGAGAGAAGAGAGAAGTTATTGGAGACGGGGAAATTAGGGCAAGAAGTGACTGAAGCTGTTGTGAGCGTCTTTTTAATTTTTCTGCTATAATAACAGCGCCTAGGCTCGGGATTCCCGCGCGGGGGCGCTATCGCTCTGTGCTTGCGTTTCATTCTGTCTCCCTATGGGAAGAAATTATAGAGTTGAA
Coding sequence (CDS)
ATGGAAGTTAATGAGTTATATCTTGATCTCCTTGCACTGAGGGAATTATACATCCTCCTCTTAAAGAGCTGTTTGCGAGATGCAAATTCAGAACTTCTGGATGAAAGGGCACAAATTTTGTTGAAGCATTTGCTCGATGATGCTACTGCTGAAATTGTCCAGTTTCACTCGAAGGTTTGCATGAAAACTTCTTTCATTCTTTCTTTTTTAACAAATGAACCATTTGGCACAAAGCCAGTGGAAGAGAAAGTTGCTGAATGGATGGAATACAATCAAAGTACAAGAAAAACGGGAAATGTTGCTGCTAATGACTTATCAAATGGTATTGGTTTAGCACTCAGAAGAATTGAATTCCACATTTTATCTCTGCAACATTATACAAGTCAAAGTAGGAACACAAGAAGCCATATCAATGGAGCTAAATTATCCAACTCTCCATTAGATCAGCAGAAAGTTCAGTCAAGAATGGATCACTCAAATTTGAAGGCCAGAGTTGCTGAGCCAATCAATGGCCATTGTTCCGAGTTCGTTCACGGGTTTAGAGTACCTCTGTCTCAAGACAATGTTGAGGCCATGAAACCTCCAAACGTTGGAACCCAGGTATCTAAACAAAACAAAGTTATAAATCCAGTGATTCTGATAGATAAATCTCGATGTTCAGTGGGATCCAAGGCTACTGTACGGTCCGTGAATCGAACTCAGATACACGAAAGGCGGTGCCAAAATTTGCCTGGTCATATGATCATGAGGCCAACTTTGCTGAATCATATGAAGACTCGAATGCCCACTCAGCAAGAATCAGAATTTACAAACTCAGAATCAGAATCAGTTTCTTCTTCAAGTTGGGCAACTCAGCAGACAAGTGAAACTGAAACCACTGATTACCCTTCTTCCTCAAGTCACCAAGAGGATCAACCGGCAACCGGATCGGAGGTGAGTAGCCGGTACAGAAGCAGCAGAATTTCCTCAAAAGCATTTAGAATCAGCCATGGGAAAAAGGGGTCGAAGAAAGCGATCGGACGTTTCAAGAGACTGAGAAACAAACTAGGCCTTATCTTCCACCACCATCACCACCACCATCACCATCACCACCACAACAGCCATAATAACTTCTTCATGTGGAAGCAGCTAAGAAAGATCTTCCATGGCACAGACAAGAAAAGAGTAACAAGTAAAGGACGACACGAAACGCTAAAGAAAACAGCAATCCGAAGCGTATCTCGGAAGAATCAAGTTGGAAGGTTTCAGGCTCTGGCTGAGGGGCTGAGGAGCCATGTTTGGAAACCGACAGCCATGAAGAAGAAAGAGCTTAGGAAGCCGAGGTTGGGGAAGAAGGGTGTGAAGAAGTTGCACTGGTGGAGGATGTTTTGTCGCCGCCGTGGAGTGAAGTTGCCAAATAAAGGGCGTGTGAAAATAGGGTATGTAAATAGAAAACCACAGCATAAGATAGTTTAG
Protein sequence
MEVNELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDATAEIVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV
Homology
BLAST of MC01g0120 vs. ExPASy Swiss-Prot
Match:
Q9FFP2 (Protein KOKOPELLI OS=Arabidopsis thaliana OX=3702 GN=KPL PE=2 SV=1)
HSP 1 Score: 84.3 bits (207), Expect = 4.0e-15
Identity = 135/513 (26.32%), Postives = 223/513 (43.47%), Query Frame = 0
Query: 1 MEVNELYLDLLALRELYILLLKSCLRDANSEL--LDERAQILLKHLLDDATAE------- 60
ME N + +L +LR LY LL+ + + E LD+ Q LLK LLD A+ E
Sbjct: 1 MEPNNVERNLQSLRRLYSLLVANARNEYIPEAYKLDDNTQFLLKRLLDFASHEHFVTQSN 60
Query: 61 IVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTRKTGNVAANDLSNGIGL 120
++ +V KT + +N ++ + + + TR + + A
Sbjct: 61 LLATQLRVFPKTVLHGAKPSNVADSSETTPPAMLSQINGSHITRVSKPLEAKG------- 120
Query: 121 ALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQK-VQSRMDHS----NLKARVA 180
L+R + + ++ S+ T ++ + ++ L V SR+D ++K+ V
Sbjct: 121 TLQR-DLRVDQIRSNPSKDVLTEEVVDAIEQIDTQLSALSFVSSRVDSDERTRSVKSFVT 180
Query: 181 EPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKAT 240
P C + +P + N + +V + + ++ + + +A
Sbjct: 181 PPTE-ECRQNSQA-SMPCLRSNYMTVSQKSVISGAEVTFPYNDQLVRVTSPPQPLPPRA- 240
Query: 241 VRSVNRTQIHERRCQNLPGHMIMRPTLLNHM-------KTRMPTQQESEFTNSESE---- 300
V + R Q +P IM+PTL++ + Q T SESE
Sbjct: 241 VSGFKKPNQSNRASQKMP---IMKPTLMDQETETFDDDSSETEADQTPSATGSESEDEEV 300
Query: 301 -----------SVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVS-SRYRSSRISS 360
S S S W TQ ++TE+ S SS+ + SEVS S + R +S
Sbjct: 301 STSQEYSGETGSSSGSEWETQAENDTES---KSESSYPPQNDDSVSEVSTSPPHTDRDTS 360
Query: 361 KAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHG 420
+ K + +GRFKR++NK+G IFHHHHHHHHHHHH+ W +L+ FH
Sbjct: 361 R-----EPGKQRRNVMGRFKRIKNKIGQIFHHHHHHHHHHHHHDKEKPSAWNKLQSKFH- 420
Query: 421 TDKKRVTSKGRHETLKKT-AIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRL 474
K + SK R + ++ + + +++Q G F AL EGL H +K ++
Sbjct: 421 -HKHQEKSKERKRPMSESKGLTTHKQQHQGGHFHALVEGLVRH-------RKHSKKQKHQ 480
BLAST of MC01g0120 vs. NCBI nr
Match:
XP_022154939.1 (protein KOKOPELLI isoform X2 [Momordica charantia])
HSP 1 Score: 884 bits (2283), Expect = 0.0
Identity = 466/484 (96.28%), Postives = 466/484 (96.28%), Query Frame = 0
Query: 1 MEVNELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDATAEIVQFHSKVC 60
MEVNELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDATAEIVQFHSK
Sbjct: 1 MEVNELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDATAEIVQFHSK-- 60
Query: 61 MKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTRKTGNVAANDLSNGIGLALRRIEFHI 120
TKPVEEKVAEWMEYNQSTRKTGNVAANDLSNGIGLALRRIEFHI
Sbjct: 61 ----------------TKPVEEKVAEWMEYNQSTRKTGNVAANDLSNGIGLALRRIEFHI 120
Query: 121 LSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGF 180
LSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGF
Sbjct: 121 LSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGF 180
Query: 181 RVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRC 240
RVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRC
Sbjct: 181 RVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRC 240
Query: 241 QNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSSS 300
QNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSSS
Sbjct: 241 QNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSSS 300
Query: 301 HQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHH 360
HQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHH
Sbjct: 301 HQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHH 360
Query: 361 HHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKKTAIRSVSRKNQVGRFQALAE 420
HHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKKTAIRSVSRKNQVGRFQALAE
Sbjct: 361 HHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKKTAIRSVSRKNQVGRFQALAE 420
Query: 421 GLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQ 480
GLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQ
Sbjct: 421 GLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQ 466
Query: 481 HKIV 484
HKIV
Sbjct: 481 HKIV 466
BLAST of MC01g0120 vs. NCBI nr
Match:
XP_022154937.1 (protein KOKOPELLI isoform X1 [Momordica charantia] >XP_022154938.1 protein KOKOPELLI isoform X1 [Momordica charantia])
HSP 1 Score: 879 bits (2271), Expect = 0.0
Identity = 466/485 (96.08%), Postives = 466/485 (96.08%), Query Frame = 0
Query: 1 MEVNELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDATAEIVQFHSKV 60
MEVNELYLDLLALRELYILLLKSCLRDANSEL LDERAQILLKHLLDDATAEIVQFHSK
Sbjct: 1 MEVNELYLDLLALRELYILLLKSCLRDANSELQLDERAQILLKHLLDDATAEIVQFHSK- 60
Query: 61 CMKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTRKTGNVAANDLSNGIGLALRRIEFH 120
TKPVEEKVAEWMEYNQSTRKTGNVAANDLSNGIGLALRRIEFH
Sbjct: 61 -----------------TKPVEEKVAEWMEYNQSTRKTGNVAANDLSNGIGLALRRIEFH 120
Query: 121 ILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHG 180
ILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHG
Sbjct: 121 ILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHG 180
Query: 181 FRVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERR 240
FRVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERR
Sbjct: 181 FRVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERR 240
Query: 241 CQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSS 300
CQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSS
Sbjct: 241 CQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSS 300
Query: 301 SHQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHH 360
SHQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHH
Sbjct: 301 SHQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHH 360
Query: 361 HHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKKTAIRSVSRKNQVGRFQALA 420
HHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKKTAIRSVSRKNQVGRFQALA
Sbjct: 361 HHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKKTAIRSVSRKNQVGRFQALA 420
Query: 421 EGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKP 480
EGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKP
Sbjct: 421 EGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKP 467
Query: 481 QHKIV 484
QHKIV
Sbjct: 481 QHKIV 467
BLAST of MC01g0120 vs. NCBI nr
Match:
XP_022154940.1 (uncharacterized protein LOC111022084 isoform X3 [Momordica charantia])
HSP 1 Score: 828 bits (2138), Expect = 7.43e-301
Identity = 435/455 (95.60%), Postives = 436/455 (95.82%), Query Frame = 0
Query: 30 SELLDERAQILLKHLLDDATAEIVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWME 89
S+ LDERAQILLKHLLDDATAEIVQFHSK TKPVEEKVAEWME
Sbjct: 11 SKQLDERAQILLKHLLDDATAEIVQFHSK------------------TKPVEEKVAEWME 70
Query: 90 YNQSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQ 149
YNQSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQ
Sbjct: 71 YNQSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQ 130
Query: 150 QKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKVIN 209
QKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKVIN
Sbjct: 131 QKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKVIN 190
Query: 210 PVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEF 269
PVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEF
Sbjct: 191 PVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEF 250
Query: 270 TNSESESVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFRIS 329
TNSESESVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFRIS
Sbjct: 251 TNSESESVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFRIS 310
Query: 330 HGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRV 389
HGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRV
Sbjct: 311 HGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRV 370
Query: 390 TSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKK 449
TSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKK
Sbjct: 371 TSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKK 430
Query: 450 LHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV 484
LHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV
Sbjct: 431 LHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV 447
BLAST of MC01g0120 vs. NCBI nr
Match:
XP_022154941.1 (protein KOKOPELLI isoform X4 [Momordica charantia])
HSP 1 Score: 767 bits (1980), Expect = 1.31e-277
Identity = 397/397 (100.00%), Postives = 397/397 (100.00%), Query Frame = 0
Query: 88 MEYNQSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPL 147
MEYNQSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPL
Sbjct: 1 MEYNQSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPL 60
Query: 148 DQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKV 207
DQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKV
Sbjct: 61 DQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKV 120
Query: 208 INPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQES 267
INPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQES
Sbjct: 121 INPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQES 180
Query: 268 EFTNSESESVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFR 327
EFTNSESESVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFR
Sbjct: 181 EFTNSESESVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFR 240
Query: 328 ISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKK 387
ISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKK
Sbjct: 241 ISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKK 300
Query: 388 RVTSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGV 447
RVTSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGV
Sbjct: 301 RVTSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGV 360
Query: 448 KKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV 484
KKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV
Sbjct: 361 KKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV 397
BLAST of MC01g0120 vs. NCBI nr
Match:
XP_022958322.1 (uncharacterized protein LOC111459571 isoform X2 [Cucurbita moschata] >XP_022958323.1 uncharacterized protein LOC111459571 isoform X2 [Cucurbita moschata])
HSP 1 Score: 460 bits (1183), Expect = 3.11e-155
Identity = 292/509 (57.37%), Postives = 346/509 (67.98%), Query Frame = 0
Query: 1 MEVNELYLDLLALRELYILLLKSCLRDANSELL-DERAQILLKHLLDDATAEIVQFHSKV 60
ME +ELYLDLLALR+LY+ LLK CLRDANSEL+ RA+IL KHLLDDAT +++FHSK
Sbjct: 1 MEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSK- 60
Query: 61 CMKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTR-----------------KTGNVAA 120
T +FL + TKP++EKVAEWME+NQ+ R NVAA
Sbjct: 61 ---TLPFYNFLRKDDKQTKPLDEKVAEWMEHNQTARTMANPEKIEHKPGRDRASASNVAA 120
Query: 121 NDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLK 180
NDLS+GI ALRRIE HILSLQ YT RSHI+ KL+ + ++H +K
Sbjct: 121 NDLSSGISSALRRIELHILSLQRYT------RSHISETKLAYYGQSVHQGNESLNHQKVK 180
Query: 181 ARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVG 240
VA HCS+FVHGFR+PL+QD EAMK Q+++ P L+DKS C G
Sbjct: 181 PMVAN----HCSKFVHGFRIPLTQDKNEAMK----------QHELALPPTLMDKSGCPEG 240
Query: 241 SKATVR---SVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSS 300
SKAT R +NRT I E+R +N G ++MRPTL H KT + QQESE+TNSESES S
Sbjct: 241 SKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLW-HNKTHLAAQQESEYTNSESESAPS 300
Query: 301 SSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRY--RSSRISSKAFRISHGKKGSKK 360
SS AT+QTSE+ETT SS Q PATGSE SS+ SS IS +AF+ SHGKK SKK
Sbjct: 301 SSPATRQTSESETTADSSSPGDQSSPPATGSEASSQCGNSSSNISREAFKFSHGKKESKK 360
Query: 361 AIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKG-RHE 420
A+GRFK LRNKLGLIFHHHHHH+H N HN+ MWKQ+R++FH T KK +TSK ++
Sbjct: 361 AVGRFKSLRNKLGLIFHHHHHHYH----NGHNS--MWKQVRRMFHRTGKKELTSKEEKNG 420
Query: 421 TLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGK-KGVKKLHWWRM 480
L+KT IRSVSR NQVG+FQALAEGLRSHVWK AMKKKE R GK G KKLHWW+M
Sbjct: 421 MLRKTTIRSVSRNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKM 478
Query: 481 FCRRRGVKLPNKGRVKIGYVNRKPQHKIV 484
RRRGVKLPNKGRVKIGYVN+KP KI+
Sbjct: 481 IRRRRGVKLPNKGRVKIGYVNKKPHVKII 478
BLAST of MC01g0120 vs. ExPASy TrEMBL
Match:
A0A6J1DNR3 (protein KOKOPELLI isoform X2 OS=Momordica charantia OX=3673 GN=LOC111022084 PE=4 SV=1)
HSP 1 Score: 884 bits (2283), Expect = 0.0
Identity = 466/484 (96.28%), Postives = 466/484 (96.28%), Query Frame = 0
Query: 1 MEVNELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDATAEIVQFHSKVC 60
MEVNELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDATAEIVQFHSK
Sbjct: 1 MEVNELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDATAEIVQFHSK-- 60
Query: 61 MKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTRKTGNVAANDLSNGIGLALRRIEFHI 120
TKPVEEKVAEWMEYNQSTRKTGNVAANDLSNGIGLALRRIEFHI
Sbjct: 61 ----------------TKPVEEKVAEWMEYNQSTRKTGNVAANDLSNGIGLALRRIEFHI 120
Query: 121 LSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGF 180
LSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGF
Sbjct: 121 LSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGF 180
Query: 181 RVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRC 240
RVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRC
Sbjct: 181 RVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRC 240
Query: 241 QNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSSS 300
QNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSSS
Sbjct: 241 QNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSSS 300
Query: 301 HQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHH 360
HQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHH
Sbjct: 301 HQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHH 360
Query: 361 HHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKKTAIRSVSRKNQVGRFQALAE 420
HHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKKTAIRSVSRKNQVGRFQALAE
Sbjct: 361 HHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKKTAIRSVSRKNQVGRFQALAE 420
Query: 421 GLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQ 480
GLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQ
Sbjct: 421 GLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQ 466
Query: 481 HKIV 484
HKIV
Sbjct: 481 HKIV 466
BLAST of MC01g0120 vs. ExPASy TrEMBL
Match:
A0A6J1DLN1 (protein KOKOPELLI isoform X1 OS=Momordica charantia OX=3673 GN=LOC111022084 PE=4 SV=1)
HSP 1 Score: 879 bits (2271), Expect = 0.0
Identity = 466/485 (96.08%), Postives = 466/485 (96.08%), Query Frame = 0
Query: 1 MEVNELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDATAEIVQFHSKV 60
MEVNELYLDLLALRELYILLLKSCLRDANSEL LDERAQILLKHLLDDATAEIVQFHSK
Sbjct: 1 MEVNELYLDLLALRELYILLLKSCLRDANSELQLDERAQILLKHLLDDATAEIVQFHSK- 60
Query: 61 CMKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTRKTGNVAANDLSNGIGLALRRIEFH 120
TKPVEEKVAEWMEYNQSTRKTGNVAANDLSNGIGLALRRIEFH
Sbjct: 61 -----------------TKPVEEKVAEWMEYNQSTRKTGNVAANDLSNGIGLALRRIEFH 120
Query: 121 ILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHG 180
ILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHG
Sbjct: 121 ILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHG 180
Query: 181 FRVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERR 240
FRVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERR
Sbjct: 181 FRVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERR 240
Query: 241 CQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSS 300
CQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSS
Sbjct: 241 CQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSS 300
Query: 301 SHQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHH 360
SHQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHH
Sbjct: 301 SHQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHH 360
Query: 361 HHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKKTAIRSVSRKNQVGRFQALA 420
HHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKKTAIRSVSRKNQVGRFQALA
Sbjct: 361 HHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKKTAIRSVSRKNQVGRFQALA 420
Query: 421 EGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKP 480
EGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKP
Sbjct: 421 EGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKP 467
Query: 481 QHKIV 484
QHKIV
Sbjct: 481 QHKIV 467
BLAST of MC01g0120 vs. ExPASy TrEMBL
Match:
A0A6J1DL21 (uncharacterized protein LOC111022084 isoform X3 OS=Momordica charantia OX=3673 GN=LOC111022084 PE=4 SV=1)
HSP 1 Score: 828 bits (2138), Expect = 3.60e-301
Identity = 435/455 (95.60%), Postives = 436/455 (95.82%), Query Frame = 0
Query: 30 SELLDERAQILLKHLLDDATAEIVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWME 89
S+ LDERAQILLKHLLDDATAEIVQFHSK TKPVEEKVAEWME
Sbjct: 11 SKQLDERAQILLKHLLDDATAEIVQFHSK------------------TKPVEEKVAEWME 70
Query: 90 YNQSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQ 149
YNQSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQ
Sbjct: 71 YNQSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQ 130
Query: 150 QKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKVIN 209
QKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKVIN
Sbjct: 131 QKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKVIN 190
Query: 210 PVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEF 269
PVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEF
Sbjct: 191 PVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEF 250
Query: 270 TNSESESVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFRIS 329
TNSESESVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFRIS
Sbjct: 251 TNSESESVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFRIS 310
Query: 330 HGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRV 389
HGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRV
Sbjct: 311 HGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRV 370
Query: 390 TSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKK 449
TSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKK
Sbjct: 371 TSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKK 430
Query: 450 LHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV 484
LHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV
Sbjct: 431 LHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV 447
BLAST of MC01g0120 vs. ExPASy TrEMBL
Match:
A0A6J1DQ76 (protein KOKOPELLI isoform X4 OS=Momordica charantia OX=3673 GN=LOC111022084 PE=4 SV=1)
HSP 1 Score: 767 bits (1980), Expect = 6.36e-278
Identity = 397/397 (100.00%), Postives = 397/397 (100.00%), Query Frame = 0
Query: 88 MEYNQSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPL 147
MEYNQSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPL
Sbjct: 1 MEYNQSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPL 60
Query: 148 DQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKV 207
DQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKV
Sbjct: 61 DQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKV 120
Query: 208 INPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQES 267
INPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQES
Sbjct: 121 INPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQES 180
Query: 268 EFTNSESESVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFR 327
EFTNSESESVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFR
Sbjct: 181 EFTNSESESVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFR 240
Query: 328 ISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKK 387
ISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKK
Sbjct: 241 ISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKK 300
Query: 388 RVTSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGV 447
RVTSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGV
Sbjct: 301 RVTSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGV 360
Query: 448 KKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV 484
KKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV
Sbjct: 361 KKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV 397
BLAST of MC01g0120 vs. ExPASy TrEMBL
Match:
A0A6J1H2T7 (uncharacterized protein LOC111459571 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111459571 PE=4 SV=1)
HSP 1 Score: 460 bits (1183), Expect = 1.51e-155
Identity = 292/509 (57.37%), Postives = 346/509 (67.98%), Query Frame = 0
Query: 1 MEVNELYLDLLALRELYILLLKSCLRDANSELL-DERAQILLKHLLDDATAEIVQFHSKV 60
ME +ELYLDLLALR+LY+ LLK CLRDANSEL+ RA+IL KHLLDDAT +++FHSK
Sbjct: 1 MEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSK- 60
Query: 61 CMKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTR-----------------KTGNVAA 120
T +FL + TKP++EKVAEWME+NQ+ R NVAA
Sbjct: 61 ---TLPFYNFLRKDDKQTKPLDEKVAEWMEHNQTARTMANPEKIEHKPGRDRASASNVAA 120
Query: 121 NDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLK 180
NDLS+GI ALRRIE HILSLQ YT RSHI+ KL+ + ++H +K
Sbjct: 121 NDLSSGISSALRRIELHILSLQRYT------RSHISETKLAYYGQSVHQGNESLNHQKVK 180
Query: 181 ARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVG 240
VA HCS+FVHGFR+PL+QD EAMK Q+++ P L+DKS C G
Sbjct: 181 PMVAN----HCSKFVHGFRIPLTQDKNEAMK----------QHELALPPTLMDKSGCPEG 240
Query: 241 SKATVR---SVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSS 300
SKAT R +NRT I E+R +N G ++MRPTL H KT + QQESE+TNSESES S
Sbjct: 241 SKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLW-HNKTHLAAQQESEYTNSESESAPS 300
Query: 301 SSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRY--RSSRISSKAFRISHGKKGSKK 360
SS AT+QTSE+ETT SS Q PATGSE SS+ SS IS +AF+ SHGKK SKK
Sbjct: 301 SSPATRQTSESETTADSSSPGDQSSPPATGSEASSQCGNSSSNISREAFKFSHGKKESKK 360
Query: 361 AIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKG-RHE 420
A+GRFK LRNKLGLIFHHHHHH+H N HN+ MWKQ+R++FH T KK +TSK ++
Sbjct: 361 AVGRFKSLRNKLGLIFHHHHHHYH----NGHNS--MWKQVRRMFHRTGKKELTSKEEKNG 420
Query: 421 TLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGK-KGVKKLHWWRM 480
L+KT IRSVSR NQVG+FQALAEGLRSHVWK AMKKKE R GK G KKLHWW+M
Sbjct: 421 MLRKTTIRSVSRNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKM 478
Query: 481 FCRRRGVKLPNKGRVKIGYVNRKPQHKIV 484
RRRGVKLPNKGRVKIGYVN+KP KI+
Sbjct: 481 IRRRRGVKLPNKGRVKIGYVNKKPHVKII 478
BLAST of MC01g0120 vs. TAIR 10
Match:
AT5G63720.1 (kokopelli )
HSP 1 Score: 84.3 bits (207), Expect = 2.9e-16
Identity = 135/513 (26.32%), Postives = 223/513 (43.47%), Query Frame = 0
Query: 1 MEVNELYLDLLALRELYILLLKSCLRDANSEL--LDERAQILLKHLLDDATAE------- 60
ME N + +L +LR LY LL+ + + E LD+ Q LLK LLD A+ E
Sbjct: 1 MEPNNVERNLQSLRRLYSLLVANARNEYIPEAYKLDDNTQFLLKRLLDFASHEHFVTQSN 60
Query: 61 IVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTRKTGNVAANDLSNGIGL 120
++ +V KT + +N ++ + + + TR + + A
Sbjct: 61 LLATQLRVFPKTVLHGAKPSNVADSSETTPPAMLSQINGSHITRVSKPLEAKG------- 120
Query: 121 ALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQK-VQSRMDHS----NLKARVA 180
L+R + + ++ S+ T ++ + ++ L V SR+D ++K+ V
Sbjct: 121 TLQR-DLRVDQIRSNPSKDVLTEEVVDAIEQIDTQLSALSFVSSRVDSDERTRSVKSFVT 180
Query: 181 EPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKAT 240
P C + +P + N + +V + + ++ + + +A
Sbjct: 181 PPTE-ECRQNSQA-SMPCLRSNYMTVSQKSVISGAEVTFPYNDQLVRVTSPPQPLPPRA- 240
Query: 241 VRSVNRTQIHERRCQNLPGHMIMRPTLLNHM-------KTRMPTQQESEFTNSESE---- 300
V + R Q +P IM+PTL++ + Q T SESE
Sbjct: 241 VSGFKKPNQSNRASQKMP---IMKPTLMDQETETFDDDSSETEADQTPSATGSESEDEEV 300
Query: 301 -----------SVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVS-SRYRSSRISS 360
S S S W TQ ++TE+ S SS+ + SEVS S + R +S
Sbjct: 301 STSQEYSGETGSSSGSEWETQAENDTES---KSESSYPPQNDDSVSEVSTSPPHTDRDTS 360
Query: 361 KAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHG 420
+ K + +GRFKR++NK+G IFHHHHHHHHHHHH+ W +L+ FH
Sbjct: 361 R-----EPGKQRRNVMGRFKRIKNKIGQIFHHHHHHHHHHHHHDKEKPSAWNKLQSKFH- 420
Query: 421 TDKKRVTSKGRHETLKKT-AIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRL 474
K + SK R + ++ + + +++Q G F AL EGL H +K ++
Sbjct: 421 -HKHQEKSKERKRPMSESKGLTTHKQQHQGGHFHALVEGLVRH-------RKHSKKQKHQ 480
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9FFP2 | 4.0e-15 | 26.32 | Protein KOKOPELLI OS=Arabidopsis thaliana OX=3702 GN=KPL PE=2 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
XP_022154939.1 | 0.0 | 96.28 | protein KOKOPELLI isoform X2 [Momordica charantia] | [more] |
XP_022154937.1 | 0.0 | 96.08 | protein KOKOPELLI isoform X1 [Momordica charantia] >XP_022154938.1 protein KOKOP... | [more] |
XP_022154940.1 | 7.43e-301 | 95.60 | uncharacterized protein LOC111022084 isoform X3 [Momordica charantia] | [more] |
XP_022154941.1 | 1.31e-277 | 100.00 | protein KOKOPELLI isoform X4 [Momordica charantia] | [more] |
XP_022958322.1 | 3.11e-155 | 57.37 | uncharacterized protein LOC111459571 isoform X2 [Cucurbita moschata] >XP_0229583... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1DNR3 | 0.0 | 96.28 | protein KOKOPELLI isoform X2 OS=Momordica charantia OX=3673 GN=LOC111022084 PE=4... | [more] |
A0A6J1DLN1 | 0.0 | 96.08 | protein KOKOPELLI isoform X1 OS=Momordica charantia OX=3673 GN=LOC111022084 PE=4... | [more] |
A0A6J1DL21 | 3.60e-301 | 95.60 | uncharacterized protein LOC111022084 isoform X3 OS=Momordica charantia OX=3673 G... | [more] |
A0A6J1DQ76 | 6.36e-278 | 100.00 | protein KOKOPELLI isoform X4 OS=Momordica charantia OX=3673 GN=LOC111022084 PE=4... | [more] |
A0A6J1H2T7 | 1.51e-155 | 57.37 | uncharacterized protein LOC111459571 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |