Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTACTGTAGGAAGATAGCAGCGTACGTCCAAAATGACAAATTGTTGATACACTGCTTTCAAGATAGTTTATCTAGTTCGGCCTCTCGTTGGTACATGCAGTTAGAGAGCTCTCATGTTGGCTCGTGGAAGAATCTGGCCGACTCATTCCTAAAACAATATAAGCATAACATAGACATGGCTCCAGATCGCTTAGATTTACAGAGGATGGAGAAGAAGAGTATAGAAAGCTTTAAAGAGTATGCTCAAAGGTGGAGGGACACGGCAGCTCAAGTCCAACCACCTTTAACGGATAAGAAGCTATCTGCCATGTTCATCAATACCCTAAAACATCCTTTCTATGATCGGATGATAGGAAGCGCTTCCACAAATTTCTCTGATATTATGACCATCGGAGAAAGAATCGAATACGGGGTTAAACACGGGCGAATAACCAGTACTGCCGAGGAGCCATTAGCTGCAAAGAATGCAAGTAATTCCAAGAAGAAGGAAGGTGAGGTGCAAATGGTGGGAGTAGACCGATACTCTTGGAAACAACAATCGTACGGTCGGACATCGCGATACACTCCGTATTATTACCCAACGCCATACAGGTATAATCAACCATTTGTGAATAATGCAACTTCACATTACTCCCCTTATGCATCCCAAAATTTTCGACCCCCAACCAGTCAAAATTTCCAACCTAGGGGTTAGCAGCATAATACATTATATACTCAAGGGCAACAGACTAATAGAGGGGCACGTAAACAGACCCAGTTTGACCCAATCCCCATGACTTATACTGAGCTTATACCTCAGTTATTTCAAAATAATCAGCTGACACCTGTACATGTAGATCCGATCCAGCCTCCATACCCAAGATGGTATGATGCAAATGCTCGTTGTGACTATCACGCAAGAGTCATAGGGCATTCAACAAAGAACTGCACTGCATTGAAATACAGGGTTCAAGCTTTGATCAAGGCAGGTTGGTTGAACTTTAAAAAATAAAATGGGCCTGATGTAAGTAAGAATCCGCTGCCGAATCATCAGAATGTCCAAATAAATGCAATCGAATGCCAAGGGATCGAGTCGAAGAGTAAGGTTGTCGATATTACAACCCCTATAGAGGAACTATCCGAAATTCTCTTGGGCAGTGGATATGTATCAGTGGAATACCTATGCTCAAACCTCAAGTATAAATGGTATGATGAAAGTCTGACGTGTCCGTTCCACGCTGGGGCAAAAGGTCACTCCCTAGAGCAATGCAACAGTTTTCAGATGAAAGTTCAAGAGTTGTTAGACTAAAAAATCCTTACTGTTGCAAATTCTCACCAGAAGAAAGGGATAAATGTCGTGGAGGATGTCTCAGTTGCTGAAGGCTCAAGTGATGCGCTTAAGCCAAAACGTCTCACCATATTTTACCGTGAAAAGCCAGATGCACCCAGCTGCAGTCGGAAACCAATCACTATCACGGTCCCAGCTCCTTTCGAGTATAAAAGTTCCAAAGCAGTGCCTTGGAAATATGAGTGCAAGGTAACTGTAGGGCAAGAGGTATCATCTCCTTCACTCCCTGGGAGGTTTGACACGAACTGGAAGATGTTATACACCGGATAGCCTGCTAAAGCGCGTGAATGAAACTACTAGTGAAAAGAATAAAGAGAAAGCAAGTGAGAAGAAAAAGGAGAAAGTGGAAGAGGATAAGAAAGGAAAGGCCAAACTCCACGAGGATGTCCATGATGAATTGGTGGAGGCAATTGTTGTAAAGGATGTAAGTTTTAAACAATCTGTGTCCGAGGAAGAGACTCAAGAGTTTCTAAAGCTAGTGAAGCAAAGTGAATACAAAGTTATTGAACAATTAGGTCGGACACCAGCAAAGATCTCTATATTATCTTTATTGTTATCGTCTGAAGCACATCAGAATACACTATTGGAGGCCTTGAAGCAGGCTTTCGTTTCATAAGACATCACAGTGGATAATTTGAGGAATGTTGTGGGGAATATAACGGCATCTAGCTCCATCACTTTTACATATGAGGAGATACCACCAGAGGGTACAAGACATACCAGGGCTCTCCACATCTCAGTTAAGTGTAAAAACTACCTAGTAGCAAAAGTTCTGGTTGAAAATGGATCTTCTTTAAACATAATGCCGAGATCCACGCTAGAGAAGTTACCGGTCAATATGTCCCACATGAGGCCTAGTACTGTGATAGTAAGAGCTTTCGATGGAGCTCGCAGCACTGTTGTTGGGGATATTGAGATCCCGATCCAGATAGGTCCTTGCACCTTTGACATAACATTTCAGGTTATGGACATCACATCAGCTTATAGTTTCTTGCTGGGGCGACCGTGGATACATTCGGCAGGAGCAATTCCAGCCACTTGACATCAGAAAATTAAGTTTGCGGTTGACCAAAAGTTGGTGATCATATCGGGACAAGAAGACATTCTAGTCTCAAGGCTTGCTTAGATGCCATATGTTGAAGCAGCAGAAGAAGCTTTTGAATCTTCATTCCAATCATTTGAGATTGCAAATGCTACAACTTTACATGGGAAGTTTGGTAGACCTAAGCCATGACTTTTAGGAGCCGCCTTTAAAGGAGACAATGGGAGTTTAGACAAGCTACTGAGGATGGCTAAGAATACAAAGAAGTTCGGGTTGGGGTATAAACCAAGTAGAGGCGACATCATTAGAGTGCGGAATCTAGAAAAAGTAAAACGACTCTCAAGATTTGAGAATGAGGAGCGTGATTACCCTAGGAGGACTGTTCCACCTCTCAGCCTCTCTTTCAGAAGTGCCGGCACAATCCATCAGGAGTACGATGAGAGCTCTGTAGTGGCAGCAGTGACAGAAGAAAGAGAGCAAGTCGGACCTTTTGTTTACCCGTGCCCAGACGGTTTCAAGCTGAGCAATTGGAGCATGTTAGAGCTACCGTCTTTTGTAAGTAATAAATCAAAGTAATTGTCCCTTTTTTTTCCTTCCCCTCATTTTCTTAATTAATTAAATTAATTTTCTATTTCCTGTACATTCCATCCAGACCTATTATTTGTTTATAATTCTTCCATATCAATAAAGTTTGAGTTCATTCACTTTGTTTCTTTCCTCTACTATTCTCTGGGCCAATTTATTCATTTATTTCTGTTTCTTTTCTTTTCTCCTTTATTATGTCTTAAACAGTAATACTGAGATTGAATGCGATAATGATTCGAAATACGAGCTCGATACACCTATATACAATATTAAATCTGATGAGGAAATAGATGACGAGCCCTCTGCTGAGTTATTGAGAATGCTAGAAGAAGAAGAAAAGATGTTGGGACCCCATGAAGAATTAACTGAGACAATTAACTTGGGATCACAAGCCGAAGCCAAAGAGATTAAAATAGGCACTCATATGTCTTCAGAGAGTCGCAAAAAGTTGATAGAGTTACTTCATGAGTATGCTGATATTTTTGCTTGGTCCTATCAGGATATGCCTGGTTTAGATACAGACATCGTAGTGCACAAATTGCCGATCAACCCAGAGTTCAAGCCGGTGAGACAGAAGTTACGGAAAATGAGGCCAGAAATGTTGATCAAGATAAAGGATGAAGTAAGGAAGCAAATCGATGCAGGGTTCCTTACGGTATCTAATTACCCAGAGTGCGTGGCAAACATTGTCCATGTACCAAAGAAAAATGGACAAGTAAGAATGTGTGTGGATTATAGAGACTTGAATCGTGCAAGTCCAAAAGACAACTTCCCGCTTCCTCATATCGATGTTTTGATTGATAATACTGCTGGATTCTCAACCTTCTCCTTTATGGATGGGTTCTCAGGATACAATCAAATTAAGATGGCACCTGAGGATCGTGAAAAAACCACATTCATTACGCTATGGGGAACTTTCTGCTACAAAGTTATGCCATTTGGGCTAAAAAATGCTGGGGCAACCTACCAGCGTGCCATGGTTACTCTCTTTCATGACTTGATGCACAAAGAAATTGAAGTTTACGTGGATGATATGATTGCCAAGTCAAACAGGGCGTGGCGCACACAACTATTTTAAGGAAGTTGTTTGATCGATTGAGGAAGTTCAAATTGAAACTCAATCCCAATAAATGCATATTTCGGGCAACCACTGGAAAAATCCTGGGTTTTGTAGTAAGCCAAGAAGACATTAAAGTTGACCCTGATAAAGTCAAAGCAATCTTTAGAGATGCCACCTCCACAGACGCAGAAAGAAGTCAAAGGATGTCTAAGACGACTCAACTACATCGCAAGGTTCATATCTCACTTAACAGCAACTTGCGAATCCATCTTTAAATTGCTCCGCAAGAACAACGATGGGGTATGGAGTGAAGATTGTCAAGCAGCATTCGATAAGATTAAGCAGTATTTGCAAGACCCTCCAATTCTTGTGCCACCAACTCCAGGACGACCCCTTATTTTGTATCTCACAGTGACTGAAAACTCAATGGGATGTGTACTGGGGCAGCATGATGATTCAGGCAGGAAAGAACAGGCTATATATTACTTAAGTAAGAAGTTCACCGATTGCGAGACTAGATACTCTCAAGTAGAAAAAACTTGTTGTGCTCTAGCTTGGGCTGCCCGACGTCTAAGACAATACATGTTGTATTATACCACATGGCTCATTTCAAAGATGAACTCCATAAAGTACATTTTTGAAAAGCCGTCTCTCTCGGGTCGAATTGCAAGGTGGCAGGTTCTCTTGTCAGAATATGATATTGTCTATGTGACTCAAAAGGCCATTAAGGGGAGTGTTTTGGCCGACTACCTAGCCCAAAAACCTATAAATGACTACGTACCGATAAAGTTCGACTTTCCAGATGAGTATATCTCCACCATAACCGCAAGTGAGGAAAGTTTAGACCCACAAACTTGGACCATGATGTTTGATGGTGCCTCTAACGAGTTAGGTCATTGGATAGGGGCTATTTTGATATCACTCAAAGGGTAACTATACCCTGTTACCGCCAGATTATGTTTTGACTGCTCGTATAATATGGCCGAGTATGAAGCATGCTCGATGGGCGTCCAAGCTGCTATTGATATGAAGGTTAAGAAACTTAAAGTTTTTGGGGATTCTATGCTAGTAATTCATCAGCTAAGAGGAGAATGGGAAACAAGAGACGCTAAGTTGTTGCCTTACAAACAACTCATAACATAATTGTCACAAGAATTTGATGAAATCTCATTTGATTATTTGCCAAGAGAAAATAATCAAGTAGCAGATGCATTGGCCACATTAGCAGTGATGTTCAATTTAGAACTCAATACAGATGTCCGTCCGATTAAAGTTGGGAGGAGAAATGTCTCAGCTTCTTGTATGAGCATTGAGGAAGAACCCGACGGTAACCCCTGGTTTTATGACATCAAGCAGTATATCAAGAGTAAAGAATATCCACCAAATGCTTCAGAAAATGATAAGCGCACCCTCCGCAAGTTGGCAATGAAGTTTTTCTTAAACGGAGAGAACCATGACATGGTTCTCCTAAGGTGTGTCGAAGGAAGAGATGCCAATGAGATTATAGAGGAAATTCATGAAGGAGTTTGTGGCACTCATGCAAATGGACACATGATGGCTAGACAAATTTTAAGAGCCGGCTATTACTGGCTGACTATAGAGACAAATTGTATTAAATATGCAAGAAAATGTCACAAATGTCAAATTTACTCGGACAAGACTCATGCTCTTGCTTCTCATTTGCATACTTTGACAGCTCCTTGGCCTTTCTCTATGTGGGGCATGGATGTGATAGGACCTATCGAACCTAAAACATCAAATGGGCACTGA
mRNA sequence
ATGTACTGTAGGAAGATAGCAGCGTACGTCCAAAATGACAAATTGTTGATACACTGCTTTCAAGATAGTTTATCTAGTTCGGCCTCTCGTTGGTACATGCAGTTAGAGAGCTCTCATGTTGGCTCGTGGAAGAATCTGGCCGACTCATTCCTAAAACAATATAAGCATAACATAGACATGGCTCCAGATCGCTTAGATTTACAGAGGATGGAGAAGAAGAGTATAGAAAGCTTTAAAGAGTATGCTCAAAGGTGGAGGGACACGGCAGCTCAAGTCCAACCACCTTTAACGGATAAGAAGCTATCTGCCATGTTCATCAATACCCTAAAACATCCTTTCTATGATCGGATGATAGGAAGCGCTTCCACAAATTTCTCTGATATTATGACCATCGGAGAAAGAATCGAATACGGGGTTAAACACGGGCGAATAACCAGTACTGCCGAGGAGCCATTAGCTGCAAAGAATGCAAGTAATTCCAAGAAGAAGGAAGGTGAGGTGCAAATGCTGACACCTGTACATGTAGATCCGATCCAGCCTCCATACCCAAGATGGTATGATGCAAATGCTCGTTGTGACTATCACGCAAGAGTCATAGGGCATTCAACAAAGAACTGCACTGCATTGAAATACAGGGTTCAAGCTTTGATCAAGGCAGGGATCGAGTCGAAGAGTAAGGTTGTCGATATTACAACCCCTATAGAGGAACTATCCGAAATTCTCTTGGGCAGTGGATATAAGAAAGGGATAAATGTCGTGGAGGATGTCTCAGTTGCTGAAGGCTCAAGTGATGCGCTTAAGCCAAAACGTCTCACCATATTTTACCGTGAAAAGCCAGATGCACCCAGCTGCAGTCGGAAACCAATCACTATCACGGTCCCAGCTCCTTTCGAGTATAAAAGTTCCAAAGCAGTGCCTTGGAAATATGAGTGCAAGGTAACTGTAGGGCAAGAGGATGTAAGTTTTAAACAATCTGTGTCCGAGGAAGAGACTCAAGAGTTTCTAAAGCTAGTGAAGCAAAGTGAATACAAAGTTATTGAACAATTAGTGGATAATTTGAGGAATGTTGTGGGGAATATAACGGCATCTAGCTCCATCACTTTTACATATGAGGAGATACCACCAGAGGGTACAAGACATACCAGGGCTCTCCACATCTCAGTTAAGTGTAAAAACTACCTAGTAGCAAAAGTTCTGGTTGAAAATGGATCTTCTTTAAACATAATGCCGAGATCCACGCTAGAGAAGTTACCGGTCAATATGTCCCACATGAGGCCTAGTACTGTGATAGTAAGAGCTTTCGATGGAGCTCGCAGCACTGTTGTTGGGGATATTGAGATCCCGATCCAGATAGGAGCCGCCTTTAAAGGAGACAATGGGAGTTTAGACAAGCTACTGAGGATGGCTAAGAATACAAAGAAGTTCGGGTTGGGGTATAAACCAAGTAGAGGCGACATCATTAGAGTGCGGAATCTAGAAAAAGTAAAACGACTCTCAAGATTTGAGAATGAGGAGCGTGATTACCCTAGGAGGACTGTTCCACCTCTCAGCCTCTCTTTCAGAAGTGCCGGCACAATCCATCAGGAGTACGATGAGAGCTCTGTAGTGGCAGCAGTGACAGAAGAAAGAGAGCAAGTCGGACCTTTTGTTTACCCGTGCCCAGACGGTTTCAAGCTGAGCAATTGGAGCATTAATACTGAGATTGAATGCGATAATGATTCGAAATACGAGCTCGATACACCTATATACAATATTAAATCTGATGAGGAAATAGATGACGAGCCCTCTGCTGAGTTATTGAGAATGCTAGAAGAAGAAGAAAAGATGTTGGGACCCCATGAAGAATTAACTGAGACAATTAACTTGGGATCACAAGCCGAAGCCAAAGAGATTAAAATAGGCACTCATATGTCTTCAGAGAGTCGCAAAAAGTTGATAGAGTTACTTCATGAGAAAGAACAGGCTATATATTACTTAAGTAAGAAGTTCACCGATTGCGAGACTAGATACTCTCAAGTAGAAAAAACTTGTTGTGCTCTAGCTTGGGCTGCCCGACGTCTAAGACAATACATGTTGTATTATACCACATGGCTCATTTCAAAGATGAACTCCATAAAGTACATTTTTGAAAAGCCGTCTCTCTCGGGTCGAATTGCAAGGTGGCAGGTTCTCTTGTCAGAATATGATATTGTCTATGTGACTCAAAAGGCCATTAAGGGGAGTGTTTTGGCCGACTACCTAGCCCAAAAACCTATAAATGACTACGTACCGATAAAGTTCGACTTTCCAGATGAGTATATCTCCACCATAACCGCAAGTGAGGAAAGTTTAGACCCACAAACTTGGACCATGATATTATGTTTTGACTGCTCGTATAATATGGCCGAGTATGAAGCATGCTCGATGGGCGTCCAAGCTGCTATTGATATGAAGGTTAAGAAACTTAAAGTTTTTGGGGATTCTATGCTAGTAATTCATCAGCTAAGAGGAGAATGGGAAACAAGAGACGCTAAAGAAAATAATCAAGTAGCAGATGCATTGGCCACATTAGCAGTGATGTTCAATTTAGAACTCAATACAGATGTCCGTCCGATTAAAGTTGGGAGGAGAAATGTCTCAGCTTCTTGTATGAGCATTGAGGAAGAACCCGACGGTAACCCCTGGTTTTATGACATCAAGCAGTATATCAAGAGTAAAGAATATCCACCAAATGCTTCAGAAAATGATAAGCGCACCCTCCGCAAGTTGGCAATGAAGTTTTTCTTAAACGGAGAGAACCATGACATGGTTCTCCTAAGGTGTGTCGAAGGAAGAGATGCCAATGAGATTATAGAGGAAATTCATGAAGGAGTTTGTGGCACTCATGCAAATGGACACATGATGGCTAGACAAATTTTAAGAGCCGGCTATTACTGGCTGACTATAGAGACAAATTGTATTAAATATGCAAGAAAATGTCACAAATGTCAAATTTACTCGGACAAGACTCATGCTCTTGCTTCTCATTTGCATACTTTGACAGCTCCTTGGCCTTTCTCTATGTGGGGCATGGATGTGATAGGACCTATCGAACCTAAAACATCAAATGGGCACTGA
Coding sequence (CDS)
ATGTACTGTAGGAAGATAGCAGCGTACGTCCAAAATGACAAATTGTTGATACACTGCTTTCAAGATAGTTTATCTAGTTCGGCCTCTCGTTGGTACATGCAGTTAGAGAGCTCTCATGTTGGCTCGTGGAAGAATCTGGCCGACTCATTCCTAAAACAATATAAGCATAACATAGACATGGCTCCAGATCGCTTAGATTTACAGAGGATGGAGAAGAAGAGTATAGAAAGCTTTAAAGAGTATGCTCAAAGGTGGAGGGACACGGCAGCTCAAGTCCAACCACCTTTAACGGATAAGAAGCTATCTGCCATGTTCATCAATACCCTAAAACATCCTTTCTATGATCGGATGATAGGAAGCGCTTCCACAAATTTCTCTGATATTATGACCATCGGAGAAAGAATCGAATACGGGGTTAAACACGGGCGAATAACCAGTACTGCCGAGGAGCCATTAGCTGCAAAGAATGCAAGTAATTCCAAGAAGAAGGAAGGTGAGGTGCAAATGCTGACACCTGTACATGTAGATCCGATCCAGCCTCCATACCCAAGATGGTATGATGCAAATGCTCGTTGTGACTATCACGCAAGAGTCATAGGGCATTCAACAAAGAACTGCACTGCATTGAAATACAGGGTTCAAGCTTTGATCAAGGCAGGGATCGAGTCGAAGAGTAAGGTTGTCGATATTACAACCCCTATAGAGGAACTATCCGAAATTCTCTTGGGCAGTGGATATAAGAAAGGGATAAATGTCGTGGAGGATGTCTCAGTTGCTGAAGGCTCAAGTGATGCGCTTAAGCCAAAACGTCTCACCATATTTTACCGTGAAAAGCCAGATGCACCCAGCTGCAGTCGGAAACCAATCACTATCACGGTCCCAGCTCCTTTCGAGTATAAAAGTTCCAAAGCAGTGCCTTGGAAATATGAGTGCAAGGTAACTGTAGGGCAAGAGGATGTAAGTTTTAAACAATCTGTGTCCGAGGAAGAGACTCAAGAGTTTCTAAAGCTAGTGAAGCAAAGTGAATACAAAGTTATTGAACAATTAGTGGATAATTTGAGGAATGTTGTGGGGAATATAACGGCATCTAGCTCCATCACTTTTACATATGAGGAGATACCACCAGAGGGTACAAGACATACCAGGGCTCTCCACATCTCAGTTAAGTGTAAAAACTACCTAGTAGCAAAAGTTCTGGTTGAAAATGGATCTTCTTTAAACATAATGCCGAGATCCACGCTAGAGAAGTTACCGGTCAATATGTCCCACATGAGGCCTAGTACTGTGATAGTAAGAGCTTTCGATGGAGCTCGCAGCACTGTTGTTGGGGATATTGAGATCCCGATCCAGATAGGAGCCGCCTTTAAAGGAGACAATGGGAGTTTAGACAAGCTACTGAGGATGGCTAAGAATACAAAGAAGTTCGGGTTGGGGTATAAACCAAGTAGAGGCGACATCATTAGAGTGCGGAATCTAGAAAAAGTAAAACGACTCTCAAGATTTGAGAATGAGGAGCGTGATTACCCTAGGAGGACTGTTCCACCTCTCAGCCTCTCTTTCAGAAGTGCCGGCACAATCCATCAGGAGTACGATGAGAGCTCTGTAGTGGCAGCAGTGACAGAAGAAAGAGAGCAAGTCGGACCTTTTGTTTACCCGTGCCCAGACGGTTTCAAGCTGAGCAATTGGAGCATTAATACTGAGATTGAATGCGATAATGATTCGAAATACGAGCTCGATACACCTATATACAATATTAAATCTGATGAGGAAATAGATGACGAGCCCTCTGCTGAGTTATTGAGAATGCTAGAAGAAGAAGAAAAGATGTTGGGACCCCATGAAGAATTAACTGAGACAATTAACTTGGGATCACAAGCCGAAGCCAAAGAGATTAAAATAGGCACTCATATGTCTTCAGAGAGTCGCAAAAAGTTGATAGAGTTACTTCATGAGAAAGAACAGGCTATATATTACTTAAGTAAGAAGTTCACCGATTGCGAGACTAGATACTCTCAAGTAGAAAAAACTTGTTGTGCTCTAGCTTGGGCTGCCCGACGTCTAAGACAATACATGTTGTATTATACCACATGGCTCATTTCAAAGATGAACTCCATAAAGTACATTTTTGAAAAGCCGTCTCTCTCGGGTCGAATTGCAAGGTGGCAGGTTCTCTTGTCAGAATATGATATTGTCTATGTGACTCAAAAGGCCATTAAGGGGAGTGTTTTGGCCGACTACCTAGCCCAAAAACCTATAAATGACTACGTACCGATAAAGTTCGACTTTCCAGATGAGTATATCTCCACCATAACCGCAAGTGAGGAAAGTTTAGACCCACAAACTTGGACCATGATATTATGTTTTGACTGCTCGTATAATATGGCCGAGTATGAAGCATGCTCGATGGGCGTCCAAGCTGCTATTGATATGAAGGTTAAGAAACTTAAAGTTTTTGGGGATTCTATGCTAGTAATTCATCAGCTAAGAGGAGAATGGGAAACAAGAGACGCTAAAGAAAATAATCAAGTAGCAGATGCATTGGCCACATTAGCAGTGATGTTCAATTTAGAACTCAATACAGATGTCCGTCCGATTAAAGTTGGGAGGAGAAATGTCTCAGCTTCTTGTATGAGCATTGAGGAAGAACCCGACGGTAACCCCTGGTTTTATGACATCAAGCAGTATATCAAGAGTAAAGAATATCCACCAAATGCTTCAGAAAATGATAAGCGCACCCTCCGCAAGTTGGCAATGAAGTTTTTCTTAAACGGAGAGAACCATGACATGGTTCTCCTAAGGTGTGTCGAAGGAAGAGATGCCAATGAGATTATAGAGGAAATTCATGAAGGAGTTTGTGGCACTCATGCAAATGGACACATGATGGCTAGACAAATTTTAAGAGCCGGCTATTACTGGCTGACTATAGAGACAAATTGTATTAAATATGCAAGAAAATGTCACAAATGTCAAATTTACTCGGACAAGACTCATGCTCTTGCTTCTCATTTGCATACTTTGACAGCTCCTTGGCCTTTCTCTATGTGGGGCATGGATGTGATAGGACCTATCGAACCTAAAACATCAAATGGGCACTGA
Protein sequence
MYCRKIAAYVQNDKLLIHCFQDSLSSSASRWYMQLESSHVGSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSIESFKEYAQRWRDTAAQVQPPLTDKKLSAMFINTLKHPFYDRMIGSASTNFSDIMTIGERIEYGVKHGRITSTAEEPLAAKNASNSKKKEGEVQMLTPVHVDPIQPPYPRWYDANARCDYHARVIGHSTKNCTALKYRVQALIKAGIESKSKVVDITTPIEELSEILLGSGYKKGINVVEDVSVAEGSSDALKPKRLTIFYREKPDAPSCSRKPITITVPAPFEYKSSKAVPWKYECKVTVGQEDVSFKQSVSEEETQEFLKLVKQSEYKVIEQLVDNLRNVVGNITASSSITFTYEEIPPEGTRHTRALHISVKCKNYLVAKVLVENGSSLNIMPRSTLEKLPVNMSHMRPSTVIVRAFDGARSTVVGDIEIPIQIGAAFKGDNGSLDKLLRMAKNTKKFGLGYKPSRGDIIRVRNLEKVKRLSRFENEERDYPRRTVPPLSLSFRSAGTIHQEYDESSVVAAVTEEREQVGPFVYPCPDGFKLSNWSINTEIECDNDSKYELDTPIYNIKSDEEIDDEPSAELLRMLEEEEKMLGPHEELTETINLGSQAEAKEIKIGTHMSSESRKKLIELLHEKEQAIYYLSKKFTDCETRYSQVEKTCCALAWAARRLRQYMLYYTTWLISKMNSIKYIFEKPSLSGRIARWQVLLSEYDIVYVTQKAIKGSVLADYLAQKPINDYVPIKFDFPDEYISTITASEESLDPQTWTMILCFDCSYNMAEYEACSMGVQAAIDMKVKKLKVFGDSMLVIHQLRGEWETRDAKENNQVADALATLAVMFNLELNTDVRPIKVGRRNVSASCMSIEEEPDGNPWFYDIKQYIKSKEYPPNASENDKRTLRKLAMKFFLNGENHDMVLLRCVEGRDANEIIEEIHEGVCGTHANGHMMARQILRAGYYWLTIETNCIKYARKCHKCQIYSDKTHALASHLHTLTAPWPFSMWGMDVIGPIEPKTSNGH
Homology
BLAST of Moc01g14690 vs. NCBI nr
Match:
XP_022143495.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111013372 [Momordica charantia])
HSP 1 Score: 1577.4 bits (4083), Expect = 0.0e+00
Identity = 941/1755 (53.62%), Postives = 978/1755 (55.73%), Query Frame = 0
Query: 1 MYCRKIAAYVQNDKLLIHCFQDSLSSSASRWYMQLESSHVGSWKNLADSFLKQYKHNIDM 60
MYCRK+AAYVQNDKLLIHCFQDSLSS ASRWYMQL+SSHVGSWKNLADSFLKQYKHNIDM
Sbjct: 137 MYCRKMAAYVQNDKLLIHCFQDSLSSPASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDM 196
Query: 61 APDRLDLQRMEKKSIESFKEYAQRWRDTAAQVQPPLTDKKLSAMFINTLKHPFYDRMIGS 120
APDRLDLQRMEKKS ESFKEYAQRWRDTAAQVQPPLTDK+LS MFINTLKHPFYDRM+GS
Sbjct: 197 APDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLTDKELSXMFINTLKHPFYDRMVGS 256
Query: 121 ASTNFSDIMTIGERIEYGVKHGRITSTAEEPLAAKNASNSKKKEGEVQMLTPVHVDPIQP 180
ASTNFSDIM IGERIEYGV+HGRITSTA+EPLAAK S+SKKKEGE L V VDPIQP
Sbjct: 257 ASTNFSDIMAIGERIEYGVRHGRITSTADEPLAAKKTSHSKKKEGE---LAHVPVDPIQP 316
Query: 181 PYPRWYDANARCDYHARVIGHSTKNCTALKYRVQALIKAG-------------------- 240
PYPRW DANARCDYH IGHS +NCTALKYRVQALIKAG
Sbjct: 317 PYPRWCDANARCDYHTGAIGHSIENCTALKYRVQALIKAGWLNFKKENGPBVSNNPLPNH 376
Query: 241 ------------IESKSKVVDITTPIEELSEILLGSGY---------------------- 300
IESKSKV DITTP+EEL EILLGSGY
Sbjct: 377 XNVQINAIECQEIESKSKVADITTPMEELFEILLGSGYVSVEYLCPNLKYKGYDESLTCP 436
Query: 301 ------------------------------------KKGINVVEDVSV-----AEGSSDA 360
KKGINVVEDVSV AEGSSDA
Sbjct: 437 FHAGAKGHALEQCNSFRMIVQELLDSKILTVANSHQKKGINVVEDVSVAEGSIAEGSSDA 496
Query: 361 LKPKRLTIFYREKPDAPSCSRKPITITVPAPFEYKSSKAVPWKYECKVTVGQE------- 420
LKPKRLTIFY EKPDAP+CSRKPITITVPAPFEYKSSKAVPWKYECKVTVGQ+
Sbjct: 497 LKPKRLTIFYSEKPDAPNCSRKPITITVPAPFEYKSSKAVPWKYECKVTVGQDVSSPPLP 556
Query: 421 ------------------------------------------------------------ 480
Sbjct: 557 VDNITGVGGLTXTGRCYTPDSLLKRVSETTSEKNKEKASEKKKEKVEEDKKGKAKLHEDV 616
Query: 481 -----------DVSFKQSVSEEETQEFLKLVKQSEYKVIEQL------------------ 540
DVS KQ V EEE QEFLKLVKQSEYKV EQL
Sbjct: 617 HDELVEAIVVKDVSPKQHVFEEEIQEFLKLVKQSEYKVTEQLGRTPAKISILSLLLSSEA 676
Query: 541 -------------------VDNLRNVVGNITASSSITFTYEEIPPEGTRHTRALHISVKC 600
VDNL NVVGNITASSSITFT EEIPPEGT HT+ALHISVKC
Sbjct: 677 HRNTLLEXLKQAFVSQDITVDNLSNVVGNITASSSITFTDEEIPPEGTGHTKALHISVKC 736
Query: 601 KNYLVAKVLVENGSSLNIMPRSTLEKLPVNMSHMRPSTVIVRAFDGARSTVVGDIEIPIQ 660
KN+L+AKVLV+NGSSLNIMPRSTLEKLPV+MSHMRPSTVIVRAFDGARS VVGDIEIPIQ
Sbjct: 737 KNFLIAKVLVDNGSSLNIMPRSTLEKLPVDMSHMRPSTVIVRAFDGARSAVVGDIEIPIQ 796
Query: 661 IG---------------------------------------------------------- 720
IG
Sbjct: 797 IGPCTFDITFQVMDITSTYSFLLGRLWIHSAGAVPSTLHQKIKFAVDQKLVIISGQEDIL 856
Query: 721 ---------------------------------------------AAFKGDNGSLDKLLR 780
AFKGDN SLDKLLR
Sbjct: 857 VSRLASMPYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGDNESLDKLLR 916
Query: 781 MAKNTKKFGLGYKPSRGDIIRVRNLEKVKRLSRFENEERDYPRRTVPPLSLSFRSAGTIH 840
MAKNTKKFGLGYKPSRGDIIRVR+LEK KRLSRFENEERDYPRRTVPPLS SFRSAGTIH
Sbjct: 917 MAKNTKKFGLGYKPSRGDIIRVRSLEKAKRLSRFENEERDYPRRTVPPLSHSFRSAGTIH 976
Query: 841 QEYDESSVVAAVTEEREQVGPFVYPCPDGFKLSNWSINTEIECDNDSKYELDTPIYNIKS 900
QEYD SSVVAAVTEEREQV PFVYPCPDGF+LSNWS+NTEIECDNDSKYELDTPIYNI+S
Sbjct: 977 QEYDGSSVVAAVTEEREQVRPFVYPCPDGFELSNWSVNTEIECDNDSKYELDTPIYNIES 1036
Query: 901 DEEIDDEPSAELLRMLEEEEKMLGPHEELTETINLGSQAEAKEIKIGTHMSSESRKKLIE 960
D+EIDDEPSAELLRMLEEEEKMLGPHEELTET+NLGSQAEAKEIKIGTHMSSESRKKLIE
Sbjct: 1037 DKEIDDEPSAELLRMLEEEEKMLGPHEELTETLNLGSQAEAKEIKIGTHMSSESRKKLIE 1096
Query: 961 LLHE-------------------------------------------------------- 1020
LLHE
Sbjct: 1097 LLHEYADVFAWSYQDMPGLDTDIVVHKLQINPKFKPVRQKLRKMRPDMLIKIKDEVRKQI 1156
Query: 1021 ------------------------------------------------------------ 1032
Sbjct: 1157 DAGFLTISNYPEWVANIVPVPKKNGQVRMCVDYRDLNRASPKDNFPLPHIDVLVDNTAGF 1216
BLAST of Moc01g14690 vs. NCBI nr
Match:
XP_022147189.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111016200 [Momordica charantia])
HSP 1 Score: 1550.0 bits (4012), Expect = 0.0e+00
Identity = 948/1873 (50.61%), Postives = 985/1873 (52.59%), Query Frame = 0
Query: 1 MYCRKIAAYVQNDKLLIHCFQDSLSSSASRWYMQLESSHVGSWKNLADSFLKQYKHNIDM 60
MYCRK+AAYVQNDKLLIHCFQDSLS ASRWYMQL+SSHVGSWKNLADSFLKQYKHNIDM
Sbjct: 126 MYCRKMAAYVQNDKLLIHCFQDSLSGPASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDM 185
Query: 61 APDRLDLQRMEKKSIESFKEYAQRWRDTAAQVQPPLTDKKLSAMFINTLKHPFYDRMIGS 120
APDRLDLQRMEKKS +SFKEYAQRWRDTAAQVQPPL DK+LSAMFINTLKHPFYDRMIGS
Sbjct: 186 APDRLDLQRMEKKSTKSFKEYAQRWRDTAAQVQPPLIDKELSAMFINTLKHPFYDRMIGS 245
Query: 121 ASTNFSDIMTIGERIEYGVKHGRITSTAEEPLAAKNASNSKKKEGEVQM----------- 180
ASTNFSDIMTIGERIEYGV+HGRITST +EPLAAK AS+SKKKEGEVQM
Sbjct: 246 ASTNFSDIMTIGERIEYGVRHGRITSTTDEPLAAKKASHSKKKEGEVQMVGADRHSWKQQ 305
Query: 181 ------------------------------------------------------------ 240
Sbjct: 306 PYRRTPQYSPYYYPTPYGYNQPFVNNATSHYYPYASQNFRPPASQNFQLTPTSQNFQPRG 365
Query: 241 ------------------------------------------LTPVHVDPIQPPYPRWYD 300
L PV VDPIQPPYPRWYD
Sbjct: 366 QQHNTFYTQGQQNNRGARKQTQFDPIPMTYTELLPQLFQNNQLAPVPVDPIQPPYPRWYD 425
Query: 301 ANARCDYHARVIGHSTKNCTALKYRVQALIKA---------------------------- 360
ANARCDYHA I HST+NCT LKYRVQALIKA
Sbjct: 426 ANARCDYHAGAIXHSTENCTXLKYRVQALIKAGWXNFKKENGXDVSKXXLXNHQNVQINA 485
Query: 361 ----GIESKSKVVDITTPIEELSEILLGSGY----------------------------- 420
GIESKSKV +ITTP+ EL EILLGSGY
Sbjct: 486 IECQGIESKSKVABITTPMXELFEILLGSGYISVEYLCPKYKGYDESLTCXFHXGAKGHS 545
Query: 421 ---------------------------KKGINVVEDVSVAEGSSDALKPKRLTIFYREKP 480
KK NVVED+ VAEGSSD+LKPK LTIFYREKP
Sbjct: 546 LEQCNXFRMKVQELLDSKILTXANSHXKKXTNVVEDILVAEGSSDSLKPKPLTIFYREKP 605
Query: 481 DAPSCSRKPITITVPAPFEYKSSKAVPWKYECKVTVGQE--------------------- 540
DAPSCSRKP ITVP PFEYKSSKAVPWKYECKVTVGQ+
Sbjct: 606 DAPSCSRKPXXITVPXPFEYKSSKAVPWKYECKVTVGQDVSSPSLPVDNITGVGGLTRTG 665
Query: 541 ---------------------------------------------------------DVS 600
DVS
Sbjct: 666 RCYTPDSLLKRVNETTSEKNKEKASEKKKEKVEEDKKGKAKLHEDARDELVEAIVVKDVS 725
Query: 601 FKQSVSEEETQEFLKLVKQSEYKVIEQL-------------------------------- 660
KQ +SEEETQEFLKLVKQSEYKVIEQL
Sbjct: 726 PKQPMSEEETQEFLKLVKQSEYKVIEQLGRTPANISILSLLLSSEAHQNALLEALKQAFV 785
Query: 661 -----VDNLRNVVGNITASSSITFTYEEIPPEGTRHTRALHISVKCKNYLVAKVLVENGS 720
VDNL NVVGNITASSSI+FT EEIPPEGT HT+ALHISVKCKN+L+AKVLV+NGS
Sbjct: 786 SQDITVDNLSNVVGNITASSSISFTDEEIPPEGTGHTKALHISVKCKNFLIAKVLVDNGS 845
Query: 721 SLNIMPRSTLEKLPVNMSHMRPSTVIVRAFDGARSTVVGDIEIPIQIG------------ 780
SLNIMPRSTLEKLPV+MSHMRPSTVIVRAFDGARS VVGDIEIPIQIG
Sbjct: 846 SLNIMPRSTLEKLPVDMSHMRPSTVIVRAFDGARSAVVGDIEIPIQIGPCTFDITFQVMD 905
Query: 781 ------------------------------------------------------------ 840
Sbjct: 906 ITSAYSFLLGRPWIHSAGAVPSTLHQKIKFAVDQKLVIISGQEDILVSRFASMSYVEVAE 965
Query: 841 -------------------------------AAFKGDNGSLDKLLRMAKNTKKFGLGYKP 900
AFKGDNGSLDKLLRMAKNTKKFGLGYKP
Sbjct: 966 EAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGDNGSLDKLLRMAKNTKKFGLGYKP 1025
Query: 901 SRGDIIRVRNLEKVKRLSRFENEERDYPRRTVPPLSLSFRSAGTIHQEYDESSVVAAVTE 960
SRGDIIRVR+LEK KRLSRFENEERDYPRR VPPL+ SFRSAGTIHQEYDESSVVAAVTE
Sbjct: 1026 SRGDIIRVRSLEKAKRLSRFENEERDYPRRIVPPLTHSFRSAGTIHQEYDESSVVAAVTE 1085
Query: 961 EREQVGPFVYPCPDGFKLSNWSI------------NTEIECDNDSKYELDTPIYNIKSDE 1020
EREQVGPFVY CPDGF+LSNWS+ NTEIECDNDSKYELDTPIY I+SDE
Sbjct: 1086 EREQVGPFVYLCPDGFELSNWSVIKLPSFVNNKSNNTEIECDNDSKYELDTPIYIIESDE 1145
Query: 1021 EIDDEPSAELLRMLEEEEKMLGPHEELTETINLGSQAEAKEIKIGTHMSSESRKKLIELL 1032
EIDDEPSAELLRMLEEEEKMLGPHEELTET+NLGSQAEAKEIKIGTHMSSESRKKLIELL
Sbjct: 1146 EIDDEPSAELLRMLEEEEKMLGPHEELTETLNLGSQAEAKEIKIGTHMSSESRKKLIELL 1205
BLAST of Moc01g14690 vs. NCBI nr
Match:
XP_022158986.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111025431 [Momordica charantia])
HSP 1 Score: 1310.4 bits (3390), Expect = 0.0e+00
Identity = 791/1529 (51.73%), Postives = 878/1529 (57.42%), Query Frame = 0
Query: 1 MYCRKIAAYVQNDKLLIHCFQDSLSSSASRWYMQLESSHVGSWKNLADSFLKQYKHNIDM 60
MYCRK+AAYVQNDKLLIHCFQDSLS ASRWYMQL+SS+VGSWKNLADSFLKQYKHNIDM
Sbjct: 86 MYCRKMAAYVQNDKLLIHCFQDSLSGPASRWYMQLDSSNVGSWKNLADSFLKQYKHNIDM 145
Query: 61 APDRLDLQRMEKKSIESFKEYAQRWRDTAAQVQPPLTDKKLSAMFINTLKHPFYDRMIGS 120
APDRLDLQRMEKKS ESFKEYAQRWRDTAAQVQPPLTDK+LSAMFINTLKHPFYDRMIG+
Sbjct: 146 APDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLTDKELSAMFINTLKHPFYDRMIGN 205
Query: 121 ASTNFSDIMTIGERIEYGVKHGRITSTAEEPLAAKNASNSKKKEGEVQM----------- 180
ASTNFSDIMTIGERIEYGV+HGRITST +EPLAAK AS+SKKKEGEVQM
Sbjct: 206 ASTNFSDIMTIGERIEYGVRHGRITSTVDEPLAAKKASHSKKKEGEVQMVGADRHSWKQQ 265
Query: 181 ------------------------------------------------------------ 240
Sbjct: 266 PYSRTPRYTPYYYPTPYGYNQPFVNNATSHYSPYTFQNFRPPASQNFQPTPASQNFQPRG 325
Query: 241 ------------------------------------------LTPVHVDPIQPPYPRWYD 300
L PV VDPIQPPYPRWYD
Sbjct: 326 QQHNTLYTQEQQTNRGARKQTQFDPIPMTYTELLPQLFQNNQLAPVPVDPIQPPYPRWYD 385
Query: 301 ANARCDYHARVIGHSTKNCTALKYRVQALIKAG--------------------------- 360
NARCDYHA IGHST+NCTALKYRVQALIKAG
Sbjct: 386 TNARCDYHAGAIGHSTENCTALKYRVQALIKAGWLNFKKENGPDVSKNPLPNHQNVQINA 445
Query: 361 -----IESKSKVVDITTPIEELSEILLGSGY----------------------------- 420
IESKSKV DI TP+ EL EILLGSGY
Sbjct: 446 IECQEIESKSKVADIRTPMVELFEILLGSGYVSVEYLCPNLKYKGYDESLTCPFHAGAKG 505
Query: 421 -----------------------------KKGINVVEDVSVAEGSSDALKPKRLTIFYRE 480
KKGIN+VEDVSVAEGSSDALKPK LTIFY E
Sbjct: 506 HSLEQCNSFRMKVQELLDSKILTVANSHQKKGINIVEDVSVAEGSSDALKPKCLTIFYSE 565
Query: 481 KPDAPSCSRKPITITVPAPFEYKSSKAVPWKYECKVTVGQE------------------- 540
KP+AP+CSRKPITITVPAPFEYKSSKAVPWKY+CKVTVGQ+
Sbjct: 566 KPNAPNCSRKPITITVPAPFEYKSSKAVPWKYQCKVTVGQDVSSPPLPIDNITGVGGLTR 625
Query: 541 -----------------------------------------------------------D 600
D
Sbjct: 626 TGRCYTPDSLLKCVNETTSEKNKEKASEKKKEKVEEDKKGKAKLHEDVHDELVEAIVVKD 685
Query: 601 VSFKQSVSEEETQEFLKLVKQSEYKVIEQL------------------------------ 660
VS KQ +SEEETQE LKLVKQSEYKVIEQL
Sbjct: 686 VSPKQPMSEEETQEILKLVKQSEYKVIEQLGRTPAKISILSLLLSSEAHRNALLEALKQA 745
Query: 661 -------VDNLRNVVGNITASSSITFTYEEIPPEGTRHTRALHISVKCKNYLVAKVLVEN 720
VDNL NVVGNI+ +SSITFT EEIPPEGT HT+ALHIS+KCKN+L+AKVLV+N
Sbjct: 746 FVSQDITVDNLSNVVGNISXASSITFTDEEIPPEGTGHTKALHISIKCKNFLIAKVLVDN 805
Query: 721 GSSLNIMPRSTLEKLPVNMSHMRPSTVIVRAFDGARSTVVGDIEIPIQIGAAFKGDNGSL 780
GSSLNIMPRSTLEKLPV+MSHMRPSTVIVRAFDGARS VVGDIEIPIQIG +
Sbjct: 806 GSSLNIMPRSTLEKLPVDMSHMRPSTVIVRAFDGARSAVVGDIEIPIQIGPC------TF 865
Query: 781 DKLLRMAKNTKKF----GLGYKPSRGDIIRVRNLEKVKRLSRFENEERDYPRR------T 840
D ++ T + G + S G + + +K+K + RD R +
Sbjct: 866 DITFQVMDITSAYSFLLGRPWIHSAGAVPSTLH-QKIKFAVDQNVDYRDLNRASPKDNFS 925
Query: 841 VPPLSL-----SFRSAGTIHQEYDESSVVAAVTEEREQV------GPFVYP-CPDGFKLS 900
+P + + + S + + + + E+RE+ G F Y P G K +
Sbjct: 926 LPHIDVLVDNTTGFSTFSFMDGFSGYNQIKMAPEDREKTTLITLWGTFCYKVMPFGLKNA 985
Query: 901 NWSINTEIE------CDNDSKYELDTPIYNIKSDEE------------------------ 960
+ + + + +D I K EE
Sbjct: 986 GATYQRAMVTLFHDLMHKEIEVYVDDMIAKSKQGEEHTTILRKLFDRLRKFKLKLNPNKC 1045
Query: 961 IDDEPSAELLRMLEEEEKMLGPHEELTETINLGSQAEAKEI-----------KIGTHMSS 1020
I + +LL + +E + +++ + + KE+ ++ +H+++
Sbjct: 1046 IFGATTGKLLGFVVSQEDIKVDLDKVKAILEMPPPQTQKEVREFLGRLNYIARLISHLTA 1105
Query: 1021 ESR----------------------KKLIELLHE-------------------------- 1032
K+ + L +
Sbjct: 1106 TCEPIFKLLRKNNDGVWSEDCQAAFDKIKQYLQDPPILVPPTPGRPLILYLTVTENSMGC 1165
BLAST of Moc01g14690 vs. NCBI nr
Match:
XP_022157796.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111024415 [Momordica charantia])
HSP 1 Score: 1182.9 bits (3059), Expect = 0.0e+00
Identity = 697/1297 (53.74%), Postives = 783/1297 (60.37%), Query Frame = 0
Query: 1 MYCRKIAAYVQNDKLLIHCFQDSLSSSASRWYMQLESSHVGSWKNLADSFLKQYKHNIDM 60
MYCRK+ AYVQN KLLIHCFQDSL ASRWYMQL+SSHVGSWKNLADSFLKQYKHNIDM
Sbjct: 195 MYCRKMXAYVQNXKLLIHCFQDSLXGXASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDM 254
Query: 61 APDRLDLQRMEKKSIESFKEYAQRWRDTAAQVQPPLTDKKLSAMFINTLKHPFYDRMIGS 120
APDRLDLQRMEK S ESFKEYAQRWRDTAAQVQPPLTDK+LSAMFINTLKHPFYDRMIGS
Sbjct: 255 APDRLDLQRMEKNSTESFKEYAQRWRDTAAQVQPPLTDKELSAMFINTLKHPFYDRMIGS 314
Query: 121 ASTNFSDIMTIGERIEYGVKHGRITSTAEEPLAAKNASNSKKKEGEVQMLTPVHVDPIQP 180
ASTNFSDIMTIGERIEYGV+H RITSTA+EPLAAK AS+SKKKEGE LTPV VDPIQP
Sbjct: 315 ASTNFSDIMTIGERIEYGVRHKRITSTADEPLAAKKASHSKKKEGE---LTPVPVDPIQP 374
Query: 181 PYPRWYDANARCDYHARVIGHSTKNCTALKYRVQALIKA--------------------- 240
YPRWYDANARCDYHA IGHST+NCTALKYRVQAL+KA
Sbjct: 375 LYPRWYDANARCDYHAGAIGHSTENCTALKYRVQALLKAGWLNFKKENEPDVSKNPLSNH 434
Query: 241 -----------GIESKSKVVDITTPIEELSEILLGSGY---------------------- 300
GIESKSKV DI TP EEL EILLGSGY
Sbjct: 435 QNVQINAIECQGIESKSKVADIRTPKEELFEILLGSGYVSVEYLCPNLKYKEYDESLTCP 494
Query: 301 ------------------------------------KKGINVVEDVSVAEGSSDALKPKR 360
KKGINVVEDVSVAEGSSDALKPKR
Sbjct: 495 FHAGAKGHSLEQCNSFRMKVQELLDSKILTVANSHQKKGINVVEDVSVAEGSSDALKPKR 554
Query: 361 LTIFYREKPDAPSCSRKPITITVPAPFEYKSSKAVPWKYECKVTVGQE------------ 420
LTIFY EKPDAPSCS+KPITITVPAPFEYKSSKAVPWKY+CKVTVGQ+
Sbjct: 555 LTIFYSEKPDAPSCSQKPITITVPAPFEYKSSKAVPWKYQCKVTVGQDVSSPPLPVDNIT 614
Query: 421 ------------DVSFKQSVSEEETQEFLKLVKQ-------------------------- 480
++ +S E ++ ++L+ +
Sbjct: 615 EVGDLGSQAEAKEIKIGTHMSSESRKKLIELLHEYADVFAWSYQDMPGLDTDIVVHKLSI 674
Query: 481 -SEYKVIEQ------------LVDNLRNVV--GNITASSSITFTYEEIPPEGTRHTRALH 540
E+K + Q + D +R + G +T S+ + +P +
Sbjct: 675 NPEFKPVRQKLWKMRPDMLIKIKDEVRKQIDAGFLTVSNYPEWVANIVPVPKKNGQVRMC 734
Query: 541 ISVKCKNYLVAK---------VLVENGSSLNIMP-------RSTLEKLPVNMSHMRPSTV 600
+ + N K VLV+N + + + ++ P + T+
Sbjct: 735 VDYRDLNRASPKDNFPLPHIDVLVDNTAGFSTFSFMGGFSGXNXIKMAPEDREKTTFITL 794
Query: 601 ---------------IVRAFDGARSTVVGDI---EIPIQIG---AAFKGDNGSLDKLLRM 660
+ + A T+ D+ EI + + A K L ++
Sbjct: 795 WGTFCYKVMPFGLKNVGATYQRAMVTLFHDLMHKEIEVYVDDMIAKSKQGEEHTTILRKL 854
Query: 661 AKNTKKFGLGYKPSR-----------GDIIRVRNL----EKVKRLSRFENEERDYPRRTV 720
+KF L P++ G ++ + +KVK + +
Sbjct: 855 FDRLRKFKLKLNPNKCIFGATTGKLLGFVVSQEGIKVDPDKVKAILE------------M 914
Query: 721 PPLSLSFRSAGTIHQEYDESSVVAAVTEEREQVGPFVYPCPDGFKLSNWSINTEIECDND 780
PP G + + + ++ +T E + + DG WS + + +
Sbjct: 915 PPPQTQKEVRGFLGRLNYIARFISHLTATCEPIFKLLRKNNDGV----WSEDCQAAFNKI 974
Query: 781 SKYELDTPIYNIKSDEEIDDEPSAELLRMLEEEEKMLGPHEELTETINLGSQAEAKEIKI 840
+Y D PI + P L+ L E +G +
Sbjct: 975 KQYLQDPPIL-------VPPTPGRPLILYLTVTENSMG-------------------CVL 1034
Query: 841 GTHMSSESRKKLIELLHEKEQAIYYLSKKFTDCETRYSQVEKTCCALAWAARRLRQYMLY 900
G H S KEQAIYYLSKKFTDCETRYSQVEKTCCALAW ARRLRQYMLY
Sbjct: 1035 GQHDDS----------GRKEQAIYYLSKKFTDCETRYSQVEKTCCALAWVARRLRQYMLY 1094
Query: 901 YTTWLISKMNSIKYIFEKPSLSGRIARWQVLLSEYDIVYVTQKAIKGSVLADYLAQKPIN 960
YTTWLISKM+ IKYIFEKPSLSGRIARWQVLLSEYDIVYVTQKAIKGS LADYLAQ+PIN
Sbjct: 1095 YTTWLISKMDPIKYIFEKPSLSGRIARWQVLLSEYDIVYVTQKAIKGSALADYLAQQPIN 1154
Query: 961 DYVPIKFDFPDEYISTITASEESLDPQTWTMI---------------------------- 1020
DY+P+KFDFPDEYISTITASEESLDPQTWTM+
Sbjct: 1155 DYIPVKFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHEIGAILISPKGELYPLTT 1214
Query: 1021 -LCFDCSYNMAEYEACSMGVQAAIDMKVKKLKVFGDSMLVIHQLRGEWETRDAK------ 1032
LCFDC++NMAEYEACSMGVQAAIDMKVKK KVFGDS LVIHQLRGEWETRD K
Sbjct: 1215 KLCFDCTHNMAEYEACSMGVQAAIDMKVKKFKVFGDSTLVIHQLRGEWETRDVKLLPYKQ 1274
BLAST of Moc01g14690 vs. NCBI nr
Match:
XP_022150030.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111018303 [Momordica charantia])
HSP 1 Score: 1179.9 bits (3051), Expect = 0.0e+00
Identity = 714/1387 (51.48%), Postives = 748/1387 (53.93%), Query Frame = 0
Query: 244 SGYKKGINVVEDVSVAEGSSDALKPKRLTIFYREKPDAPSCSRKPITITVPAPFEYKSSK 303
S KK NVVED+ VAEGSSD++KPKRLTIFYREKPDAPSCSRKPITITVPAPFEYKSSK
Sbjct: 17 SHQKKRTNVVEDILVAEGSSDSIKPKRLTIFYREKPDAPSCSRKPITITVPAPFEYKSSK 76
Query: 304 AVPWKYECKVTVGQ---------------------------------------------- 363
AVPWKYECKVTVGQ
Sbjct: 77 AVPWKYECKVTVGQDVSSPSLPVDNITGVGGLTRTGRCYTPDSLLKRVNETTSEKNKEKA 136
Query: 364 ---------EDVSFK--------QSVSEEETQEFLKLVKQSEYKVIEQL----------- 423
ED K + EEETQEFLKLVKQ+EYKVIEQL
Sbjct: 137 SEKKKEKVEEDKKGKAKLHEDVHDELVEEETQEFLKLVKQNEYKVIEQLGRTPAKISILS 196
Query: 424 --------------------------VDNLRNVVGNITASSSITFTYEEIPPEGTRHTRA 483
VDNL NVVGNI ASS ITFT EEIPPEGT HT+A
Sbjct: 197 LLLSSEAHRNALLEALKQAFVSQDITVDNLSNVVGNIMASSCITFTDEEIPPEGTGHTKA 256
Query: 484 LHISVKCKNYLVAKVLVENGSSLNIMPRSTLEKLPVNMSHMRPSTVIVRAFDGARSTVVG 543
LHISVKCKN+L+AKVLV NGSSLNIMPRSTLEKLPV+MSHMRPSTVIVRAFDGAR+ VVG
Sbjct: 257 LHISVKCKNFLIAKVLVGNGSSLNIMPRSTLEKLPVDMSHMRPSTVIVRAFDGARNAVVG 316
Query: 544 DIEIPIQIG--------------------------------------------------- 603
DIEIPIQIG
Sbjct: 317 DIEIPIQIGLCTFDITFQVMDITSAYSFLLGRPWIHSAGAVPSTLHQKIKFAVDQKLVII 376
Query: 604 ----------------------------------------------------AAFKGDNG 663
AAFK +NG
Sbjct: 377 SGQEDILVSRLASMPYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLEAAFKVNNG 436
Query: 664 SLDKLLRMAKNTKKFGLGYKPSRGDIIRVRNLEKVKRLSRFENEERDYPRRTVPPLSLSF 723
SLDKLLRMAKNT++FGLGYKP+RGDIIRVR++EK KRLSRFEN ERDY RRTVPPLS S
Sbjct: 437 SLDKLLRMAKNTRRFGLGYKPNRGDIIRVRSMEKAKRLSRFENGERDYSRRTVPPLSHSL 496
Query: 724 RSAGTIHQEYDESSVVAAVTEEREQVGPFVYPCPDGFKLSNWSINTEIECDNDSKYELDT 783
RSAGTIHQEYDESSV AAVTEEREQV PFVYPCPDGFKLSNWS+NTEIECDNDSKYELDT
Sbjct: 497 RSAGTIHQEYDESSVAAAVTEEREQVEPFVYPCPDGFKLSNWSVNTEIECDNDSKYELDT 556
Query: 784 PIYNIKSDEEIDDEPSAELLRMLEEEEKMLGPHEELTETINLGSQAEAKEIKIGTHMSSE 843
PIYNI+SDEEIDDEPSAELLRMLEEEEKMLGPHEELTET+NLGSQAEAKEIKIGTHMSSE
Sbjct: 557 PIYNIESDEEIDDEPSAELLRMLEEEEKMLGPHEELTETLNLGSQAEAKEIKIGTHMSSE 616
Query: 844 SRKKLIELLHE------------------------------------------------- 903
SRKKLIELLHE
Sbjct: 617 SRKKLIELLHEYADVFAWSYQDMPGLDTDIVVHKLPTNPKFKPVRQKLRKMRPDMLIKIK 676
Query: 904 ------------------------------------------------------------ 963
Sbjct: 677 DEVRKQIDAGFLTVSNYPEWVANIVPVPKKNGQVRMCVDYRDLNRASPKDNFPLPHIDVL 736
Query: 964 ------------------------------------------------------------ 1023
Sbjct: 737 VDNTAWFSTFSFMDGFSGYNQIKMALEDREKTTFITLWGTFCYKVMSFGLKNAGATYQRA 796
Query: 1024 ------------------------------------------------------------ 1032
Sbjct: 797 MVTLFHDLMHKEIEVYVDDMIAKSKQGEEHTTILRKLFDRLRKFKLKLNPNKCIFGATTR 856
BLAST of Moc01g14690 vs. ExPASy TrEMBL
Match:
A0A6J1CNY7 (Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111013372 PE=4 SV=1)
HSP 1 Score: 1577.4 bits (4083), Expect = 0.0e+00
Identity = 941/1755 (53.62%), Postives = 978/1755 (55.73%), Query Frame = 0
Query: 1 MYCRKIAAYVQNDKLLIHCFQDSLSSSASRWYMQLESSHVGSWKNLADSFLKQYKHNIDM 60
MYCRK+AAYVQNDKLLIHCFQDSLSS ASRWYMQL+SSHVGSWKNLADSFLKQYKHNIDM
Sbjct: 137 MYCRKMAAYVQNDKLLIHCFQDSLSSPASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDM 196
Query: 61 APDRLDLQRMEKKSIESFKEYAQRWRDTAAQVQPPLTDKKLSAMFINTLKHPFYDRMIGS 120
APDRLDLQRMEKKS ESFKEYAQRWRDTAAQVQPPLTDK+LS MFINTLKHPFYDRM+GS
Sbjct: 197 APDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLTDKELSXMFINTLKHPFYDRMVGS 256
Query: 121 ASTNFSDIMTIGERIEYGVKHGRITSTAEEPLAAKNASNSKKKEGEVQMLTPVHVDPIQP 180
ASTNFSDIM IGERIEYGV+HGRITSTA+EPLAAK S+SKKKEGE L V VDPIQP
Sbjct: 257 ASTNFSDIMAIGERIEYGVRHGRITSTADEPLAAKKTSHSKKKEGE---LAHVPVDPIQP 316
Query: 181 PYPRWYDANARCDYHARVIGHSTKNCTALKYRVQALIKAG-------------------- 240
PYPRW DANARCDYH IGHS +NCTALKYRVQALIKAG
Sbjct: 317 PYPRWCDANARCDYHTGAIGHSIENCTALKYRVQALIKAGWLNFKKENGPDVSNNPLPNH 376
Query: 241 ------------IESKSKVVDITTPIEELSEILLGSGY---------------------- 300
IESKSKV DITTP+EEL EILLGSGY
Sbjct: 377 XNVQINAIECQEIESKSKVADITTPMEELFEILLGSGYVSVEYLCPNLKYKGYDESLTCP 436
Query: 301 ------------------------------------KKGINVVEDVSV-----AEGSSDA 360
KKGINVVEDVSV AEGSSDA
Sbjct: 437 FHAGAKGHALEQCNSFRMIVQELLDSKILTVANSHQKKGINVVEDVSVAEGSIAEGSSDA 496
Query: 361 LKPKRLTIFYREKPDAPSCSRKPITITVPAPFEYKSSKAVPWKYECKVTVGQE------- 420
LKPKRLTIFY EKPDAP+CSRKPITITVPAPFEYKSSKAVPWKYECKVTVGQ+
Sbjct: 497 LKPKRLTIFYSEKPDAPNCSRKPITITVPAPFEYKSSKAVPWKYECKVTVGQDVSSPPLP 556
Query: 421 ------------------------------------------------------------ 480
Sbjct: 557 VDNITGVGGLTXTGRCYTPDSLLKRVSETTSEKNKEKASEKKKEKVEEDKKGKAKLHEDV 616
Query: 481 -----------DVSFKQSVSEEETQEFLKLVKQSEYKVIEQL------------------ 540
DVS KQ V EEE QEFLKLVKQSEYKV EQL
Sbjct: 617 HDELVEAIVVKDVSPKQHVFEEEIQEFLKLVKQSEYKVTEQLGRTPAKISILSLLLSSEA 676
Query: 541 -------------------VDNLRNVVGNITASSSITFTYEEIPPEGTRHTRALHISVKC 600
VDNL NVVGNITASSSITFT EEIPPEGT HT+ALHISVKC
Sbjct: 677 HRNTLLEXLKQAFVSQDITVDNLSNVVGNITASSSITFTDEEIPPEGTGHTKALHISVKC 736
Query: 601 KNYLVAKVLVENGSSLNIMPRSTLEKLPVNMSHMRPSTVIVRAFDGARSTVVGDIEIPIQ 660
KN+L+AKVLV+NGSSLNIMPRSTLEKLPV+MSHMRPSTVIVRAFDGARS VVGDIEIPIQ
Sbjct: 737 KNFLIAKVLVDNGSSLNIMPRSTLEKLPVDMSHMRPSTVIVRAFDGARSAVVGDIEIPIQ 796
Query: 661 IG---------------------------------------------------------- 720
IG
Sbjct: 797 IGPCTFDITFQVMDITSTYSFLLGRLWIHSAGAVPSTLHQKIKFAVDQKLVIISGQEDIL 856
Query: 721 ---------------------------------------------AAFKGDNGSLDKLLR 780
AFKGDN SLDKLLR
Sbjct: 857 VSRLASMPYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGDNESLDKLLR 916
Query: 781 MAKNTKKFGLGYKPSRGDIIRVRNLEKVKRLSRFENEERDYPRRTVPPLSLSFRSAGTIH 840
MAKNTKKFGLGYKPSRGDIIRVR+LEK KRLSRFENEERDYPRRTVPPLS SFRSAGTIH
Sbjct: 917 MAKNTKKFGLGYKPSRGDIIRVRSLEKAKRLSRFENEERDYPRRTVPPLSHSFRSAGTIH 976
Query: 841 QEYDESSVVAAVTEEREQVGPFVYPCPDGFKLSNWSINTEIECDNDSKYELDTPIYNIKS 900
QEYD SSVVAAVTEEREQV PFVYPCPDGF+LSNWS+NTEIECDNDSKYELDTPIYNI+S
Sbjct: 977 QEYDGSSVVAAVTEEREQVRPFVYPCPDGFELSNWSVNTEIECDNDSKYELDTPIYNIES 1036
Query: 901 DEEIDDEPSAELLRMLEEEEKMLGPHEELTETINLGSQAEAKEIKIGTHMSSESRKKLIE 960
D+EIDDEPSAELLRMLEEEEKMLGPHEELTET+NLGSQAEAKEIKIGTHMSSESRKKLIE
Sbjct: 1037 DKEIDDEPSAELLRMLEEEEKMLGPHEELTETLNLGSQAEAKEIKIGTHMSSESRKKLIE 1096
Query: 961 LLHE-------------------------------------------------------- 1020
LLHE
Sbjct: 1097 LLHEYADVFAWSYQDMPGLDTDIVVHKLQINPKFKPVRQKLRKMRPDMLIKIKDEVRKQI 1156
Query: 1021 ------------------------------------------------------------ 1032
Sbjct: 1157 DAGFLTISNYPEWVANIVPVPKKNGQVRMCVDYRDLNRASPKDNFPLPHIDVLVDNTAGF 1216
BLAST of Moc01g14690 vs. ExPASy TrEMBL
Match:
A0A6J1D099 (Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111016200 PE=4 SV=1)
HSP 1 Score: 1550.8 bits (4014), Expect = 0.0e+00
Identity = 949/1873 (50.67%), Postives = 985/1873 (52.59%), Query Frame = 0
Query: 1 MYCRKIAAYVQNDKLLIHCFQDSLSSSASRWYMQLESSHVGSWKNLADSFLKQYKHNIDM 60
MYCRK+AAYVQNDKLLIHCFQDSLS ASRWYMQL+SSHVGSWKNLADSFLKQYKHNIDM
Sbjct: 126 MYCRKMAAYVQNDKLLIHCFQDSLSGPASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDM 185
Query: 61 APDRLDLQRMEKKSIESFKEYAQRWRDTAAQVQPPLTDKKLSAMFINTLKHPFYDRMIGS 120
APDRLDLQRMEKKS +SFKEYAQRWRDTAAQVQPPL DK+LSAMFINTLKHPFYDRMIGS
Sbjct: 186 APDRLDLQRMEKKSTKSFKEYAQRWRDTAAQVQPPLIDKELSAMFINTLKHPFYDRMIGS 245
Query: 121 ASTNFSDIMTIGERIEYGVKHGRITSTAEEPLAAKNASNSKKKEGEVQM----------- 180
ASTNFSDIMTIGERIEYGV+HGRITST +EPLAAK AS+SKKKEGEVQM
Sbjct: 246 ASTNFSDIMTIGERIEYGVRHGRITSTTDEPLAAKKASHSKKKEGEVQMVGADRHSWKQQ 305
Query: 181 ------------------------------------------------------------ 240
Sbjct: 306 PYRRTPQYSPYYYPTPYGYNQPFVNNATSHYYPYASQNFRPPASQNFQLTPTSQNFQPRG 365
Query: 241 ------------------------------------------LTPVHVDPIQPPYPRWYD 300
L PV VDPIQPPYPRWYD
Sbjct: 366 QQHNTFYTQGQQNNRGARKQTQFDPIPMTYTELLPQLFQNNQLAPVPVDPIQPPYPRWYD 425
Query: 301 ANARCDYHARVIGHSTKNCTALKYRVQALIKA---------------------------- 360
ANARCDYHA I HST+NCT LKYRVQALIKA
Sbjct: 426 ANARCDYHAGAIXHSTENCTXLKYRVQALIKAGWXNFKKENGXDVSKXXLXNHQNVQINA 485
Query: 361 ----GIESKSKVVDITTPIEELSEILLGSGY----------------------------- 420
GIESKSKV DITTP+ EL EILLGSGY
Sbjct: 486 IECQGIESKSKVADITTPMXELFEILLGSGYISVEYLCPKYKGYDESLTCXFHXGAKGHS 545
Query: 421 ---------------------------KKGINVVEDVSVAEGSSDALKPKRLTIFYREKP 480
KK NVVED+ VAEGSSD+LKPK LTIFYREKP
Sbjct: 546 LEQCNXFRMKVQELLDSKILTXANSHXKKXTNVVEDILVAEGSSDSLKPKPLTIFYREKP 605
Query: 481 DAPSCSRKPITITVPAPFEYKSSKAVPWKYECKVTVGQE--------------------- 540
DAPSCSRKP ITVP PFEYKSSKAVPWKYECKVTVGQ+
Sbjct: 606 DAPSCSRKPXXITVPXPFEYKSSKAVPWKYECKVTVGQDVSSPSLPVDNITGVGGLTRTG 665
Query: 541 ---------------------------------------------------------DVS 600
DVS
Sbjct: 666 RCYTPDSLLKRVNETTSEKNKEKASEKKKEKVEEDKKGKAKLHEDARDELVEAIVVKDVS 725
Query: 601 FKQSVSEEETQEFLKLVKQSEYKVIEQL-------------------------------- 660
KQ +SEEETQEFLKLVKQSEYKVIEQL
Sbjct: 726 PKQPMSEEETQEFLKLVKQSEYKVIEQLGRTPANISILSLLLSSEAHQNALLEALKQAFV 785
Query: 661 -----VDNLRNVVGNITASSSITFTYEEIPPEGTRHTRALHISVKCKNYLVAKVLVENGS 720
VDNL NVVGNITASSSI+FT EEIPPEGT HT+ALHISVKCKN+L+AKVLV+NGS
Sbjct: 786 SQDITVDNLSNVVGNITASSSISFTDEEIPPEGTGHTKALHISVKCKNFLIAKVLVDNGS 845
Query: 721 SLNIMPRSTLEKLPVNMSHMRPSTVIVRAFDGARSTVVGDIEIPIQIG------------ 780
SLNIMPRSTLEKLPV+MSHMRPSTVIVRAFDGARS VVGDIEIPIQIG
Sbjct: 846 SLNIMPRSTLEKLPVDMSHMRPSTVIVRAFDGARSAVVGDIEIPIQIGPCTFDITFQVMD 905
Query: 781 ------------------------------------------------------------ 840
Sbjct: 906 ITSAYSFLLGRPWIHSAGAVPSTLHQKIKFAVDQKLVIISGQEDILVSRFASMSYVEVAE 965
Query: 841 -------------------------------AAFKGDNGSLDKLLRMAKNTKKFGLGYKP 900
AFKGDNGSLDKLLRMAKNTKKFGLGYKP
Sbjct: 966 EAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGDNGSLDKLLRMAKNTKKFGLGYKP 1025
Query: 901 SRGDIIRVRNLEKVKRLSRFENEERDYPRRTVPPLSLSFRSAGTIHQEYDESSVVAAVTE 960
SRGDIIRVR+LEK KRLSRFENEERDYPRR VPPL+ SFRSAGTIHQEYDESSVVAAVTE
Sbjct: 1026 SRGDIIRVRSLEKAKRLSRFENEERDYPRRIVPPLTHSFRSAGTIHQEYDESSVVAAVTE 1085
Query: 961 EREQVGPFVYPCPDGFKLSNWSI------------NTEIECDNDSKYELDTPIYNIKSDE 1020
EREQVGPFVY CPDGF+LSNWS+ NTEIECDNDSKYELDTPIY I+SDE
Sbjct: 1086 EREQVGPFVYLCPDGFELSNWSVIKLPSFVNNKSNNTEIECDNDSKYELDTPIYIIESDE 1145
Query: 1021 EIDDEPSAELLRMLEEEEKMLGPHEELTETINLGSQAEAKEIKIGTHMSSESRKKLIELL 1032
EIDDEPSAELLRMLEEEEKMLGPHEELTET+NLGSQAEAKEIKIGTHMSSESRKKLIELL
Sbjct: 1146 EIDDEPSAELLRMLEEEEKMLGPHEELTETLNLGSQAEAKEIKIGTHMSSESRKKLIELL 1205
BLAST of Moc01g14690 vs. ExPASy TrEMBL
Match:
A0A6J1E2J7 (Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111025431 PE=4 SV=1)
HSP 1 Score: 1310.4 bits (3390), Expect = 0.0e+00
Identity = 791/1529 (51.73%), Postives = 878/1529 (57.42%), Query Frame = 0
Query: 1 MYCRKIAAYVQNDKLLIHCFQDSLSSSASRWYMQLESSHVGSWKNLADSFLKQYKHNIDM 60
MYCRK+AAYVQNDKLLIHCFQDSLS ASRWYMQL+SS+VGSWKNLADSFLKQYKHNIDM
Sbjct: 86 MYCRKMAAYVQNDKLLIHCFQDSLSGPASRWYMQLDSSNVGSWKNLADSFLKQYKHNIDM 145
Query: 61 APDRLDLQRMEKKSIESFKEYAQRWRDTAAQVQPPLTDKKLSAMFINTLKHPFYDRMIGS 120
APDRLDLQRMEKKS ESFKEYAQRWRDTAAQVQPPLTDK+LSAMFINTLKHPFYDRMIG+
Sbjct: 146 APDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLTDKELSAMFINTLKHPFYDRMIGN 205
Query: 121 ASTNFSDIMTIGERIEYGVKHGRITSTAEEPLAAKNASNSKKKEGEVQM----------- 180
ASTNFSDIMTIGERIEYGV+HGRITST +EPLAAK AS+SKKKEGEVQM
Sbjct: 206 ASTNFSDIMTIGERIEYGVRHGRITSTVDEPLAAKKASHSKKKEGEVQMVGADRHSWKQQ 265
Query: 181 ------------------------------------------------------------ 240
Sbjct: 266 PYSRTPRYTPYYYPTPYGYNQPFVNNATSHYSPYTFQNFRPPASQNFQPTPASQNFQPRG 325
Query: 241 ------------------------------------------LTPVHVDPIQPPYPRWYD 300
L PV VDPIQPPYPRWYD
Sbjct: 326 QQHNTLYTQEQQTNRGARKQTQFDPIPMTYTELLPQLFQNNQLAPVPVDPIQPPYPRWYD 385
Query: 301 ANARCDYHARVIGHSTKNCTALKYRVQALIKAG--------------------------- 360
NARCDYHA IGHST+NCTALKYRVQALIKAG
Sbjct: 386 TNARCDYHAGAIGHSTENCTALKYRVQALIKAGWLNFKKENGPDVSKNPLPNHQNVQINA 445
Query: 361 -----IESKSKVVDITTPIEELSEILLGSGY----------------------------- 420
IESKSKV DI TP+ EL EILLGSGY
Sbjct: 446 IECQEIESKSKVADIRTPMVELFEILLGSGYVSVEYLCPNLKYKGYDESLTCPFHAGAKG 505
Query: 421 -----------------------------KKGINVVEDVSVAEGSSDALKPKRLTIFYRE 480
KKGIN+VEDVSVAEGSSDALKPK LTIFY E
Sbjct: 506 HSLEQCNSFRMKVQELLDSKILTVANSHQKKGINIVEDVSVAEGSSDALKPKCLTIFYSE 565
Query: 481 KPDAPSCSRKPITITVPAPFEYKSSKAVPWKYECKVTVGQE------------------- 540
KP+AP+CSRKPITITVPAPFEYKSSKAVPWKY+CKVTVGQ+
Sbjct: 566 KPNAPNCSRKPITITVPAPFEYKSSKAVPWKYQCKVTVGQDVSSPPLPIDNITGVGGLTR 625
Query: 541 -----------------------------------------------------------D 600
D
Sbjct: 626 TGRCYTPDSLLKCVNETTSEKNKEKASEKKKEKVEEDKKGKAKLHEDVHDELVEAIVVKD 685
Query: 601 VSFKQSVSEEETQEFLKLVKQSEYKVIEQL------------------------------ 660
VS KQ +SEEETQE LKLVKQSEYKVIEQL
Sbjct: 686 VSPKQPMSEEETQEILKLVKQSEYKVIEQLGRTPAKISILSLLLSSEAHRNALLEALKQA 745
Query: 661 -------VDNLRNVVGNITASSSITFTYEEIPPEGTRHTRALHISVKCKNYLVAKVLVEN 720
VDNL NVVGNI+ +SSITFT EEIPPEGT HT+ALHIS+KCKN+L+AKVLV+N
Sbjct: 746 FVSQDITVDNLSNVVGNISXASSITFTDEEIPPEGTGHTKALHISIKCKNFLIAKVLVDN 805
Query: 721 GSSLNIMPRSTLEKLPVNMSHMRPSTVIVRAFDGARSTVVGDIEIPIQIGAAFKGDNGSL 780
GSSLNIMPRSTLEKLPV+MSHMRPSTVIVRAFDGARS VVGDIEIPIQIG +
Sbjct: 806 GSSLNIMPRSTLEKLPVDMSHMRPSTVIVRAFDGARSAVVGDIEIPIQIGPC------TF 865
Query: 781 DKLLRMAKNTKKF----GLGYKPSRGDIIRVRNLEKVKRLSRFENEERDYPRR------T 840
D ++ T + G + S G + + +K+K + RD R +
Sbjct: 866 DITFQVMDITSAYSFLLGRPWIHSAGAVPSTLH-QKIKFAVDQNVDYRDLNRASPKDNFS 925
Query: 841 VPPLSL-----SFRSAGTIHQEYDESSVVAAVTEEREQV------GPFVYP-CPDGFKLS 900
+P + + + S + + + + E+RE+ G F Y P G K +
Sbjct: 926 LPHIDVLVDNTTGFSTFSFMDGFSGYNQIKMAPEDREKTTLITLWGTFCYKVMPFGLKNA 985
Query: 901 NWSINTEIE------CDNDSKYELDTPIYNIKSDEE------------------------ 960
+ + + + +D I K EE
Sbjct: 986 GATYQRAMVTLFHDLMHKEIEVYVDDMIAKSKQGEEHTTILRKLFDRLRKFKLKLNPNKC 1045
Query: 961 IDDEPSAELLRMLEEEEKMLGPHEELTETINLGSQAEAKEI-----------KIGTHMSS 1020
I + +LL + +E + +++ + + KE+ ++ +H+++
Sbjct: 1046 IFGATTGKLLGFVVSQEDIKVDLDKVKAILEMPPPQTQKEVREFLGRLNYIARLISHLTA 1105
Query: 1021 ESR----------------------KKLIELLHE-------------------------- 1032
K+ + L +
Sbjct: 1106 TCEPIFKLLRKNNDGVWSEDCQAAFDKIKQYLQDPPILVPPTPGRPLILYLTVTENSMGC 1165
BLAST of Moc01g14690 vs. ExPASy TrEMBL
Match:
A0A6J1DZ90 (Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111024415 PE=4 SV=1)
HSP 1 Score: 1182.9 bits (3059), Expect = 0.0e+00
Identity = 697/1297 (53.74%), Postives = 783/1297 (60.37%), Query Frame = 0
Query: 1 MYCRKIAAYVQNDKLLIHCFQDSLSSSASRWYMQLESSHVGSWKNLADSFLKQYKHNIDM 60
MYCRK+ AYVQN KLLIHCFQDSL ASRWYMQL+SSHVGSWKNLADSFLKQYKHNIDM
Sbjct: 195 MYCRKMXAYVQNXKLLIHCFQDSLXGXASRWYMQLDSSHVGSWKNLADSFLKQYKHNIDM 254
Query: 61 APDRLDLQRMEKKSIESFKEYAQRWRDTAAQVQPPLTDKKLSAMFINTLKHPFYDRMIGS 120
APDRLDLQRMEK S ESFKEYAQRWRDTAAQVQPPLTDK+LSAMFINTLKHPFYDRMIGS
Sbjct: 255 APDRLDLQRMEKNSTESFKEYAQRWRDTAAQVQPPLTDKELSAMFINTLKHPFYDRMIGS 314
Query: 121 ASTNFSDIMTIGERIEYGVKHGRITSTAEEPLAAKNASNSKKKEGEVQMLTPVHVDPIQP 180
ASTNFSDIMTIGERIEYGV+H RITSTA+EPLAAK AS+SKKKEGE LTPV VDPIQP
Sbjct: 315 ASTNFSDIMTIGERIEYGVRHKRITSTADEPLAAKKASHSKKKEGE---LTPVPVDPIQP 374
Query: 181 PYPRWYDANARCDYHARVIGHSTKNCTALKYRVQALIKA--------------------- 240
YPRWYDANARCDYHA IGHST+NCTALKYRVQAL+KA
Sbjct: 375 LYPRWYDANARCDYHAGAIGHSTENCTALKYRVQALLKAGWLNFKKENEPDVSKNPLSNH 434
Query: 241 -----------GIESKSKVVDITTPIEELSEILLGSGY---------------------- 300
GIESKSKV DI TP EEL EILLGSGY
Sbjct: 435 QNVQINAIECQGIESKSKVADIRTPKEELFEILLGSGYVSVEYLCPNLKYKEYDESLTCP 494
Query: 301 ------------------------------------KKGINVVEDVSVAEGSSDALKPKR 360
KKGINVVEDVSVAEGSSDALKPKR
Sbjct: 495 FHAGAKGHSLEQCNSFRMKVQELLDSKILTVANSHQKKGINVVEDVSVAEGSSDALKPKR 554
Query: 361 LTIFYREKPDAPSCSRKPITITVPAPFEYKSSKAVPWKYECKVTVGQE------------ 420
LTIFY EKPDAPSCS+KPITITVPAPFEYKSSKAVPWKY+CKVTVGQ+
Sbjct: 555 LTIFYSEKPDAPSCSQKPITITVPAPFEYKSSKAVPWKYQCKVTVGQDVSSPPLPVDNIT 614
Query: 421 ------------DVSFKQSVSEEETQEFLKLVKQ-------------------------- 480
++ +S E ++ ++L+ +
Sbjct: 615 EVGDLGSQAEAKEIKIGTHMSSESRKKLIELLHEYADVFAWSYQDMPGLDTDIVVHKLSI 674
Query: 481 -SEYKVIEQ------------LVDNLRNVV--GNITASSSITFTYEEIPPEGTRHTRALH 540
E+K + Q + D +R + G +T S+ + +P +
Sbjct: 675 NPEFKPVRQKLWKMRPDMLIKIKDEVRKQIDAGFLTVSNYPEWVANIVPVPKKNGQVRMC 734
Query: 541 ISVKCKNYLVAK---------VLVENGSSLNIMP-------RSTLEKLPVNMSHMRPSTV 600
+ + N K VLV+N + + + ++ P + T+
Sbjct: 735 VDYRDLNRASPKDNFPLPHIDVLVDNTAGFSTFSFMGGFSGXNXIKMAPEDREKTTFITL 794
Query: 601 ---------------IVRAFDGARSTVVGDI---EIPIQIG---AAFKGDNGSLDKLLRM 660
+ + A T+ D+ EI + + A K L ++
Sbjct: 795 WGTFCYKVMPFGLKNVGATYQRAMVTLFHDLMHKEIEVYVDDMIAKSKQGEEHTTILRKL 854
Query: 661 AKNTKKFGLGYKPSR-----------GDIIRVRNL----EKVKRLSRFENEERDYPRRTV 720
+KF L P++ G ++ + +KVK + +
Sbjct: 855 FDRLRKFKLKLNPNKCIFGATTGKLLGFVVSQEGIKVDPDKVKAILE------------M 914
Query: 721 PPLSLSFRSAGTIHQEYDESSVVAAVTEEREQVGPFVYPCPDGFKLSNWSINTEIECDND 780
PP G + + + ++ +T E + + DG WS + + +
Sbjct: 915 PPPQTQKEVRGFLGRLNYIARFISHLTATCEPIFKLLRKNNDGV----WSEDCQAAFNKI 974
Query: 781 SKYELDTPIYNIKSDEEIDDEPSAELLRMLEEEEKMLGPHEELTETINLGSQAEAKEIKI 840
+Y D PI + P L+ L E +G +
Sbjct: 975 KQYLQDPPIL-------VPPTPGRPLILYLTVTENSMG-------------------CVL 1034
Query: 841 GTHMSSESRKKLIELLHEKEQAIYYLSKKFTDCETRYSQVEKTCCALAWAARRLRQYMLY 900
G H S KEQAIYYLSKKFTDCETRYSQVEKTCCALAW ARRLRQYMLY
Sbjct: 1035 GQHDDS----------GRKEQAIYYLSKKFTDCETRYSQVEKTCCALAWVARRLRQYMLY 1094
Query: 901 YTTWLISKMNSIKYIFEKPSLSGRIARWQVLLSEYDIVYVTQKAIKGSVLADYLAQKPIN 960
YTTWLISKM+ IKYIFEKPSLSGRIARWQVLLSEYDIVYVTQKAIKGS LADYLAQ+PIN
Sbjct: 1095 YTTWLISKMDPIKYIFEKPSLSGRIARWQVLLSEYDIVYVTQKAIKGSALADYLAQQPIN 1154
Query: 961 DYVPIKFDFPDEYISTITASEESLDPQTWTMI---------------------------- 1020
DY+P+KFDFPDEYISTITASEESLDPQTWTM+
Sbjct: 1155 DYIPVKFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHEIGAILISPKGELYPLTT 1214
Query: 1021 -LCFDCSYNMAEYEACSMGVQAAIDMKVKKLKVFGDSMLVIHQLRGEWETRDAK------ 1032
LCFDC++NMAEYEACSMGVQAAIDMKVKK KVFGDS LVIHQLRGEWETRD K
Sbjct: 1215 KLCFDCTHNMAEYEACSMGVQAAIDMKVKKFKVFGDSTLVIHQLRGEWETRDVKLLPYKQ 1274
BLAST of Moc01g14690 vs. ExPASy TrEMBL
Match:
A0A6J1D7C7 (Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111018303 PE=4 SV=1)
HSP 1 Score: 1179.9 bits (3051), Expect = 0.0e+00
Identity = 714/1387 (51.48%), Postives = 748/1387 (53.93%), Query Frame = 0
Query: 244 SGYKKGINVVEDVSVAEGSSDALKPKRLTIFYREKPDAPSCSRKPITITVPAPFEYKSSK 303
S KK NVVED+ VAEGSSD++KPKRLTIFYREKPDAPSCSRKPITITVPAPFEYKSSK
Sbjct: 17 SHQKKRTNVVEDILVAEGSSDSIKPKRLTIFYREKPDAPSCSRKPITITVPAPFEYKSSK 76
Query: 304 AVPWKYECKVTVGQ---------------------------------------------- 363
AVPWKYECKVTVGQ
Sbjct: 77 AVPWKYECKVTVGQDVSSPSLPVDNITGVGGLTRTGRCYTPDSLLKRVNETTSEKNKEKA 136
Query: 364 ---------EDVSFK--------QSVSEEETQEFLKLVKQSEYKVIEQL----------- 423
ED K + EEETQEFLKLVKQ+EYKVIEQL
Sbjct: 137 SEKKKEKVEEDKKGKAKLHEDVHDELVEEETQEFLKLVKQNEYKVIEQLGRTPAKISILS 196
Query: 424 --------------------------VDNLRNVVGNITASSSITFTYEEIPPEGTRHTRA 483
VDNL NVVGNI ASS ITFT EEIPPEGT HT+A
Sbjct: 197 LLLSSEAHRNALLEALKQAFVSQDITVDNLSNVVGNIMASSCITFTDEEIPPEGTGHTKA 256
Query: 484 LHISVKCKNYLVAKVLVENGSSLNIMPRSTLEKLPVNMSHMRPSTVIVRAFDGARSTVVG 543
LHISVKCKN+L+AKVLV NGSSLNIMPRSTLEKLPV+MSHMRPSTVIVRAFDGAR+ VVG
Sbjct: 257 LHISVKCKNFLIAKVLVGNGSSLNIMPRSTLEKLPVDMSHMRPSTVIVRAFDGARNAVVG 316
Query: 544 DIEIPIQIG--------------------------------------------------- 603
DIEIPIQIG
Sbjct: 317 DIEIPIQIGLCTFDITFQVMDITSAYSFLLGRPWIHSAGAVPSTLHQKIKFAVDQKLVII 376
Query: 604 ----------------------------------------------------AAFKGDNG 663
AAFK +NG
Sbjct: 377 SGQEDILVSRLASMPYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLEAAFKVNNG 436
Query: 664 SLDKLLRMAKNTKKFGLGYKPSRGDIIRVRNLEKVKRLSRFENEERDYPRRTVPPLSLSF 723
SLDKLLRMAKNT++FGLGYKP+RGDIIRVR++EK KRLSRFEN ERDY RRTVPPLS S
Sbjct: 437 SLDKLLRMAKNTRRFGLGYKPNRGDIIRVRSMEKAKRLSRFENGERDYSRRTVPPLSHSL 496
Query: 724 RSAGTIHQEYDESSVVAAVTEEREQVGPFVYPCPDGFKLSNWSINTEIECDNDSKYELDT 783
RSAGTIHQEYDESSV AAVTEEREQV PFVYPCPDGFKLSNWS+NTEIECDNDSKYELDT
Sbjct: 497 RSAGTIHQEYDESSVAAAVTEEREQVEPFVYPCPDGFKLSNWSVNTEIECDNDSKYELDT 556
Query: 784 PIYNIKSDEEIDDEPSAELLRMLEEEEKMLGPHEELTETINLGSQAEAKEIKIGTHMSSE 843
PIYNI+SDEEIDDEPSAELLRMLEEEEKMLGPHEELTET+NLGSQAEAKEIKIGTHMSSE
Sbjct: 557 PIYNIESDEEIDDEPSAELLRMLEEEEKMLGPHEELTETLNLGSQAEAKEIKIGTHMSSE 616
Query: 844 SRKKLIELLHE------------------------------------------------- 903
SRKKLIELLHE
Sbjct: 617 SRKKLIELLHEYADVFAWSYQDMPGLDTDIVVHKLPTNPKFKPVRQKLRKMRPDMLIKIK 676
Query: 904 ------------------------------------------------------------ 963
Sbjct: 677 DEVRKQIDAGFLTVSNYPEWVANIVPVPKKNGQVRMCVDYRDLNRASPKDNFPLPHIDVL 736
Query: 964 ------------------------------------------------------------ 1023
Sbjct: 737 VDNTAWFSTFSFMDGFSGYNQIKMALEDREKTTFITLWGTFCYKVMSFGLKNAGATYQRA 796
Query: 1024 ------------------------------------------------------------ 1032
Sbjct: 797 MVTLFHDLMHKEIEVYVDDMIAKSKQGEEHTTILRKLFDRLRKFKLKLNPNKCIFGATTR 856
BLAST of Moc01g14690 vs. TAIR 10
Match:
AT3G01410.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )
HSP 1 Score: 47.4 bits (111), Expect = 8.2e-05
Identity = 21/46 (45.65%), Postives = 31/46 (67.39%), Query Frame = 0
Query: 793 NMAEYEACSMGVQAAIDMKVKKLKVFGDSMLVIHQLRGEWETRDAK 839
N+AEY A +G+++A+D K + V GDSMLV Q++G W+T K
Sbjct: 197 NVAEYRALLLGLRSALDKGFKNVHVLGDSMLVCMQVQGAWKTNHPK 242
BLAST of Moc01g14690 vs. TAIR 10
Match:
AT3G01410.2 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )
HSP 1 Score: 47.4 bits (111), Expect = 8.2e-05
Identity = 21/46 (45.65%), Postives = 31/46 (67.39%), Query Frame = 0
Query: 793 NMAEYEACSMGVQAAIDMKVKKLKVFGDSMLVIHQLRGEWETRDAK 839
N+AEY A +G+++A+D K + V GDSMLV Q++G W+T K
Sbjct: 197 NVAEYRALLLGLRSALDKGFKNVHVLGDSMLVCMQVQGAWKTNHPK 242
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022143495.1 | 0.0e+00 | 53.62 | LOW QUALITY PROTEIN: uncharacterized protein LOC111013372 [Momordica charantia] | [more] |
XP_022147189.1 | 0.0e+00 | 50.61 | LOW QUALITY PROTEIN: uncharacterized protein LOC111016200 [Momordica charantia] | [more] |
XP_022158986.1 | 0.0e+00 | 51.73 | LOW QUALITY PROTEIN: uncharacterized protein LOC111025431 [Momordica charantia] | [more] |
XP_022157796.1 | 0.0e+00 | 53.74 | LOW QUALITY PROTEIN: uncharacterized protein LOC111024415 [Momordica charantia] | [more] |
XP_022150030.1 | 0.0e+00 | 51.48 | LOW QUALITY PROTEIN: uncharacterized protein LOC111018303 [Momordica charantia] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1CNY7 | 0.0e+00 | 53.62 | Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111013372 PE=4 SV=1 | [more] |
A0A6J1D099 | 0.0e+00 | 50.67 | Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111016200 PE=4 SV=1 | [more] |
A0A6J1E2J7 | 0.0e+00 | 51.73 | Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111025431 PE=4 SV=1 | [more] |
A0A6J1DZ90 | 0.0e+00 | 53.74 | Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111024415 PE=4 SV=1 | [more] |
A0A6J1D7C7 | 0.0e+00 | 51.48 | Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111018303 PE=4 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
AT3G01410.1 | 8.2e-05 | 45.65 | Polynucleotidyl transferase, ribonuclease H-like superfamily protein | [more] |
AT3G01410.2 | 8.2e-05 | 45.65 | Polynucleotidyl transferase, ribonuclease H-like superfamily protein | [more] |