CmaCh11G007800 (gene) Cucurbita maxima (Rimu) v1.1

Overview
NameCmaCh11G007800
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu) v1.1)
DescriptionProtein of Unknown Function (DUF239)
LocationCma_Chr11: 3791871 .. 3796299 (+)
RNA-Seq ExpressionCmaCh11G007800
SyntenyCmaCh11G007800
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACAAGAAAGTGGGGCAGTTTTGTGAGAGAGAAATTCATTGCTTTTTAGAGAGTTGTGCGCGTGTGGGTTTTACACCTTTCTTCTTTCCTGGTGGCCGTCCGCCATTGTTGTTCTTGAGTGAGCTTCTTATTGTTCTTCTGGGTTTGTTTTTAGTGGCGGTGTGGTAGAGGAAACTGCTGTGGATTTGTGAGTTTAAGAGGGTTTTTGGAGATGGGTTCTGCTAGGTTTAGCAGATGCAGGACCATGGAAGCTCTGGTCGCCATTTTTTGTGTTTTGGGGCTGGTTTCTGTGTGCTGTGCGGCGAGGATGGAATCTGCTTCCCGCCAGAAGCTTGAGGTTCGAAAACACCTCAGGCGCTTGAACAAGCCGGCTGTTAAAACCATCGAGGTTTGTAATCTGTATTCATTTTGGGTTTGGTTTTAGTGGATCTTGCTCTGTTTTGTTTCTAAAATGCTAGAAAATGGGGTGGAAATGGTTGGATTCCTTTGTTTTCTTGGGAACCAAACAAGGATGAACATGTTCCACTTTCTCCCCTTTTCTACTCTAATGGGGTATCTGTTTTTTCCGGGAAAATGTTTGGATTCCTCTGTTTTCTCAGGAACCAAACAAGGATGAACATATTCCACTTTCTTCCCTTTTCTACTCAAATGGGGTTTGCTTGTAACTGCGGCCTGTTTGGTTTCCGGGAAAATGTTTGGATTCCTCTGTTTTCTCAGGAACCAATCAAGGACGAACATATTCCACTTTCTTCCCTTTTCTACTCAAATGGGGTTTGCTTGTAACTGCGGCCTGTTTGGTTTCCGGGAAAATGTTTGGATTCCTTTGTTTTCTTGGGAACCAAACAAGGATGAACATATTCCACTTTCTTCCCGTTTCTACTCAAATGGGGTTTGCTTGTAGTGGCGGTCTGTTTGGATTCCTCTGTTTTCTCGGGAACCAAACAAGGATGAACATAGTCCACTTCCTTCCCTTTTCTACTCAAATGGGGTTTCTGTTTGGTTTCCGGGAAAATGTTTGGATTCCTTTGTTTTCTCGGGAGCCAAACAAGGATGAACATGTTCCACTTTCTTCCCTTTTCTACTCGAATGGGGTTTCTGTTTTTTCCGGGAAAATGTTTGGATTCCTCTGTTTTCTAAGGAACCAAACAAGGAAGAACATATTCCACTTTCTCTTAAAATCTCTGTCTTAACTTTCTTAGGAGCCAAACACCATACAATGTTGTTCTTATGTTTGTTCTATTGGTTGAACCAGAGCCCAGATGGGGACCTAATTGACTGTGTTCATATGTCTCATCAACCTGCATTTGACCATCCTTTCCTTAAAGACCACAAAATCCAGGTTTGTTCTTGTTTCATCAGTTCATTACCATATTTTTCTCTCATTTTCTCTTCGATCCATAAGTTTTTGTTCATCTTGGTTCTATGTGTTCTATCAGACGAGGCCAACTTTTCATCCAGAATGGCTAGATGAGAGCAAAGTAGCTGAGAAAGCCAGTGAAAAACCGAATCCAATTAAGCAACTATGGCATGTTAATGGAAAGTGCCCTGAAGGTACCATCCCCATTAGAAGAACCAAACATGAGGATGTTTTGAGAGCAAGTTCAGTGAAGAGATATGGAAGAAAAAAGCACAGATCAGCCCCAATACCCCCTAGATCTGCTGAGCCTGATCTCATCAACCAAAGTGGCCATCAGGTAATCATCCATTTCTCCGCTTTTTGTAACCGCCCAAGCCCACCGCTAGATGATATTGTCTTCTTTTGGCTTCTCCGCTTTCTGTTAGGGAGAGATTTTCACACCCTAAAGAATGTTTCGTTCTCCTCACCAACCGAGGTGAGATCTCAGAATCCAACCCCTTTTTAGGCCCAGCGTCCTTTCTGTCACTCATTCCCTTCTCCAACCCTCCAATCCACCCCCTTTAGGGTTCAATGTCCTTGCTGGCACACTGCCTCGTGTCCACCCACTTTGAGCTCAGCCTCCTCGCTGACACATTGCTTGGTGTTTGGCTCTAATATCATTTGTAACAGCCCAAGTCCATCGTTAGCAGATTTTGTCCTCCTGGGGCTTTCCCTTCCGAACTTCCCCTCAAGGTTTTTAAAATGCGTCCGCTAGTGAGAGGTTTTCACACCCTGTTTCGTTCTCCTCTTCAACCGATGTAGGATCTCACAATCCACCCCCTTTCGAGGTCCAATGTCTTCGTAGACACTCGATTCCTTCTCCAATCCACCCCCTTTAGGACTCAGCCTCCTCACTGACACATCATCCGTTGTCTTGCTTTGATACCATTTATAACCGTCCAAGCCCACCGCTAGCAGATAGTTCTCTTTGGACTTTTTCTTATAGGCTTTCCCTCAAAGTTTTTAAAACGCGTCCACTAGGGAGAGGTTTTCACACCCTAAAGAATGTTTCTCCTCAACCGATGTGGGATCTCACACTTTTTCTCTTAAAGTAATCCAATTCAGCTCTGATTTTCTTTTTTGGCTTGTTTAGCATGCCATAGCTTATGTAGAAGGAGACAAGTATTATGGAGCTAAAGCCACTATGAACGTGTGGGAGCCCAGCATTCAACAGCCAAATGAGTTTAGCTTATCACAGATTTGGATATTGGGAGGCTCTTTTGGTGAAGATCTTAATAGCATTGAAGCTGGTTGGCAGGTTTCATCCTTTTCATTATCTTCATGAACTTTATAAACCAGAAAGTGTCATAACTTCTTGTTCTTCCACAAACAGGTCAGTCCCAATCTCTACGGTGATAACAACACGAGACTCTTCACATACTGGACGGTGAGCCTGACCACACACTCGATCGTTACTGTTTCTTTTGCTCAAGTTAGAGGCGTTTTCGTAATAATCTTTCTTCCCAATGTTTGCTTCAGAGTGATGCATATCAAGCTACAGGCTGTTACAACCTCCTCTGCTCGGGCTTTATTCAAATCAACAGCGATATCGCGATGGGAGCAAGCATTTCTCCAGTCTCGGCGTTCCGAAATTCGCAATACGATATCAGTATACTTGTCTGGAAGGTAACAGCCAGAAACAGGCCTTTATAGTTTTTGAATTAGCATGAGAGTTTCTGAAACAAGGAGAAAAATCCAATATCCATGAAAATGATAGTACCGATGATAGATAAGCATGTGGGGGCATCAGTTTATGGTTGTTGTCCCAACCCAGCTGATATCTATACACATTTCAAATTCTTTGGCCACATCAATAGTTACCTTGCAACAATAAAATGAGACACTAATGGGGCGAAAAACAAGTGGCATTGCTTTGTTCTTATGCTGCTAAGAATCTTTTGGCCTTAACCATGACAGATTGGCCTAAGGGAAAAAAGGGACGATTAACCATAAACACGAATACTTTTAGGAATTAGAGGGACCGTCGAACTGTCGTCTCCTTCTATAGATTAGTCAGAGTGTATGCAAGTTGTTTTAAACACTTTCTTTCGTTGTAATCAAAACTCAACCAAACACATCGTTTATTAGTCGGGGTGTATCTCCCCTACAACGTCTCTTGACAAGACGCCACAACCGAGTACAACCGAGTCTCAGGTCCTTTCGTAGTGTATCTCCCCTACCACGTCTCATGACGAAATGCCACAACCGAGTACAACTGACTCTCAGGTCCTTCAATTGTGAGCTTTGCAGATCTGAAATGGTTTATTTGATTGATTTATGTAGGATCCAAAAGAGGGGCATTGGTGGATGCAATTTGGCAATGGCTACGTAATGGGATATTGGCCTTCATTTCTATTCTCATATTTAGCAGACAGTGCCTCCATGATTGAGTGGGGAGGTGAAGTGGTAAACTCCGAGCCAAATGGACAGCACACTTCAACACAAATGGGGAGCGGCCATTTCCCAGATGAAGGATTTGGGAAAGCAAGCTATTTTAGAAACATTCAAGTTGTTGATGCATCAAACAATCTCAAACCACCAAAAGGCATTGGCACATTCACAGAGCAGCCTGATTGCTATGATGTTCAAACAGGAAGCAATGGGGATTGGGGCCACTTCTTTTACTATGGAGGCCCTGGTAGAAACCCCAATTGCCCATGAATGAAGGTGAAATTCCCATTTTACCCTCCAAACTCTTCTTCTCTCTCTCTCTGTAAACTACTACCTCTTAGAGCTAGTGTTAGGGGGGTGCATGTGTCATTTTCTTGCTGATGGAAAGTTTTGGGTCTCTGGGTCATTGTCTTTTATGTACAAAGATTCTGTAGCCATGTAAGTGGTGGGGTAGGGGTAGGGTAAGGGCCCCAAGTTTCTTAAAAGCCCGATAGAAACAGGGCCTTTTGGCTATCAGTTCATGATGTGTGTGTATTTTATTAGTTTGTTTAGTATACTCCTCTCTTTGGAGTTGCTTTATTAATGAAGAATCAATGGATTTTGATTGTTGTAGATTGTGAGGACTATTGTTTCAAAGATA

mRNA sequence

ATGACAAGAAAGTGGGGCAGTTTTTGGCGGTGTGGTAGAGGAAACTGCTGTGGATTTGTGAGTTTAAGAGGGTTTTTGGAGATGGGTTCTGCTAGGTTTAGCAGATGCAGGACCATGGAAGCTCTGGTCGCCATTTTTTGTGTTTTGGGGCTGGTTTCTGTGTGCTGTGCGGCGAGGATGGAATCTGCTTCCCGCCAGAAGCTTGAGGTTCGAAAACACCTCAGGCGCTTGAACAAGCCGGCTGTTAAAACCATCGAGAGCCCAGATGGGGACCTAATTGACTGTGTTCATATGTCTCATCAACCTGCATTTGACCATCCTTTCCTTAAAGACCACAAAATCCAGACGAGGCCAACTTTTCATCCAGAATGGCTAGATGAGAGCAAAGTAGCTGAGAAAGCCAGTGAAAAACCGAATCCAATTAAGCAACTATGGCATGTTAATGGAAAGTGCCCTGAAGGTACCATCCCCATTAGAAGAACCAAACATGAGGATGTTTTGAGAGCAAGTTCAGTGAAGAGATATGGAAGAAAAAAGCACAGATCAGCCCCAATACCCCCTAGATCTGCTGAGCCTGATCTCATCAACCAAAGTGGCCATCAGCATGCCATAGCTTATGTAGAAGGAGACAAGTATTATGGAGCTAAAGCCACTATGAACGTGTGGGAGCCCAGCATTCAACAGCCAAATGAGTTTAGCTTATCACAGATTTGGATATTGGGAGGCTCTTTTGGTGAAGATCTTAATAGCATTGAAGCTGGTTGGCAGGTCAGTCCCAATCTCTACGGTGATAACAACACGAGACTCTTCACATACTGGACGAGTGATGCATATCAAGCTACAGGCTGTTACAACCTCCTCTGCTCGGGCTTTATTCAAATCAACAGCGATATCGCGATGGGAGCAAGCATTTCTCCAGTCTCGGCGTTCCGAAATTCGCAATACGATATCAGTATACTTGTCTGGAAGGATCCAAAAGAGGGGCATTGGTGGATGCAATTTGGCAATGGCTACGTAATGGGATATTGGCCTTCATTTCTATTCTCATATTTAGCAGACAGTGCCTCCATGATTGAGTGGGGAGGTGAAGTGGTAAACTCCGAGCCAAATGGACAGCACACTTCAACACAAATGGGGAGCGGCCATTTCCCAGATGAAGGATTTGGGAAAGCAAGCTATTTTAGAAACATTCAAGTTGTTGATGCATCAAACAATCTCAAACCACCAAAAGGCATTGGCACATTCACAGAGCAGCCTGATTGCTATGATGTTCAAACAGGAAGCAATGGGGATTGGGGCCACTTCTTTTACTATGGAGGCCCTGGTAGAAACCCCAATTGCCCATGAATGAAGGTGAAATTCCCATTTTACCCTCCAAACTCTTCTTCTCTCTCTCTCTGTAAACTACTACCTCTTAGAGCTAGTGTTAGGGGGGTGCATGTGTCATTTTCTTGCTGATGGAAAGTTTTGGGTCTCTGGGTCATTGTCTTTTATGTACAAAGATTCTGTAGCCATGTAAGTGGTGGGGTAGGGGTAGGGTAAGGGCCCCAAGTTTCTTAAAAGCCCGATAGAAACAGGGCCTTTTGGCTATCAGTTCATGATGTGTGTGTATTTTATTAGTTTGTTTAGTATACTCCTCTCTTTGGAGTTGCTTTATTAATGAAGAATCAATGGATTTTGATTGTTGTAGATTGTGAGGACTATTGTTTCAAAGATA

Coding sequence (CDS)

ATGACAAGAAAGTGGGGCAGTTTTTGGCGGTGTGGTAGAGGAAACTGCTGTGGATTTGTGAGTTTAAGAGGGTTTTTGGAGATGGGTTCTGCTAGGTTTAGCAGATGCAGGACCATGGAAGCTCTGGTCGCCATTTTTTGTGTTTTGGGGCTGGTTTCTGTGTGCTGTGCGGCGAGGATGGAATCTGCTTCCCGCCAGAAGCTTGAGGTTCGAAAACACCTCAGGCGCTTGAACAAGCCGGCTGTTAAAACCATCGAGAGCCCAGATGGGGACCTAATTGACTGTGTTCATATGTCTCATCAACCTGCATTTGACCATCCTTTCCTTAAAGACCACAAAATCCAGACGAGGCCAACTTTTCATCCAGAATGGCTAGATGAGAGCAAAGTAGCTGAGAAAGCCAGTGAAAAACCGAATCCAATTAAGCAACTATGGCATGTTAATGGAAAGTGCCCTGAAGGTACCATCCCCATTAGAAGAACCAAACATGAGGATGTTTTGAGAGCAAGTTCAGTGAAGAGATATGGAAGAAAAAAGCACAGATCAGCCCCAATACCCCCTAGATCTGCTGAGCCTGATCTCATCAACCAAAGTGGCCATCAGCATGCCATAGCTTATGTAGAAGGAGACAAGTATTATGGAGCTAAAGCCACTATGAACGTGTGGGAGCCCAGCATTCAACAGCCAAATGAGTTTAGCTTATCACAGATTTGGATATTGGGAGGCTCTTTTGGTGAAGATCTTAATAGCATTGAAGCTGGTTGGCAGGTCAGTCCCAATCTCTACGGTGATAACAACACGAGACTCTTCACATACTGGACGAGTGATGCATATCAAGCTACAGGCTGTTACAACCTCCTCTGCTCGGGCTTTATTCAAATCAACAGCGATATCGCGATGGGAGCAAGCATTTCTCCAGTCTCGGCGTTCCGAAATTCGCAATACGATATCAGTATACTTGTCTGGAAGGATCCAAAAGAGGGGCATTGGTGGATGCAATTTGGCAATGGCTACGTAATGGGATATTGGCCTTCATTTCTATTCTCATATTTAGCAGACAGTGCCTCCATGATTGAGTGGGGAGGTGAAGTGGTAAACTCCGAGCCAAATGGACAGCACACTTCAACACAAATGGGGAGCGGCCATTTCCCAGATGAAGGATTTGGGAAAGCAAGCTATTTTAGAAACATTCAAGTTGTTGATGCATCAAACAATCTCAAACCACCAAAAGGCATTGGCACATTCACAGAGCAGCCTGATTGCTATGATGTTCAAACAGGAAGCAATGGGGATTGGGGCCACTTCTTTTACTATGGAGGCCCTGGTAGAAACCCCAATTGCCCATGA

Protein sequence

MTRKWGSFWRCGRGNCCGFVSLRGFLEMGSARFSRCRTMEALVAIFCVLGLVSVCCAARMESASRQKLEVRKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPTFHPEWLDESKVAEKASEKPNPIKQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSIQQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPNLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPKEGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDEGFGKASYFRNIQVVDASNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGRNPNCP
Homology
BLAST of CmaCh11G007800 vs. TAIR 10
Match: AT3G13510.1 (Protein of Unknown Function (DUF239) )

HSP 1 Score: 695.3 bits (1793), Expect = 3.3e-200
Identity = 326/422 (77.25%), Postives = 365/422 (86.49%), Query Frame = 0

Query: 28  MGSARFSRCRTMEALVAIFCVLGLVSVCCAARMESASRQKLEVRKHLRRLNKPAVKTIES 87
           MG+  FS  +         C+  ++S+ CAA    +SRQK EV+KHL RLNKP VKTI+S
Sbjct: 1   MGAEHFSLVKFNRGF--FVCLWVMLSLSCAAASYGSSRQKFEVKKHLNRLNKPPVKTIQS 60

Query: 88  PDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPTFHPEWL-DESKVAEKASEKPNPIKQLWH 147
           PDGD+IDC+ +S QPAFDHPFLKDHKIQ RP++HPE L D++KV+ +   K   I QLWH
Sbjct: 61  PDGDIIDCIPISKQPAFDHPFLKDHKIQMRPSYHPEGLFDDNKVSAEPKGKETHIPQLWH 120

Query: 148 VNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSAPIPPRSAEPDLINQSGHQHAIAY 207
             GKC EGTIP+RRT+ +DVLRASSVKRYG+KKHRS PI P+SAEPDLINQ+GHQHAIAY
Sbjct: 121 RYGKCTEGTIPMRRTREDDVLRASSVKRYGKKKHRSVPI-PKSAEPDLINQNGHQHAIAY 180

Query: 208 VEGDKYYGAKATMNVWEPSIQQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPNLYGDNN 267
           VEGDKYYGAKAT+NVWEP IQ  NEFSLSQIW+LGGSFG+DLNSIEAGWQVSP+LYGDNN
Sbjct: 181 VEGDKYYGAKATLNVWEPKIQNTNEFSLSQIWLLGGSFGQDLNSIEAGWQVSPDLYGDNN 240

Query: 268 TRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPK 327
           TRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVS +RNSQYDISIL+WKDPK
Sbjct: 241 TRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSGYRNSQYDISILIWKDPK 300

Query: 328 EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDE 387
           EGHWWMQFGNGYV+GYWPSFLFSYL +SASMIEWGGEVVNS+  G HT TQMGSGHFP+E
Sbjct: 301 EGHWWMQFGNGYVLGYWPSFLFSYLTESASMIEWGGEVVNSQSEGHHTWTQMGSGHFPEE 360

Query: 388 GFGKASYFRNIQVVDASNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGRNPN 447
           GF KASYFRNIQVVD SNNLK PKG+GTFTE+ +CYDVQTGSN DWGH+FYYGGPG+N N
Sbjct: 361 GFSKASYFRNIQVVDGSNNLKAPKGLGTFTEKSNCYDVQTGSNDDWGHYFYYGGPGKNKN 419

Query: 448 CP 449
           CP
Sbjct: 421 CP 419

BLAST of CmaCh11G007800 vs. TAIR 10
Match: AT5G56530.1 (Protein of Unknown Function (DUF239) )

HSP 1 Score: 694.1 bits (1790), Expect = 7.3e-200
Identity = 326/421 (77.43%), Postives = 362/421 (85.99%), Query Frame = 0

Query: 28  MGSARFSRCRTMEALVAIFCVLGLVSVCCAARMESASRQKLEVRKHLRRLNKPAVKTIES 87
           M +A FS+ R     +  FC  GL+S+ CA R+ S SRQ  EV KHL RLNKPAVK+I+S
Sbjct: 1   MAAAHFSKERVFRGFLVWFCFWGLMSLTCAGRL-SVSRQNFEVHKHLNRLNKPAVKSIQS 60

Query: 88  PDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPTFHPEWL-DESKVAEKASEKPNPIKQLWH 147
           PDGD+IDCVH+S QPAFDHPFLKDHKIQ  P++ PE L  ESKV+EK  E  NPI QLWH
Sbjct: 61  PDGDIIDCVHISKQPAFDHPFLKDHKIQMGPSYTPESLFGESKVSEKPKESVNPITQLWH 120

Query: 148 VNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSAPIPPRSAEPDLINQSGHQHAIAY 207
            NG C EGTIP+RRTK EDVLRASSVKRYG+KKH S P+ PRSA+PDLINQSGHQHAIAY
Sbjct: 121 QNGVCSEGTIPVRRTKKEDVLRASSVKRYGKKKHLSVPL-PRSADPDLINQSGHQHAIAY 180

Query: 208 VEGDKYYGAKATMNVWEPSIQQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPNLYGDNN 267
           VEG K+YGAKAT+NVWEP +Q  NEFSLSQ+WILGGSFG+DLNSIEAGWQVSP+LYGDNN
Sbjct: 181 VEGGKFYGAKATINVWEPKVQSSNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNN 240

Query: 268 TRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPK 327
           TRLFTYWTSDAYQATGCYNLLCSGFIQINS IAMGASISPVS F N QYDISI +WKDPK
Sbjct: 241 TRLFTYWTSDAYQATGCYNLLCSGFIQINSQIAMGASISPVSGFHNPQYDISITIWKDPK 300

Query: 328 EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDE 387
           EGHWWMQFG+GYV+GYWPSFLFSYLADSAS++EWGGEVVN E +G HT+TQMGSG FPDE
Sbjct: 301 EGHWWMQFGDGYVLGYWPSFLFSYLADSASIVEWGGEVVNMEEDGHHTTTQMGSGQFPDE 360

Query: 388 GFGKASYFRNIQVVDASNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGRNPN 447
           GF KASYFRNIQVVD+SNNLK PKG+ TFTE+ +CYDV+ G N DWGH+FYYGGPGRNPN
Sbjct: 361 GFTKASYFRNIQVVDSSNNLKEPKGLNTFTEKSNCYDVEVGKNDDWGHYFYYGGPGRNPN 419

BLAST of CmaCh11G007800 vs. TAIR 10
Match: AT5G56530.2 (Protein of Unknown Function (DUF239) )

HSP 1 Score: 694.1 bits (1790), Expect = 7.3e-200
Identity = 326/421 (77.43%), Postives = 362/421 (85.99%), Query Frame = 0

Query: 28  MGSARFSRCRTMEALVAIFCVLGLVSVCCAARMESASRQKLEVRKHLRRLNKPAVKTIES 87
           M +A FS+ R     +  FC  GL+S+ CA R+ S SRQ  EV KHL RLNKPAVK+I+S
Sbjct: 1   MAAAHFSKERVFRGFLVWFCFWGLMSLTCAGRL-SVSRQNFEVHKHLNRLNKPAVKSIQS 60

Query: 88  PDGDLIDCVHMSHQPAFDHPFLKDHKIQTRPTFHPEWL-DESKVAEKASEKPNPIKQLWH 147
           PDGD+IDCVH+S QPAFDHPFLKDHKIQ  P++ PE L  ESKV+EK  E  NPI QLWH
Sbjct: 61  PDGDIIDCVHISKQPAFDHPFLKDHKIQMGPSYTPESLFGESKVSEKPKESVNPITQLWH 120

Query: 148 VNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSAPIPPRSAEPDLINQSGHQHAIAY 207
            NG C EGTIP+RRTK EDVLRASSVKRYG+KKH S P+ PRSA+PDLINQSGHQHAIAY
Sbjct: 121 QNGVCSEGTIPVRRTKKEDVLRASSVKRYGKKKHLSVPL-PRSADPDLINQSGHQHAIAY 180

Query: 208 VEGDKYYGAKATMNVWEPSIQQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPNLYGDNN 267
           VEG K+YGAKAT+NVWEP +Q  NEFSLSQ+WILGGSFG+DLNSIEAGWQVSP+LYGDNN
Sbjct: 181 VEGGKFYGAKATINVWEPKVQSSNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNN 240

Query: 268 TRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPK 327
           TRLFTYWTSDAYQATGCYNLLCSGFIQINS IAMGASISPVS F N QYDISI +WKDPK
Sbjct: 241 TRLFTYWTSDAYQATGCYNLLCSGFIQINSQIAMGASISPVSGFHNPQYDISITIWKDPK 300

Query: 328 EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDE 387
           EGHWWMQFG+GYV+GYWPSFLFSYLADSAS++EWGGEVVN E +G HT+TQMGSG FPDE
Sbjct: 301 EGHWWMQFGDGYVLGYWPSFLFSYLADSASIVEWGGEVVNMEEDGHHTTTQMGSGQFPDE 360

Query: 388 GFGKASYFRNIQVVDASNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGRNPN 447
           GF KASYFRNIQVVD+SNNLK PKG+ TFTE+ +CYDV+ G N DWGH+FYYGGPGRNPN
Sbjct: 361 GFTKASYFRNIQVVDSSNNLKEPKGLNTFTEKSNCYDVEVGKNDDWGHYFYYGGPGRNPN 419

BLAST of CmaCh11G007800 vs. TAIR 10
Match: AT1G55360.1 (Protein of Unknown Function (DUF239) )

HSP 1 Score: 690.3 bits (1780), Expect = 1.1e-198
Identity = 328/422 (77.73%), Postives = 364/422 (86.26%), Query Frame = 0

Query: 29  GSARFSRCRTMEALVAIFCVLGLVSVCCAARMESASRQKLEVRKHLRRLNKPAVKTIESP 88
           G    S  +     +   C+ G  S+  AAR    S+QK EV+KHL RLNKPAVK+I+S 
Sbjct: 3   GVVHLSTAKLARGFLVCLCLWGFFSLSYAAR-SGVSKQKFEVKKHLNRLNKPAVKSIQSS 62

Query: 89  DGDLIDCVHMSHQPAFDHPFLKDHKIQTRPTFHPEWL-DESKV-AEKASEKPNPIKQLWH 148
           DGD+IDCV +S QPAFDHPFLKDHKIQ +P +HPE L D++KV A K++EK   I QLWH
Sbjct: 63  DGDVIDCVPISKQPAFDHPFLKDHKIQMKPNYHPEGLFDDNKVSAPKSNEKEGHIPQLWH 122

Query: 149 VNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSAPIPPRSAEPDLINQSGHQHAIAY 208
             GKC EGTIP+RRTK +DVLRASSVKRYG+KK RS P+ P+SAEPDLINQSGHQHAIAY
Sbjct: 123 RYGKCSEGTIPMRRTKEDDVLRASSVKRYGKKKRRSVPL-PKSAEPDLINQSGHQHAIAY 182

Query: 209 VEGDKYYGAKATMNVWEPSIQQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPNLYGDNN 268
           VEGDKYYGAKAT+NVWEP IQQ NEFSLSQIW+LGGSFG+DLNSIEAGWQVSP+LYGDNN
Sbjct: 183 VEGDKYYGAKATINVWEPKIQQQNEFSLSQIWLLGGSFGQDLNSIEAGWQVSPDLYGDNN 242

Query: 269 TRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPK 328
           TRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVS +RNSQYDISIL+WKDPK
Sbjct: 243 TRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSGYRNSQYDISILIWKDPK 302

Query: 329 EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGQHTSTQMGSGHFPDE 388
           EGHWWMQFGNGYV+GYWPSFLFSYL +SASMIEWGGEVVNS+ +GQHTSTQMGSG FP+E
Sbjct: 303 EGHWWMQFGNGYVLGYWPSFLFSYLTESASMIEWGGEVVNSQSDGQHTSTQMGSGKFPEE 362

Query: 389 GFGKASYFRNIQVVDASNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGRNPN 448
           GF KASYFRNIQVVD SNNLK PKG+GTFTEQ +CYDVQTGSN DWGH+FYYGGPG+N  
Sbjct: 363 GFSKASYFRNIQVVDGSNNLKAPKGLGTFTEQSNCYDVQTGSNDDWGHYFYYGGPGKNQK 422

BLAST of CmaCh11G007800 vs. TAIR 10
Match: AT2G44210.1 (Protein of Unknown Function (DUF239) )

HSP 1 Score: 563.5 bits (1451), Expect = 1.5e-160
Identity = 260/415 (62.65%), Postives = 327/415 (78.80%), Query Frame = 0

Query: 39  MEALVAIFCVLGLVSVCCAARMESASR--QKLEVRKHLRRLNKPAVKTIESPDGDLIDCV 98
           M   V+ F  L +  V  A  + S       L++R HL+RLNKPA+K+I+SPDGD+IDCV
Sbjct: 1   MATRVSFFLALVMTVVILAPSVVSGENGFSDLKIRTHLKRLNKPALKSIKSPDGDMIDCV 60

Query: 99  HMSHQPAFDHPFLKDHKIQTRPTFHPEWL-DESKVAEKA-SEKPNPIKQLWHVNGKCPEG 158
            ++ QPAF HP L +H +Q  P+ +PE +  ESKV+ K  +++ N I QLWHVNGKCP+ 
Sbjct: 61  PITDQPAFAHPLLINHTVQMWPSLNPESVFSESKVSSKTKNQQSNAIHQLWHVNGKCPKN 120

Query: 159 TIPIRRTKHEDVLRASSVKRYGRKKHRSAPIPPRSAEPDLINQSGHQHAIAYVEGDKYYG 218
           TIPIRRT+ +D+ RASSV+ YG K  +S P P  S  P+++ Q+GHQHAI YVE   +YG
Sbjct: 121 TIPIRRTRRQDLYRASSVENYGMKNQKSIPKPKSSEPPNVLTQNGHQHAIMYVEDGVFYG 180

Query: 219 AKATMNVWEPSIQQPNEFSLSQIWILGGSFGEDLNSIEAGWQVSPNLYGDNNTRLFTYWT 278
           AKA +NVW+P ++ PNEFSL+QIW+LGG+F  DLNSIEAGWQVSP LYGDN TRLFTYWT
Sbjct: 181 AKAKINVWKPDVEMPNEFSLAQIWVLGGNFNSDLNSIEAGWQVSPQLYGDNRTRLFTYWT 240

Query: 279 SDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAFRNSQYDISILVWKDPKEGHWWMQF 338
           SDAYQ TGCYNLLCSGF+QIN +IAMG SISP+S + NSQYDI+IL+WKDPKEGHWW+QF
Sbjct: 241 SDAYQGTGCYNLLCSGFVQINREIAMGGSISPLSNYGNSQYDITILIWKDPKEGHWWLQF 300

Query: 339 GNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSE-PNGQHTSTQMGSGHFPDEGFGKASY 398
           G  Y++GYWP+ LFSYL++SASMIEWGGEVVNS+   GQHT+TQMGSG F +EG+GKASY
Sbjct: 301 GEKYIIGYWPASLFSYLSESASMIEWGGEVVNSQSEEGQHTTTQMGSGRFAEEGWGKASY 360

Query: 399 FRNIQVVDASNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGRNPNCP 449
           F+N+QVVD SN L+ P+ +  FT+Q +CY+V++G+ G WG +FYYGGPGRNPNCP
Sbjct: 361 FKNVQVVDGSNELRNPENLQVFTDQENCYNVKSGNGGSWGSYFYYGGPGRNPNCP 415

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
AT3G13510.13.3e-20077.25Protein of Unknown Function (DUF239) [more]
AT5G56530.17.3e-20077.43Protein of Unknown Function (DUF239) [more]
AT5G56530.27.3e-20077.43Protein of Unknown Function (DUF239) [more]
AT1G55360.11.1e-19877.73Protein of Unknown Function (DUF239) [more]
AT2G44210.11.5e-16062.65Protein of Unknown Function (DUF239) [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR025521Neprosin activation peptidePFAMPF14365Neprosin_APcoord: 84..206
e-value: 1.5E-46
score: 157.8
IPR004314NeprosinPFAMPF03080Neprosincoord: 219..441
e-value: 2.4E-88
score: 295.2
NoneNo IPR availableGENE3D3.90.1320.10coord: 224..342
e-value: 1.3E-17
score: 65.9
NoneNo IPR availablePANTHERPTHR31589:SF126SIMILARITY TO CARBOXYL-TERMINAL PROTEINASEcoord: 26..446
NoneNo IPR availablePANTHERPTHR31589PROTEIN, PUTATIVE (DUF239)-RELATED-RELATEDcoord: 26..446

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh11G007800.1CmaCh11G007800.1mRNA