CmaCh00G002870 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh00G002870
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Plant protein 1589 of unknown function
Location: Cma_Chr00: 22598734 .. 22604518 (+)
RNA-Seq Expression: CmaCh00G002870
Synteny: CmaCh00G002870
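
The Location field packs the sequence ID, start, end, and strand into one string. A minimal sketch of splitting it apart and computing the span length follows. It assumes the coordinates are 1-based and inclusive (the usual GFF-style convention, not stated on this page), and the function names are hypothetical.

```python
import re

def parse_location(location: str):
    """Split a 'seqid: start .. end (strand)' string into its four parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", location)
    if m is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start: int, end: int) -> int:
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return end - start + 1

seqid, start, end, strand = parse_location("Cma_Chr00: 22598734 .. 22604518 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 5785 bp on the plus strand of Cma_Chr00.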
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TAAAATATCACCAAGCCGACTCCTATGTATTTTTCCGTTTCCACCATTTTTCTATCATTCCATTTCCAATTCTCATTTCATTACAAATTTCCTTAAAATCAGAGCACAGTCATTGCTTAACAAGATTTTTTTTCCTCTCAGCTTCGTTTTCTCATCAATTCCCATCCTCTAATTTCCTTGTTTCATCCATTTGATTCAAACATCTTCCGTCGTTCTTTCTCGCACTTGCGGCTATCGATGCAGGATTTCATTTTGTTCTCGTCTTCTAGCTTCTAAGCTTTGAGTAAATCTCAACTGCGAATTTCAACTGCCATTTTTGTTCAACAATCGATATCGTACGCGTGTCTGAGCTTGGTTTCGGAGTTTAGGTTACTAAATTTGTCGAAATTACACAATTTCACCGAGTTTTGAAATCTGGGTTGCAGGTAATATTGCCAGATCGTTAACCTTCCATCCGTATTCCCGGGTTGTTTATATCTGTTTGAAGGAATGAAAATTGGGATTACATTGATTGTTGTAGTTTAATACCACTACTGTTTCGTTGTTAAACTGTATTGATGATGGAATTCAGAATGCCCACGGAACTGAGTTGAACTTGAGATCCAATTGAGTTTAGACAATATATTTTCTCGTTTTCATCCTATGTGCGAAGGTTCCATGCTTTAGTGATTTCCATTCATATTCCATAACTATGTAACATGTGGTATTGGTTTGGAATTGAGTTCGTCATGAAAAGTGATTGTTTCTCTCATTTGAATTTTTGGAGCATGGATGTTTAATCTAGGAGGGCTTTGGATATTGTCTGTTGATGAACTTCACAAGCTGGGAAACATTATTCTTTCTCTTTAAATTAGAAAATCAGTCGTAGGTGGATTTCTAGCATGGAATATAAAGCATTGTTTTGAAAAGTTGTTTTTGATTTAAATGAATTAGAATGATTGGCAGTGGATTTCCAGTATGATTACTCTTTGGGTTTCAAACTATCTAAAGGATCCCTGTTCTTTTTCCCATTCATCGTTGGTCATTTTGCAAAGAATTTACCCAATGAATTAGCAACGTATTGGTTGTGTTAGCTTTTACTTTATGGTGTATATTTCTTACACTTCATCTTATCTGTCCTATTGGTTGGACTGGTGATTGAATAATTTGTATTCTGGTTTCAGGTGAAATAATCAAGCCTACAAGAGCTATACCTATGTCTATAATCACGTTTTTTTCTGCAGCCTTCTCTGTACTTAAAGAATACCTAATTTGTATAGAGAGATGCGATCATTCTGCTTTCAACTGATATGAAGACTGCACAGGTTTCCTTTTATAATTCTGTTGTTGCTTTATACAAAAAGCTCGACGTTCCTCTAACTTCACCTTTTTACTTGTTCTTTGCCTACAACGGAACTTCTCCACTTACTCCTTTACCTCAAATTTTAGAATGATTTTTCAACTTCGTGGTGCTAATGTTATACTTCCAATATTGCCTAATTGAAAGAAGAAATGGATTTGGCTTTAAAATTTAATAGAATTTGGAGACATCGTTTTTTATCCTACGTGGTCCTTATCAGCCTGACATATTTCTTCCATTACCATTAAGGAGTGTGCTGCCTCATATGATATTAGATGTCGGATCATATTACTTATGCCTGCAAATATGGAAGTTTGATGTGACTGTTGGACATCAATCCTTGTTGATATTCTGGCTCTTGGACATTTTATGTTGTATTTTCCCCTGTATCTCATTTATAGTATTTAGTTTCCGAATCGGTTACTATAAATTGGTTATTGTCACACTAAAAAATCTAATTGTCATTCTAAGATCCTTTCTTGACATGCATAGAGAGCTCAAGATGCAAAAAAAGCTCTGGATGATTCCAAAAATAACCTACCAAACAGCTACAAAACTGAGGCTCCTATAACGGATACAGGTTCGATTTCTGCCTCAAACAATGATGGCAAAAAAGTTTCTCACCAAGATATTGAATTTGTAAGCCACAGATTCTTTCTAA
TTACAATGACCATTAGTTTCTATATTTATTTTTTCATCTTCAATGTTCTGTTGTGATTTCTTATCTCTTCATTGTTGAATGAATTGGAATGCTGCTCTAGGTCCAGAATTTAATAGAGCGGTGCCTACAGTTATACATGAATAGAGATGAGGTGGTTAAGACCCTATTGAATCGTGCAAGGATAGATCCTGGATTTACGTCATTGGGTAAGTTTAGGAGCTTTCATTAAGTTCTATGTTCTAATATTCATGTTCAATCAAATAACAATGATTGTAAACAATTACTATCACTCCACACACACTTATTATAGATGATTATTACGTATCTATTATTGTTTATTAATGTTCTGGACTTTCCTCATTCAGTTTGGCAGAAGCTGGAAGAAGAGAATGCTGATTTTTTCAGAGCCTACTATATAAGATTAAAATTGAAAAAACAAATCCTTCTTTTCAATCATTTGCTTGAACATCAATGTCGTCTCATGAATTATTCAATACCTCCCAAAGTTCCGTTAGCTCCTATGCAGAATGGAATTCATCCTATGCCTGGTAAGCTCTCATATCCTTAACCAGATTATGTAATTATCTAGCTCAGTGGATACCATGGGTGCTGGGATTCCAAGCCTAGTTGTAGCTCTGCTCTTAGATAGCTTAATTTTATATATTGAAGTAAGAGAGTTACATGTTGCAAGATACTAGTATTTTGTTATTCATCACTACTTATTTTCAGTGCAGTTAACAACTTACCAATGGGATATCCAGTCCTTCAACAACCTCCTATGTCATTGCCAGGTCAACCCCACATGGATACCATGGGCTCTGGGATGCCAAGCTGCCATGTGGTCAATGGAGTTCCTGCACCTAGTAACTTCCATCCTATTAGAATGAACTCTGGGAACGAGTAAGAAGACTGCCCTAGTAAAGTCGTTATTAATATACATTCTAAAACATTCAATGCTGTGGTCTCGATAAAACAAAACAATAGAAAGTTATAAAATGGTCAATAAGATTGCATAAAATGGAAATATTTTTAAACTTAGACGAAAAAATGTGATACTGTAAAAGAATTTGGCATGTTTATTGAGCACACCCTTTTCAATATAGTTTCTATTACCTCTAGGGGAGTGGTAATTATGCACAAGTACTAGCATTGGGTGTCATGGCTCATGCTGCTAGTCTTGCCATGATATCATTGATCATTACTGCTTCGCTAAATTCTCCTCAAGAAAAATCAGCCATGTTCATGTGTCTGTGTGGTAGATTTTGCATGGAAGAAACTTGTGCTGGAGTTTTGAGTATCTACAAACAAAGCTAAAGGAATTACATCCAGTGATATAAAATCGTACTAATATGCATGGGACTAATGACCAATGGTTAAGATATTGCCTCTAGGATTCATGTGCCGACTGTAAGAGAGCATTTAAAAATATTCATGGAGGATAGTAATCCAGTATTTACATTCATTGGCTTCATAAATTGATCTCACTTCTCACTTTTCACTTTAACTTTTTGTTATTTTATTGCATCTAATACTTGTTCTTATTGAAGTATGTTGATGAACAGCGTGGCTGATGTGGCTTCTGTTATTCCACCAAATGGTACAATGTCCTCCATGTCAGAGATGTCTATGAGTCCTACATCGGTGGCATCCAGTGGTCATTTCTCATTCACTGCTCCAGAAATATCTGGAATTGGAGTAGATACTTCTGCACTTGATACTGCATTTACATCAGAAATCGTAAATTCAGTTGGTCTGCAACTTTCGCAAGATGATGGAGCAGGAAATTCCAGGGATTCTCTCAGATCATTGGATCTGATTCAGTGGAATTTTAGCCTGTCAGATCTTAAAACAGATTTATCAAATTTGGGAGGTAATTTCTAGATGATTTGTAGATTTCTCTAATTTTGTCTCTTATATTGATGTCTACAGAACTTTTAGCAGTATCATATCACCTTTTCAGATCACCGTAATTTCTTTGGAATATTACTTCACAAGGCTAG
AATAATAAGCATTGTTCTAAATTGGACTTATTTTTTCCTCCTTCATTTAGCTAGGTTTACAATAAGAAACAGGGTAAATACTAGAAAGGGTAAACGTTGCGAAGGGTCAATCTCTCCGGCCTTTATACTAGGCAATAAGAAACATCTAAATTTTGGAAAATGCATCTCCAAACGCCCAAGCTAAGCTAGGTGATCCTGATCACAAAGATTTCAATAGAATGGAGAGAGAAACCAAGCTGGACAAGAAAGGGGGACCAGACATTATACCATTCAAGTAATAAATTATTGAAAAAGGGGAGTCAATAGTAGTCCCTGCCCCCTCGATGGTCAGCCAATACATAAAAGAATAATGAGAAATAAGCCATTGAAAACCAAACAACTTTAACCACCTAGCAGTAGTTCAACAATCCTTTTTGCAATAAATAGAGAAGGACGTAGAGAGTGAGAAAAAGCTTTCCCTATCTTGTGAACAATGGCAACAGTTCAAGGGAAATTTGCCTCTGAAGTCCAACTCAAAATATTTGAAAGCTAGAGGATGTCGGACTTGATTGACAGATATCCTAATCGGAGTCGAACAAATATACTGTGTGGATAAGTCAATGCCTTGTTCATTCAGCGATCACCCTAGATTGAAGTTGTTCATTCATCCCTGATATGAATATAGCCGTGGGCATCCTTCAGATGACCGATTGAGGAATTGTTTGGGAATGATGGCCTTTTACTTTAGTATCATGTGAATCTAATTTTTCTTTTCCTTTTCCAATAAGTTACTTCAGTCAAATTGTTACGAAGTTTTTCTTTCAATCTTTTTTAATCTTCCGTAAGGAAAAGTCTTGTGTTGGTCTATAGAAGTTGATAGCTTTTAGGAAGACCTTTAAATAAGAATGCAAAATCCTATTTCGTTGGCAGTTTTCACTCTTCTCTTGGACTAACTTTAAGCATATTGATGAAATCTCTTCAAATTGGTGCAGATTTAGGACCATTGGGTAACTACCCTGGTTCTCCATTTTTGCACTCCGACCCGGAAATTTTGCTCGATTCACCAGAGCAAGACGACCTAGGTATGGGAACTGCAAGACTCTTTTCAGCTCTAAATTGTTTGATGTAACCAAAACCACCCTTTTTATCATGACAAGCAATTTTGGCGAACGTTAATCATTTTATATTTGTTCATTACTCTTTTCTTTCTTCAAGTATGATGACTCTTGCGAATTAGAAGTTCCTAGTTGTCGATGGAAGTTGAATGACATAGATTTTAGATATCAACTATGCAAATTTAAGAAAGCCAAGAAGTCGAGTCTAAACCTTTATGTACATAACTTATCATTTGGCCACTCACTAACGTCTCAATTTTATCGCAGTGGAGGAGTTCTTTGTTGATTCTGTCCCTGAGCAGCCAGACGAAGAGAAGTCCTAGAGAAAGGCGACAAAGGTTTTTTTTCTTCCCACTTGAATAAGTAGGATAGTATAAGAGTTGATCAGAAATCAAGAGCTCATTTTCTATTCACAATAGGCATCCAGTAATTTTGTCATCAACAATTTCAATTATTTAAAAATTGTAGTCACATAGCTTGGTTGGTTGGTAGGATTGTAGATAGGGATGTAGTCATGTAGGGGAAGCTGTGCTTCTCTTGTGCCGCTCATTTTAGAGGTCTCCAGTTTCTTGGAAGTTGAAGCTGACTGTTCAGTTTTTAATCCCTTGTCTGGTCTGTATTAAGAGTCAATTACAACAATCCCATAATAATGGTATTTTTTAGATGTTTAATTATTCTATCTCTTGTTTAA

mRNA sequence

TAAAATATCACCAAGCCGACTCCTATGTATTTTTCCGTTTCCACCATTTTTCTATCATTCCATTTCCAATTCTCATTTCATTACAAATTTCCTTAAAATCAGAGCACAGTCATTGCTTAACAAGATTTTTTTTCCTCTCAGCTTCGTTTTCTCATCAATTCCCATCCTCTAATTTCCTTGTTTCATCCATTTGATTCAAACATCTTCCGTCGTTCTTTCTCGCACTTGCGGCTATCGATGCAGGATTTCATTTTGTTCTCGTCTTCTAGCTTCTAAGCTTTGAGTAAATCTCAACTGCGAATTTCAACTGCCATTTTTGTTCAACAATCGATATCGTACGCGTGTCTGAGCTTGGTTTCGGAGTTTAGGTTACTAAATTTGTCGAAATTACACAATTTCACCGAGTTTTGAAATCTGGGTTGCAGGTGAAATAATCAAGCCTACAAGAGCTATACCTATGTCTATAATCACGTTTTTTTCTGCAGCCTTCTCTGTACTTAAAGAATACCTAATTTGTATAGAGAGATGCGATCATTCTGCTTTCAACTGATATGAAGACTGCACAGAGAGCTCAAGATGCAAAAAAAGCTCTGGATGATTCCAAAAATAACCTACCAAACAGCTACAAAACTGAGGCTCCTATAACGGATACAGGTTCGATTTCTGCCTCAAACAATGATGGCAAAAAAGTTTCTCACCAAGATATTGAATTTGTCCAGAATTTAATAGAGCGGTGCCTACAGTTATACATGAATAGAGATGAGGTGGTTAAGACCCTATTGAATCGTGCAAGGATAGATCCTGGATTTACGTCATTGGTTTGGCAGAAGCTGGAAGAAGAGAATGCTGATTTTTTCAGAGCCTACTATATAAGATTAAAATTGAAAAAACAAATCCTTCTTTTCAATCATTTGCTTGAACATCAATGTCGTCTCATGAATTATTCAATACCTCCCAAAGTTCCGTTAGCTCCTATGCAGAATGGAATTCATCCTATGCCTGTTAACAACTTACCAATGGGATATCCAGTCCTTCAACAACCTCCTATGTCATTGCCAGGTCAACCCCACATGGATACCATGGGCTCTGGGATGCCAAGCTGCCATGTGGTCAATGGAGTTCCTGCACCTAGTAACTTCCATCCTATTAGAATGAACTCTGGGAACGATATGTTGATGAACAGCGTGGCTGATGTGGCTTCTGTTATTCCACCAAATGGTACAATGTCCTCCATGTCAGAGATGTCTATGAGTCCTACATCGGTGGCATCCAGTGGTCATTTCTCATTCACTGCTCCAGAAATATCTGGAATTGGAGTAGATACTTCTGCACTTGATACTGCATTTACATCAGAAATCGTAAATTCAGTTGGTCTGCAACTTTCGCAAGATGATGGAGCAGGAAATTCCAGGGATTCTCTCAGATCATTGGATCTGATTCAGTGGAATTTTAGCCTGTCAGATCTTAAAACAGATTTATCAAATTTGGGAGATTTAGGACCATTGGGTAACTACCCTGGTTCTCCATTTTTGCACTCCGACCCGGAAATTTTGCTCGATTCACCAGAGCAAGACGACCTAGTGGAGGAGTTCTTTGTTGATTCTGTCCCTGAGCAGCCAGACGAAGAGAAGTCCTAGAGAAAGGCGACAAAGGTTTTTTTTCTTCCCACTTGAATAAGTAGGATAGTATAAGAGTTGATCAGAAATCAAGAGCTCATTTTCTATTCACAATAGGCATCCAGTAATTTTGTCATCAACAATTTCAATTATTTAAAAATTGTAGTCACATAGCTTGGTTGGTTGGTAGGATTGTAGATAGGGATGTAGTCATGTAGGGGAAGCTGTGCTTCTCTTGTGCCGCTCATTTTAGAGGTCTCCAGTTTCTTGGAAGTTGAAGCTGACTGTTCAGTTTTTAATCCCTTGTCTGGTCTGTATTAAGAGTCAATTACAACAATCCCATAATAATGGTATTTTTTAGATGTTTAATTATTCTATCTCTTG
TTTAA

Coding sequence (CDS)

ATGAAGACTGCACAGAGAGCTCAAGATGCAAAAAAAGCTCTGGATGATTCCAAAAATAACCTACCAAACAGCTACAAAACTGAGGCTCCTATAACGGATACAGGTTCGATTTCTGCCTCAAACAATGATGGCAAAAAAGTTTCTCACCAAGATATTGAATTTGTCCAGAATTTAATAGAGCGGTGCCTACAGTTATACATGAATAGAGATGAGGTGGTTAAGACCCTATTGAATCGTGCAAGGATAGATCCTGGATTTACGTCATTGGTTTGGCAGAAGCTGGAAGAAGAGAATGCTGATTTTTTCAGAGCCTACTATATAAGATTAAAATTGAAAAAACAAATCCTTCTTTTCAATCATTTGCTTGAACATCAATGTCGTCTCATGAATTATTCAATACCTCCCAAAGTTCCGTTAGCTCCTATGCAGAATGGAATTCATCCTATGCCTGTTAACAACTTACCAATGGGATATCCAGTCCTTCAACAACCTCCTATGTCATTGCCAGGTCAACCCCACATGGATACCATGGGCTCTGGGATGCCAAGCTGCCATGTGGTCAATGGAGTTCCTGCACCTAGTAACTTCCATCCTATTAGAATGAACTCTGGGAACGATATGTTGATGAACAGCGTGGCTGATGTGGCTTCTGTTATTCCACCAAATGGTACAATGTCCTCCATGTCAGAGATGTCTATGAGTCCTACATCGGTGGCATCCAGTGGTCATTTCTCATTCACTGCTCCAGAAATATCTGGAATTGGAGTAGATACTTCTGCACTTGATACTGCATTTACATCAGAAATCGTAAATTCAGTTGGTCTGCAACTTTCGCAAGATGATGGAGCAGGAAATTCCAGGGATTCTCTCAGATCATTGGATCTGATTCAGTGGAATTTTAGCCTGTCAGATCTTAAAACAGATTTATCAAATTTGGGAGATTTAGGACCATTGGGTAACTACCCTGGTTCTCCATTTTTGCACTCCGACCCGGAAATTTTGCTCGATTCACCAGAGCAAGACGACCTAGTGGAGGAGTTCTTTGTTGATTCTGTCCCTGAGCAGCCAGACGAAGAGAAGTCCTAG

Protein sequence

MKTAQRAQDAKKALDDSKNNLPNSYKTEAPITDTGSISASNNDGKKVSHQDIEFVQNLIERCLQLYMNRDEVVKTLLNRARIDPGFTSLVWQKLEEENADFFRAYYIRLKLKKQILLFNHLLEHQCRLMNYSIPPKVPLAPMQNGIHPMPVNNLPMGYPVLQQPPMSLPGQPHMDTMGSGMPSCHVVNGVPAPSNFHPIRMNSGNDMLMNSVADVASVIPPNGTMSSMSEMSMSPTSVASSGHFSFTAPEISGIGVDTSALDTAFTSEIVNSVGLQLSQDDGAGNSRDSLRSLDLIQWNFSLSDLKTDLSNLGDLGPLGNYPGSPFLHSDPEILLDSPEQDDLVEEFFVDSVPEQPDEEKS
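
The protein sequence above should be the conceptual translation of the CDS. A minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1) and using hypothetical names, is:

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence read in frame 0; unknown codons become 'X'."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon: end of the polypeptide
        protein.append(aa)
    return "".join(protein)

# First seven and last four codons of the CDS listed above.
print(translate("ATGAAGACTGCACAGAGAGCT"))
print(translate("GAGAAGTCCTAG"))
```

Passing the full CDS string to `translate` and comparing the result with the listed protein is a quick consistency check on the record.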
Homology
BLAST of CmaCh00G002870 vs. TAIR 10
Match: AT2G46420.1 (Plant protein 1589 of unknown function )

HSP 1 Score: 468.0 bits (1203), Expect = 6.8e-132
Identity = 248/367 (67.57%), Positives = 291/367 (79.29%), Query Frame = 0

Query: 1   MKTAQRAQDAKKALDDSKNNLPNSYKTEAPITDTGSISASNNDGKKVSHQDIEFVQNLIE 60
           MK  Q  Q + +   +S+     +   EAPI D+GS+SAS+NDG+KVS QDIE VQNLIE
Sbjct: 1   MKNGQELQSSTQVSHESQGEQKVNLSVEAPIQDSGSVSASSNDGRKVSRQDIELVQNLIE 60

Query: 61  RCLQLYMNRDEVVKTLLNRARIDPGFTSLVWQKLEEENADFFRAYYIRLKLKKQILLFNH 120
           RCLQLYM+RDEVVKTLL RARIDPGFT+LVWQKLEEENA+FFRAYYIRLKLKKQI++FNH
Sbjct: 61  RCLQLYMSRDEVVKTLLTRARIDPGFTTLVWQKLEEENAEFFRAYYIRLKLKKQIVVFNH 120

Query: 121 LLEHQCRLMNYSIPPKVPLAPMQNGIHPM-PVNNLPMGYPVLQQPPMSLPGQPHMDTMGS 180
           LLEHQ  L  Y++  KVPL PMQNGIHPM  VNN+PMGYPVLQ P M   G PH+D M  
Sbjct: 121 LLEHQYHLTKYNVHSKVPLVPMQNGIHPMASVNNMPMGYPVLQHPQMHAQGHPHLDPMSC 180

Query: 181 GMPSCHVVNGVPAPSNFHPIRMNSGNDMLMN-SVADVASVIPPNGTMSSMSEMSMSPTSV 240
           GM SCHVVNGVPAP+NF P+R+NSGNDM+++ ++A+   +IPPN   S MS+M +SP SV
Sbjct: 181 GMSSCHVVNGVPAPANFQPMRINSGNDMVIDTTMAEPTPMIPPN---SGMSDMPVSPASV 240

Query: 241 ASSGHFSFTAPEISGIGVDTSALDTAFTSEIVNSVGLQLSQDDGAGNSRDSLRSLDLIQW 300
           ASSGHF F A ++SG+G+DTSALD+AFTS++  SVGLQL  D GAGNSRD LR  D I W
Sbjct: 241 ASSGHFPFAASDMSGMGMDTSALDSAFTSDVGTSVGLQLGSDGGAGNSRDPLRPFDQIPW 300

Query: 301 NFSLSDLKTDLSNLGDLGPLGNYPGSPFLHSDPEILLDSPEQDDLVEEFFVDSVP----E 360
           NFSLSDL  DLSNLGDLG LGNYPGSPFL SD EILLDSPEQ+D ++EFFVDS+P     
Sbjct: 301 NFSLSDLTADLSNLGDLGALGNYPGSPFLPSDSEILLDSPEQED-IDEFFVDSIPGPPCS 360

Query: 361 QPDEEKS 362
           Q +E+KS
Sbjct: 361 QSEEDKS 363
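
The percentages in an HSP header are the match counts divided by the alignment length (gaps included). A minimal sketch that recomputes them from a header line follows. The function name is hypothetical, and the regular expression only assumes the `Label = matched/total` layout seen in the headers on this page.

```python
import re

def hsp_percentages(header: str) -> dict:
    """Return {label: percent} for each 'Label = matched/total' pair in an HSP header."""
    result = {}
    for label, matched, total in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", header):
        result[label] = round(100 * int(matched) / int(total), 2)
    return result

# Counts taken from the first HSP (AT2G46420.1).
print(hsp_percentages("Identity = 248/367 (67.57%), Positives = 291/367 (79.29%), Query Frame = 0"))
```

Fields without a `matched/total` fraction, such as the query frame, are ignored.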

BLAST of CmaCh00G002870 vs. TAIR 10
Match: AT2G46420.2 (Plant protein 1589 of unknown function )

HSP 1 Score: 456.4 bits (1173), Expect = 2.1e-128
Identity = 237/345 (68.70%), Positives = 276/345 (80.00%), Query Frame = 0

Query: 1   MKTAQRAQDAKKALDDSKNNLPNSYKTEAPITDTGSISASNNDGKKVSHQDIEFVQNLIE 60
           MK  Q  Q + +   +S+     +   EAPI D+GS+SAS+NDG+KVS QDIE VQNLIE
Sbjct: 1   MKNGQELQSSTQVSHESQGEQKVNLSVEAPIQDSGSVSASSNDGRKVSRQDIELVQNLIE 60

Query: 61  RCLQLYMNRDEVVKTLLNRARIDPGFTSLVWQKLEEENADFFRAYYIRLKLKKQILLFNH 120
           RCLQLYM+RDEVVKTLL RARIDPGFT+LVWQKLEEENA+FFRAYYIRLKLKKQI++FNH
Sbjct: 61  RCLQLYMSRDEVVKTLLTRARIDPGFTTLVWQKLEEENAEFFRAYYIRLKLKKQIVVFNH 120

Query: 121 LLEHQCRLMNYSIPPKVPLAPMQNGIHPM-PVNNLPMGYPVLQQPPMSLPGQPHMDTMGS 180
           LLEHQ  L  Y++  KVPL PMQNGIHPM  VNN+PMGYPVLQ P M   G PH+D M  
Sbjct: 121 LLEHQYHLTKYNVHSKVPLVPMQNGIHPMASVNNMPMGYPVLQHPQMHAQGHPHLDPMSC 180

Query: 181 GMPSCHVVNGVPAPSNFHPIRMNSGNDMLMN-SVADVASVIPPNGTMSSMSEMSMSPTSV 240
           GM SCHVVNGVPAP+NF P+R+NSGNDM+++ ++A+   +IPPN   S MS+M +SP SV
Sbjct: 181 GMSSCHVVNGVPAPANFQPMRINSGNDMVIDTTMAEPTPMIPPN---SGMSDMPVSPASV 240

Query: 241 ASSGHFSFTAPEISGIGVDTSALDTAFTSEIVNSVGLQLSQDDGAGNSRDSLRSLDLIQW 300
           ASSGHF F A ++SG+G+DTSALD+AFTS++  SVGLQL  D GAGNSRD LR  D I W
Sbjct: 241 ASSGHFPFAASDMSGMGMDTSALDSAFTSDVGTSVGLQLGSDGGAGNSRDPLRPFDQIPW 300

Query: 301 NFSLSDLKTDLSNLGDLGPLGNYPGSPFLHSDPEILLDSPEQDDL 344
           NFSLSDL  DLSNLGDLG LGNYPGSPFL SD EILLDSPEQ+D+
Sbjct: 301 NFSLSDLTADLSNLGDLGALGNYPGSPFLPSDSEILLDSPEQEDI 342

BLAST of CmaCh00G002870 vs. TAIR 10
Match: AT3G61700.1 (Plant protein 1589 of unknown function )

HSP 1 Score: 443.7 bits (1140), Expect = 1.4e-124
Identity = 245/364 (67.31%), Positives = 283/364 (77.75%), Query Frame = 0

Query: 4   AQRAQDAKKALDDSKNNLPNSYKTEAPITDTGSISASNNDGKKVSHQDIEFVQNLIERCL 63
           AQ  Q + +A  DS+ +   +   +API D+GS+SAS+ND +KVS QDIE VQNLIERCL
Sbjct: 7   AQELQSSTQASHDSQGDQKTNLSIDAPIQDSGSVSASSNDSRKVSRQDIELVQNLIERCL 66

Query: 64  QLYMNRDEVVKTLLNRARIDPGFTSLVWQKLEEENADFFRAYYIRLKLKKQILLFNHLLE 123
           QLYMNRDEVVKTLL RARIDPGFT+LVWQKLEEENADFFRAYYIRLKLKKQI+LFNHLLE
Sbjct: 67  QLYMNRDEVVKTLLTRARIDPGFTTLVWQKLEEENADFFRAYYIRLKLKKQIILFNHLLE 126

Query: 124 HQCRLMNYSI-PPKVPLAPMQNGIHPMPVNNLPMGYPVLQQPPMSLPGQP-HMDTMGSGM 183
           HQ  LM Y   PPKVPLAP+QNG+HPM   N+PMGYPVLQ P M +PG P H+D M  G+
Sbjct: 127 HQYHLMKYPPGPPKVPLAPIQNGMHPMAPVNMPMGYPVLQHPQMHVPGHPHHLDAM--GV 186

Query: 184 PSCHVVNGVPAPSNFHPIRMNSGNDMLMNSVADVAS--VIPPNGTMSSMSEMSMSPTSVA 243
            SCHVVNGVPAP+NFHP+RMN+ NDM++++ A+ A+  VIPPN    +M EM  SP SVA
Sbjct: 187 SSCHVVNGVPAPANFHPLRMNTANDMVIDTTANDATPQVIPPNS--GAMPEMVASPASVA 246

Query: 244 SSGHFSFTAPEISGIGVDTSALDTAFTSEIVNSVGLQLSQDDGAGNSRDSLRSLDLIQWN 303
           SSGHF F A ++SG+ +DTS LD+AFTS++           +GAGNSRDSLRS D I WN
Sbjct: 247 SSGHFPFAASDMSGMVMDTSVLDSAFTSDVG-------PDGEGAGNSRDSLRSFDQIPWN 306

Query: 304 FSLSDLKTDLSNLGDLGPLGNYPGSPFLHSDPEILLDSPEQDDLVEEFFVDSVP---EQP 361
           FSLSDL  DLSNLGDLG LGNYPGSPFL SD EI LDSPEQ+D +EEFFVDSVP      
Sbjct: 307 FSLSDLTADLSNLGDLGALGNYPGSPFLPSDSEIFLDSPEQED-IEEFFVDSVPGPRSNS 358

BLAST of CmaCh00G002870 vs. TAIR 10
Match: AT3G61700.2 (Plant protein 1589 of unknown function )

HSP 1 Score: 437.6 bits (1124), Expect = 9.9e-123
Identity = 245/369 (66.40%), Positives = 283/369 (76.69%), Query Frame = 0

Query: 4   AQRAQDAKKALDDSKNNLPNSYKTEAPITDTGSISASNNDGKKVSHQDIEFVQNLIERCL 63
           AQ  Q + +A  DS+ +   +   +API D+GS+SAS+ND +KVS QDIE VQNLIERCL
Sbjct: 7   AQELQSSTQASHDSQGDQKTNLSIDAPIQDSGSVSASSNDSRKVSRQDIELVQNLIERCL 66

Query: 64  QLYMNRDEVVKTLLNRARIDPGFTSLVWQKLEEENADFFRAYYIRLKLKKQILLFNHLLE 123
           QLYMNRDEVVKTLL RARIDPGFT+LVWQKLEEENADFFRAYYIRLKLKKQI+LFNHLLE
Sbjct: 67  QLYMNRDEVVKTLLTRARIDPGFTTLVWQKLEEENADFFRAYYIRLKLKKQIILFNHLLE 126

Query: 124 HQCRLMNYSI-PPKVPLAPMQNGIHPMPVNNLPMGYPVLQQPPMSLPGQP-HMDTMGSGM 183
           HQ  LM Y   PPKVPLAP+QNG+HPM   N+PMGYPVLQ P M +PG P H+D M  G+
Sbjct: 127 HQYHLMKYPPGPPKVPLAPIQNGMHPMAPVNMPMGYPVLQHPQMHVPGHPHHLDAM--GV 186

Query: 184 PSCHVVNGVPAPSNFHPIRMNSGNDMLMNSVADVAS--VIPPNGTMSSMSEMSMSPTSVA 243
            SCHVVNGVPAP+NFHP+RMN+ NDM++++ A+ A+  VIPPN    +M EM  SP SVA
Sbjct: 187 SSCHVVNGVPAPANFHPLRMNTANDMVIDTTANDATPQVIPPNS--GAMPEMVASPASVA 246

Query: 244 SSGHFSFTAPEISGIGVDTSALDTAFTSEIVNSVGLQLSQDDGAGNSRDSLRSLDLIQWN 303
           SSGHF F A ++SG+ +DTS LD+AFTS++           +GAGNSRDSLRS D I WN
Sbjct: 247 SSGHFPFAASDMSGMVMDTSVLDSAFTSDVG-------PDGEGAGNSRDSLRSFDQIPWN 306

Query: 304 FSLSDLKTDLSNLG-----DLGPLGNYPGSPFLHSDPEILLDSPEQDDLVEEFFVDSVP- 361
           FSLSDL  DLSNLG     DLG LGNYPGSPFL SD EI LDSPEQ+D +EEFFVDSVP 
Sbjct: 307 FSLSDLTADLSNLGDMYVADLGALGNYPGSPFLPSDSEIFLDSPEQED-IEEFFVDSVPG 363

BLAST of CmaCh00G002870 vs. TAIR 10
Match: AT5G04090.2 (Plant protein 1589 of unknown function )

HSP 1 Score: 145.6 bits (366), Expect = 7.8e-35
Identity = 109/296 (36.82%), Positives = 146/296 (49.32%), Query Frame = 0

Query: 45  KKVSHQDIEFVQNLIERCLQLYMNRDEVVKTLLNRARIDPGFTSLVWQKLEEENADFFRA 104
           ++VS +DI+ VQNLIERCLQLYMN+ EVV TLL +A+I+PGFT LVWQKLEEEN +FF+A
Sbjct: 7   RRVSREDIQLVQNLIERCLQLYMNQKEVVDTLLEQAKIEPGFTELVWQKLEEENREFFKA 66

Query: 105 YYIRLKLKKQILLFNHLLEHQCRLMNYSIPPKVPLAPMQNGIHPMPVNNLPMGYPVLQQP 164
           YY+RL +K QI+ +N LLE Q   M    P        +NG H  P+N   + Y   ++P
Sbjct: 67  YYLRLMVKHQIMEYNELLEQQINHMRQMHPTAGASVRNRNGSHVPPMNQQQLLYE-RKEP 126

Query: 165 PMSLPGQPHMDTMGSGMPSCHVVNGVPAPSNFHPIRMNSGNDMLMNSVADVASVIPPNGT 224
             S P        G    + ++ + V   S  H  R++                  PN  
Sbjct: 127 DQSSPNLSSPYLNGGSAINTNIPSYVDFSS--HSRRVDPS----------------PNSL 186

Query: 225 MSSMSEMSMSPTSVASSGHFSFTAPEISGIGVDTSALDTAFTSEIVNSVGLQLSQDDGAG 284
               + M +    + S   +   AP + G G   S +     +   N    Q   D    
Sbjct: 187 SLQATNMPLMQGMIKSETAYQNCAPYMYG-GEAQSTVGDVTIASFSNDSSNQSLNDPLVD 246

Query: 285 NSRDSLRSLDLIQWNFSLSDLKTDLSNLGDLGPLGNYPGSPFLHSDPEILLDSPEQ 341
               +  SL  I  NFSLSDL  D S   D+  L +Y GSPFL +D E  LDS E+
Sbjct: 247 PDAPTFGSLGQIPQNFSLSDLTADFSQSSDI--LESYEGSPFLLADAENFLDSSER 280

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
AT2G46420.1 | 6.8e-132 | 67.57 | Plant protein 1589 of unknown function
AT2G46420.2 | 2.1e-128 | 68.70 | Plant protein 1589 of unknown function
AT3G61700.1 | 1.4e-124 | 67.31 | Plant protein 1589 of unknown function
AT3G61700.2 | 9.9e-123 | 66.40 | Plant protein 1589 of unknown function
AT5G04090.2 | 7.8e-35 | 36.82 | Plant protein 1589 of unknown function
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR006476 | Conserved hypothetical protein CHP01589, plant | PFAM | PF09713 | A_thal_3526 | coord: 55..106; e-value: 6.8E-26; score: 90.3
IPR006476 | Conserved hypothetical protein CHP01589, plant | TIGRFAM | TIGR01589 | TIGR01589 | coord: 52..108; e-value: 6.5E-29; score: 97.7
IPR006476 | Conserved hypothetical protein CHP01589, plant | PANTHER | PTHR31871 | OS02G0137100 PROTEIN | coord: 4..360
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..44
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..16
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 17..44
None | No IPR available | PANTHER | PTHR31871:SF9 | HELICASE WITH ZINC FINGER PROTEIN | coord: 4..360

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmaCh00G002870.1 | CmaCh00G002870.1 | mRNA