Cp4.1LG04g07500 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG04g07500
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionDUF21 domain-containing protein
LocationCp4.1LG04: 4126980 .. 4133126 (+)
RNA-Seq ExpressionCp4.1LG04g07500
SyntenyCp4.1LG04g07500
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTCCTTCGCTACGCTCACTTCCCCTATCCATCGGCATCATCATCTTCTTCCTCCATCCCTCTCTCTCTCTCTCTACTGTACACGTTCACTTTCTCTCTCCTCTCTCTCTCTCTCTTCCCCAATCAACGTATATTCTCCCTCTCCCTCTTTTCTCTGCAACTGTTTCGTTAATGGACCTTCTCCGTCGCCTTCAACTCAACGCCGGCCTCTGAAGCTGAACTCGCCGTCAAACGCTCTTTCCTCTCAATTCAATGGCCGTTGATTACGAATGCTGCAGCACTAATTTCTTCATCCACATACTCATCATCGCCTTCTTGGTCATTTTTGCCGGCTTGATGTCCGGCCTCACCCTCGGCCTTATGTCGATGAGCCTCGTTGATCTCGAAGTTCTTGCCAAGTCTGGCACCCCCAAGGATCGCAAGTACGCCGGTAAAACTTCTCTTTCTTTTTATGGTTAATTGATCCGCTTTCCCACTCCGGAAATGTGAGGTGGAATTGGATGAACGAGTTTGTTGTGTGGGGCCGTTTACTGTACCTATTCGATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNAGTCCTCTTTTTCCCCCTTTTCTTTTGATTGGGTATATTTTATTGTTATCTACAGCCAATTTGTTTTCATCGTGTCTAATGTTGATTTGAGGCTTCCCCTGTTTTTCTTTTTTGACTTAGAAATATGAGGGGAAAAAAACCTTGAGATTCACCGTTGATGGAATATGTTGAATCTGTTTTTGGGGAGCATCTAATTGAGGAAAGGGAACTGACATGTAATTGGATAAGGATTAAAAGATTGTTCCCATCTGCTTCTTGATGTGTTATGCTGGGCTTGCTGTTCCACGACTCCATAAGCTTCAAATGAGGTTCTATGATATATTTATCATGGGTTTATAAGTAAGGAATATATCTCCATTCGTAAGGAAAATATCTCCATTGGTACAAAGCCTTTTGGGGAAGCCAAAGTAAAATCATGAAAACTTATGCGTAAAGTGGACAATATTATACCATTAATGTTATACCAGTATGGAGAGTCATGATTTCTAACTGATACCATTGTGGAGAGTCGTAATTTCTAATCGGGTGTCGGGTGGGAGGAAGTCTGATAAATTATATTGTTCTGTTTTGATTCTTCATTTTTTTTTATCTCCATTTCTTCATTTGTTCCTTCCCCCAAATTGGGTGGGTCAAAAAAAGGATCAATACGACGCCTTTTTCAATGTACTTTCTACTAACTTTTAGTCCTCTATAAGGATGTATCAATGGTCAAAATGTAACAGTGTCAAACTGTAGCTGAAAAGTTACCCACTAGTTTGGGATGACTTTGGAAGATCCTTTAGACCCTAAGGTGCAAGGGAGTTTCTTCTTTTTGTTTCTTGAAGTTCTTTTCTTATTATTCTATCTGGCTAAATGTTTTTGATACTTGACAACTATACTTTATAGATGAGGGGAAGGGAGGGATTAGAACCTATGACACATAAGGGGAGGTACTCGATCTTGGAACAGTTAAACTATGCTCAGATCAATTGAGATGTTGATCTAATAGCTCTAAAATGGCTGGAGGATTGTGTTACCATAGAAAACTTTCACAATGGAGCGTGTTAAATTATGTCTCGTATTTTCTTTGATGAGAACTCTCACATTCAATCTTAGAGATGGTTTGCTTCTTCAACTTTTGTTGACATGATTGAAAATTAAGGTCTAGGGGAGTTTCTCCATTAATATTTTGGTGTTTGACCAGTAATCTTATTTAAGAAATATAATTGTCATTAATGAACTTCAGAATCTGCAACAAGAGGCTTGCTTGGTTGACTTAAACATTGGAATGGATATTCCCTAATTCTTTAAAGCTCTTACTAATCTTGTTAAATTATGTTATTCGAAGATTAAAAAACAATAAGCATGTGTCTTTCTTTTTCTTTATTATTATTATTTTTTTTCTAATGACACTTGAAAAGTCATGCATTTGATAATAAAAATATGATAATTTATCATTTTTAGAAATCTAATTCCGCTGACTTAAGGTTCAGAGGAATGTATCTTATACCTGTTTGTAATGGTTTTCTTTTTGAACTTCAATCAGAGTCTCTGTGTTGTGATTAAAATCTCTTCATAATAATTCACTTGCTTTTGAACAGCAAAGATAATGCCAGTAGTAAAAAACCAGCATTTATTGCTGTGCACCCTACTGATTTGCAATGCAGCAGCCATGGAGGTACATTTTATTGAAGTGTAGCGAATTAGTGTGTAGAAGGTTGACCACACAGAGTAAATATAACTGTTCTTGCAGGCACTTCCTATTTTCCTTGACAGTCTAGTTACAGCTTGGGGTGCTGTTTTAATCTCGGTGACATTGATTCTTCTATTTGGCGAGGTCAGAGGAATTATCTAAGATATGGGCTTTTATGCAAGTTTGGAATATTTGTTCATTTTGTTATTCCATTAAATATTAGCTAGGTCACTCAAGGATTGCATTATGGATCTGCAACTCTGTCATTGAAGCATACTGTAATCTAACCGCCCAAACTCACTGCTAACAGATATTGTTATTTTTGGACTTTTCCTTTCGAGTTTCCCTTGAAGGTTTTTAAAATGTGTCTACTAGAGAGAGGTTTCCACACCCTTATAAAAAATGTTTCGTTCTCCTTCCCAACCGATGTGAGATCTCACAATCAGGGCCCAGCGTCCTCGTTGACACTCGTCCCCCTCTCCAACTCGAAGGGGGTGGATTGTGTGATCCCATATCGATTGGAGAGAGGAACGAATACCAGCGAGGACGCTGGTCTCCGCAGGGGGTGGATTGTGAAATCCCACATCGGTTAGGGGGAGAATGAAGCATTCTTTATAAGAGTGTGGAAACCTTAAAAACTTTGAGGGGAAGCCCGAAACGAAAAACCCAAAGAAGATAATGTTTATTGACGGTGAGCTTGAGTCATTACACCAGAGTTGATTCAGCACAATTTTTTGCTGTAACGATATTGATACCTCAGTTCACATTTTTCAGTTGGTACCAAAGCTAAGGGGTAAATTTGATGAAATCTATTTACTTTGATGTACAGATTATACCACAATCTGTTTGCTCTCGTTATGGTTTGGCAATTGGATCAACAGTTGCTCCTTTTGTTCGTGTTCTTGTCTGGATTTGCTTCCCTATTGCATATCCAATCAGCAAGGTAGACATTGAGGACTTGGGTTACTAAAGTTTGCTACATTATTGTTCCAAATGGTTCCTAAAATATTTATGTTAAACGTGATTTTCAGTTACTAGACTTTCTACTGGGCCATGGACATGTGGCCCTTTTCCGTAGAGCTGAGCTCAAAGCACTTGTGAATATGCACGGTAATGAGGTATATAAGTTTATATTTTGTGTGGGGTTATGTTTATTTTAAATTTTTGGTAGACTTATGACATGTTTAACATTAACAAATATTTGTTGTTTTTAATTTTGAAGCAGACCTTTGCTTCTCTTTTCTGCTTCCATATTCTACCACTAATGTCCGAGATTAACCTGTAACAGGCTGGAAAGGGTGGAGAGCTGACACATGATGAAACCACTATCATTGCTGGAGCACTTGAACTTACTGAGAAAACTGCTAGTGATGCCATGACTCCCATATCTGAAACTTTTGTAATTGACATTAATGCAAAGCTCGATAGGTCATATGCATGTTGAGAATAAGACTATAAATGTTCACCTTCATAATTCACTACTTCCAATTCATAGCTTCTGATATATTTGTGATGATTGCACAGGAAGTTAATGAATCTCGTTTTGGCGAAAGGGCATAGCAGGGTGCCAGTTTATTATGAAGAACAAACCAACATAATCGGTCTTATCCTGGTAAACTTGTTAATCATCTCTGCTCATATCGCATCAAACAACCCTCTATACAATGAATTTTATGACCTTTTCTTGATTTAGAGGCCTCTTCTCGAGCATTACTATTGTTGGTTGTCATCTTGTCATTTGTACATTATTTGGATATGGAGTAATTGGATGCAAAACTGGAGGCTTCACTCCCATTTTTCAATATCAACAGCACATTCTTTTGCATATTTCTATTTATGTCATTCACGTGTGATCTGTAATCTCCAACTTTTCCAACACAATGTGCAATAATTCAAACCCACCCCACCTTAGCAGATATTGTTCTCTTTAGGCTTTCCCTTTCGGGTTTCTCCTCAAGATTTTTAAAACGTGTCTACTAGGGAGAGATTTCCACCCCCTTAGAAAAAATGATTCGTTTTCCTCCTCAATCGACGTTTGATCTCACAATCCAACCGATGTTACAAATGGTATCAGAGCCAAACACCAGGCGATGTGCTAGCGAGGAGGCTGAACCCCGAATGAGGTGGACACGAGGTGGCGTGCCAGCAAGGACGTTGGGCCCCGAAGGGGGTGGATCGGGGGTCCCACATCGATTGGAGGAAGGAATGAGTACTAATGAGGACGCTGGGCCCCAAAGGGAGTAGATTGTGAGATCCCACATCGATTGGGGAGGAGAACGAAGCACTCTTTATAAGGTTAAAAATTTGGATGCATTTTAACAACCTAGAGGGAAAGCCCGAAAGTAAAAGCCCAAAGAGAACAATATTTGCTAGCGGTGGGCTTGAGCTGTTAGAGAATGCATCTATTCTCTGTTGCAAAAAGGTAATTTCTCCTTTTTCTTAACACCTTTGGCTTTCTATATCAGGTTAAAAATTTGCTTACAATCCACCCAGATGATGAAATTCCAGTAAAGAATGTTACCATTCGAAGGATTCCAAGGTACATTGGTGGCAGAAAGGATATAGTTCTTTTAAATTTGATAGTTCAAATTCCAGTATCATTCTGATATAGACAATTTCATTACAGAGTTCCAGAAACTTTGCCATTATATGACATTCTGAATGAGTTCCAGAAAGGTCATAGCCACATGGCCATTGTTGTTAAACAATGCAACAAAATGAATGGAAAACCCGATGAACAACCGGGCGACGGTACGTGCCTCTGTGTTCATTATACACAAGTCATTACTCTGTTTAGATATTCAGATTCTGGTTTTGGTAGGCATTATATTAATTGAACTGCTCAATCAAACTAGATTCCCAGAAAGAAGTTAGAATTGATGTGGATGGTGAAAGGCCTTCACCTTCCCAAGAAAAAGCCATGAAGATTAAGACATCATTTAAGAAGTGTAAAAGCTTTCCCGCAAATAGTTCATTCAGGAGTGGTAGTTCTAGAAGCAAGAAATGGACAAAAGATATGTACTCTGATATCCTAAAAATAGATGAAAACCCTCTCCCTAAGCTCGTCGAGGAAGAAGCTGTGGGTGTTATAACAATGGAAGATGTCATTGAAGAACTATTACAGGTAGTATCTGTTCATGCTGCACATGACATAATTATTCGTTTGTTTTCGACATAGGATGGTTTTTTAGTATCAATGTTTGTTTTTGACAGGAGGAGATATTTGATGAGACTGATCATCAAACAGAGGACTCATGATAGTTGTTAGTCAAAGAATGAATGGTATGTTCTAAATTATATACGTACATAGAAAATATTTGTTGAATATGTTGAGGCTCTTAATAGATGTTGGAACCCACGAAATGGTAAAGCTAAGCTTTTGAGGGTTAAGGAGAGAAGTTTGTGTTCGTATTATACAACCAGAGAATGAGTATTTTAGATGTTTAAGAGATGAAAAGACAGTCTCAAAGTAAATAGAAGAGAAAACTCAAATTTTAGAAGCTTTTGAGGGTTAAATTAGTGTATGAGCTCGATTCAACTTCGTTCTAATTTAGTTTATGCGGTTTTTAGAAGTTTTTTCCAGTTGTGCTTTGGTTAAATCTCTCCAAATCAAAGGTCAAGAATGTTCATGAATGTTCATGAGCTAGGTTGGATTGGGTTGAGTTGAGTTTCAGATATTTTTTTTAGACTCAACTTACAGGAACCGTAACTCATTCACAATGGTATGATATTTGTCTACGTTGAGCATCCCTTGACGACGCCTGTTTTTGGAGCATATTGCTAGTTTAATAGAGTTTAATCACGGATTTTCACCATGTTATACCTTCGAGACTCTTAACTTCTTTGCTTGACACTTGAGAATTTTAAGTTAGGGCATGACTCTCATACCATATCATAGGA

mRNA sequence

TTCCTTCGCTACGCTCACTTCCCCTATCCATCGGCATCATCATCTTCTTCCTCCATCCCTCTCTCTCTCTCTCTACTGTACACGTTCACTTTCTCTCTCCTCTCTCTCTCTCTCTTCCCCAATCAACGTATATTCTCCCTCTCCCTCTTTTCTCTGCAACTGTTTCGTTAATGGACCTTCTCCGTCGCCTTCAACTCAACGCCGGCCTCTGAAGCTGAACTCGCCGTCAAACGCTCTTTCCTCTCAATTCAATGGCCGTTGATTACGAATGCTGCAGCACTAATTTCTTCATCCACATACTCATCATCGCCTTCTTGGTCATTTTTGCCGGCTTGATGTCCGGCCTCACCCTCGGCCTTATGTCGATGAGCCTCGTTGATCTCGAAGTTCTTGCCAAGTCTGGCACCCCCAAGGATCGCAAGTACGCCGCAAAGATAATGCCAGTAGTAAAAAACCAGCATTTATTGCTGTGCACCCTACTGATTTGCAATGCAGCAGCCATGGAGGCACTTCCTATTTTCCTTGACAGTCTAGTTACAGCTTGGGGTGCTGTTTTAATCTCGGTGACATTGATTCTTCTATTTGGCGAGATTATACCACAATCTGTTTGCTCTCGTTATGGTTTGGCAATTGGATCAACAGTTGCTCCTTTTGTTCGTGTTCTTGTCTGGATTTGCTTCCCTATTGCATATCCAATCAGCAAGTTACTAGACTTTCTACTGGGCCATGGACATGTGGCCCTTTTCCGTAGAGCTGAGCTCAAAGCACTTGTGAATATGCACGGTAATGAGGCTGGAAAGGGTGGAGAGCTGACACATGATGAAACCACTATCATTGCTGGAGCACTTGAACTTACTGAGAAAACTGCTAGTGATGCCATGACTCCCATATCTGAAACTTTTGTAATTGACATTAATGCAAAGCTCGATAGGAAGTTAATGAATCTCGTTTTGGCGAAAGGGCATAGCAGGGTGCCAGTTTATTATGAAGAACAAACCAACATAATCGGTCTTATCCTGGTTAAAAATTTGCTTACAATCCACCCAGATGATGAAATTCCAGTAAAGAATGTTACCATTCGAAGGATTCCAAGAGTTCCAGAAACTTTGCCATTATATGACATTCTGAATGAGTTCCAGAAAGGTCATAGCCACATGGCCATTGTTGTTAAACAATGCAACAAAATGAATGGAAAACCCGATGAACAACCGGGCGACGATTCCCAGAAAGAAGTTAGAATTGATGTGGATGGTGAAAGGCCTTCACCTTCCCAAGAAAAAGCCATGAAGATTAAGACATCATTTAAGAAGTGTAAAAGCTTTCCCGCAAATAGTTCATTCAGGAGTGGTAGTTCTAGAAGCAAGAAATGGACAAAAGATATGTACTCTGATATCCTAAAAATAGATGAAAACCCTCTCCCTAAGCTCGTCGAGGAAGAAGCTGTGGGTGTTATAACAATGGAAGATGTCATTGAAGAACTATTACAGGAGGAGATATTTGATGAGACTGATCATCAAACAGAGGACTCATGATAGTTGTTAGTCAAAGAATGAATGGTATGTTCTAAATTATATACGTACATAGAAAATATTTGTTGAATATGTTGAGGCTCTTAATAGATGTTGGAACCCACGAAATGGTAAAGCTAAGCTTTTGAGGGTTAAGGAGAGAAGTTTGTGTTCGTATTATACAACCAGAGAATGAGTATTTTAGATGTTTAAGAGATGAAAAGACAGTCTCAAAGTAAATAGAAGAGAAAACTCAAATTTTAGAAGCTTTTGAGGGTTAAATTAGTGTATGAGCTCGATTCAACTTCGTTCTAATTTAGTTTATGCGGTTTTTAGAAGTTTTTTCCAGTTGTGCTTTGGTTAAATCTCTCCAAATCAAAGGTCAAGAATGTTCATGAATGTTCATGAGCTAGGTTGGATTGGGTTGAGTTGAGTTTCAGATATTTTTTTTAGACTCAACTTACAGGAACCGTAACTCATTCACAATGGTATGATATTTGTCTACGTTGAGCATCCCTTGACGACGCCTGTTTTTGGAGCATATTGCTAGTTTAATAGAGTTTAATCACGGATTTTCACCATGTTATACCTTCGAGACTCTTAACTTCTTTGCTTGACACTTGAGAATTTTAAGTTAGGGCATGACTCTCATACCATATCATAGGA

Coding sequence (CDS)

ATGGCCGTTGATTACGAATGCTGCAGCACTAATTTCTTCATCCACATACTCATCATCGCCTTCTTGGTCATTTTTGCCGGCTTGATGTCCGGCCTCACCCTCGGCCTTATGTCGATGAGCCTCGTTGATCTCGAAGTTCTTGCCAAGTCTGGCACCCCCAAGGATCGCAAGTACGCCGCAAAGATAATGCCAGTAGTAAAAAACCAGCATTTATTGCTGTGCACCCTACTGATTTGCAATGCAGCAGCCATGGAGGCACTTCCTATTTTCCTTGACAGTCTAGTTACAGCTTGGGGTGCTGTTTTAATCTCGGTGACATTGATTCTTCTATTTGGCGAGATTATACCACAATCTGTTTGCTCTCGTTATGGTTTGGCAATTGGATCAACAGTTGCTCCTTTTGTTCGTGTTCTTGTCTGGATTTGCTTCCCTATTGCATATCCAATCAGCAAGTTACTAGACTTTCTACTGGGCCATGGACATGTGGCCCTTTTCCGTAGAGCTGAGCTCAAAGCACTTGTGAATATGCACGGTAATGAGGCTGGAAAGGGTGGAGAGCTGACACATGATGAAACCACTATCATTGCTGGAGCACTTGAACTTACTGAGAAAACTGCTAGTGATGCCATGACTCCCATATCTGAAACTTTTGTAATTGACATTAATGCAAAGCTCGATAGGAAGTTAATGAATCTCGTTTTGGCGAAAGGGCATAGCAGGGTGCCAGTTTATTATGAAGAACAAACCAACATAATCGGTCTTATCCTGGTTAAAAATTTGCTTACAATCCACCCAGATGATGAAATTCCAGTAAAGAATGTTACCATTCGAAGGATTCCAAGAGTTCCAGAAACTTTGCCATTATATGACATTCTGAATGAGTTCCAGAAAGGTCATAGCCACATGGCCATTGTTGTTAAACAATGCAACAAAATGAATGGAAAACCCGATGAACAACCGGGCGACGATTCCCAGAAAGAAGTTAGAATTGATGTGGATGGTGAAAGGCCTTCACCTTCCCAAGAAAAAGCCATGAAGATTAAGACATCATTTAAGAAGTGTAAAAGCTTTCCCGCAAATAGTTCATTCAGGAGTGGTAGTTCTAGAAGCAAGAAATGGACAAAAGATATGTACTCTGATATCCTAAAAATAGATGAAAACCCTCTCCCTAAGCTCGTCGAGGAAGAAGCTGTGGGTGTTATAACAATGGAAGATGTCATTGAAGAACTATTACAGGAGGAGATATTTGATGAGACTGATCATCAAACAGAGGACTCATGA

Protein sequence

MAVDYECCSTNFFIHILIIAFLVIFAGLMSGLTLGLMSMSLVDLEVLAKSGTPKDRKYAAKIMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVCSRYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNMHGNEAGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRKLMNLVLAKGHSRVPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHSHMAIVVKQCNKMNGKPDEQPGDDSQKEVRIDVDGERPSPSQEKAMKIKTSFKKCKSFPANSSFRSGSSRSKKWTKDMYSDILKIDENPLPKLVEEEAVGVITMEDVIEELLQEEIFDETDHQTEDS
Homology
BLAST of Cp4.1LG04g07500 vs. ExPASy Swiss-Prot
Match: Q9ZQR4 (DUF21 domain-containing protein At2g14520 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF3 PE=2 SV=2)

HSP 1 Score: 631.3 bits (1627), Expect = 7.8e-180
Identity = 337/426 (79.11%), Postives = 382/426 (89.67%), Query Frame = 0

Query: 1   MAVDYECCSTNFFIHILIIAFLVIFAGLMSGLTLGLMSMSLVDLEVLAKSGTPKDRKYAA 60
           MAV+YECC T+FFIHI +I  LV+FAGLMSGLTLGLMSMSLVDLEVLAKSGTP+DR +AA
Sbjct: 1   MAVEYECCGTSFFIHIAVIVLLVLFAGLMSGLTLGLMSMSLVDLEVLAKSGTPRDRIHAA 60

Query: 61  KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVC 120
           KI+PVVKNQHLLLCTLLICNAAAMEALPIFLD+LVTAWGA+LISVTLILLFGEIIPQSVC
Sbjct: 61  KILPVVKNQHLLLCTLLICNAAAMEALPIFLDALVTAWGAILISVTLILLFGEIIPQSVC 120

Query: 121 SRYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNMHGNE 180
           SR+GLAIG+TVAPFVRVLVWIC P+A+PISKLLDFLLGHG VALFRRAELK LV++HGNE
Sbjct: 121 SRHGLAIGATVAPFVRVLVWICLPVAWPISKLLDFLLGHGRVALFRRAELKTLVDLHGNE 180

Query: 181 AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRKLMNLVLAKGHSR 240
           AGKGGELTHDETTIIAGALEL+EK A DAMTPIS+TFVIDINAKLDR LMNL+L KGHSR
Sbjct: 181 AGKGGELTHDETTIIAGALELSEKMAKDAMTPISDTFVIDINAKLDRDLMNLILDKGHSR 240

Query: 241 VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300
           VPVYYE++TNIIGL+LVKNLLTI+PD+EI VKNVTIRRIPRVPETLPLYDILNEFQKGHS
Sbjct: 241 VPVYYEQRTNIIGLVLVKNLLTINPDEEIQVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300

Query: 301 HMAIVVKQCNKMNGKPDEQPGDDSQKEVRIDVDGERPSPSQEKAMKIKTSFKKCKSFPAN 360
           HMA+VV+QC+K++        +++  EVR+DVD ER SP QE  +K + S +K KSFP  
Sbjct: 301 HMAVVVRQCDKIHPLQSNDAANETVNEVRVDVDYER-SP-QETKLKRRRSLQKWKSFPNR 360

Query: 361 SSFRSGSSRSKKWTKDMYSDILKIDENPLPKLVEEE-AVGVITMEDVIEELLQEEIFDET 420
           ++  S  SRSK+W+KD  +DIL+++E+PLPKL EEE AVG+ITMEDVIEELLQEEIFDET
Sbjct: 361 AN--SLGSRSKRWSKDNDADILQLNEHPLPKLDEEEDAVGIITMEDVIEELLQEEIFDET 420

Query: 421 DHQTED 426
           DH  ED
Sbjct: 421 DHHFED 422

BLAST of Cp4.1LG04g07500 vs. ExPASy Swiss-Prot
Match: Q8VZI2 (DUF21 domain-containing protein At4g33700 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF6 PE=1 SV=1)

HSP 1 Score: 622.9 bits (1605), Expect = 2.8e-177
Identity = 331/427 (77.52%), Postives = 371/427 (86.89%), Query Frame = 0

Query: 1   MAVDYECCSTNFFIHILIIAFLVIFAGLMSGLTLGLMSMSLVDLEVLAKSGTPKDRKYAA 60
           MAV+Y CCS NFFIHI +I FLV+FAGLMSGLTLGLMS+SLVDLEVLAKSGTP+ RKYAA
Sbjct: 1   MAVEYVCCSPNFFIHIAVIVFLVLFAGLMSGLTLGLMSLSLVDLEVLAKSGTPEHRKYAA 60

Query: 61  KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVC 120
           KI+PVVKNQHLLL TLLICNAAAME LPIFLD LVTAWGA+LISVTLILLFGEIIPQS+C
Sbjct: 61  KILPVVKNQHLLLVTLLICNAAAMETLPIFLDGLVTAWGAILISVTLILLFGEIIPQSIC 120

Query: 121 SRYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNMHGNE 180
           SRYGLAIG+TVAPFVRVLV+IC P+A+PISKLLDFLLGH   ALFRRAELK LV+ HGNE
Sbjct: 121 SRYGLAIGATVAPFVRVLVFICLPVAWPISKLLDFLLGHRRAALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRKLMNLVLAKGHSR 240
           AGKGGELTHDETTIIAGALEL+EK   DAMTPIS+ FVIDINAKLDR LMNL+L KGHSR
Sbjct: 181 AGKGGELTHDETTIIAGALELSEKMVKDAMTPISDIFVIDINAKLDRDLMNLILEKGHSR 240

Query: 241 VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300
           VPVYYE+ TNIIGL+LVKNLLTI+PD+EIPVKNVTIRRIPRVPE LPLYDILNEFQKG S
Sbjct: 241 VPVYYEQPTNIIGLVLVKNLLTINPDEEIPVKNVTIRRIPRVPEILPLYDILNEFQKGLS 300

Query: 301 HMAIVVKQCNKMNGKPDEQPGDDSQKEVRIDVDGERPSPSQEKAMKIKTSFKKCKSFPAN 360
           HMA+VV+QC+K++  P +   + S KE R+DVD E     QE+ ++ K S +K KSFP  
Sbjct: 301 HMAVVVRQCDKIHPLPSK---NGSVKEARVDVDSEGTPTPQERMLRTKRSLQKWKSFPNR 360

Query: 361 SSFRSGSSRSKKWTKDMYSDILKIDENPLPKLV-EEEAVGVITMEDVIEELLQEEIFDET 420
           +S   G S+SKKW+KD  +DIL+++ NPLPKL  EEEAVG+ITMEDVIEELLQEEIFDET
Sbjct: 361 ASSFKGGSKSKKWSKDNDADILQLNGNPLPKLAEEEEAVGIITMEDVIEELLQEEIFDET 420

Query: 421 DHQTEDS 427
           DH  EDS
Sbjct: 421 DHHFEDS 424

BLAST of Cp4.1LG04g07500 vs. ExPASy Swiss-Prot
Match: Q8RY60 (DUF21 domain-containing protein At1g47330 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF7 PE=1 SV=1)

HSP 1 Score: 430.3 bits (1105), Expect = 2.6e-119
Identity = 244/438 (55.71%), Postives = 311/438 (71.00%), Query Frame = 0

Query: 1   MAVDYECCSTNFFIHILIIAFLVIFAGLMSGLTLGLMSMSLVDLEVLAKSGTPKDRKYAA 60
           M+ D  CC T F ++++II  LV FAGLM+GLTLGLMS+ LVDLEVL KSG P+DR  A 
Sbjct: 1   MSSDIPCCGTTFSLYVVIIIALVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRINAG 60

Query: 61  KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVC 120
           KI PVVKNQHLLLCTLLI N+ AMEALPIFLD +V  W A+L+SVTLIL+FGEI+PQ+VC
Sbjct: 61  KIFPVVKNQHLLLCTLLIGNSMAMEALPIFLDKIVPPWLAILLSVTLILVFGEIMPQAVC 120

Query: 121 SRYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNMHGNE 180
           +RYGL +G+ +APFVRVL+ + FPI+YPISK+LD++LG GH  L RRAELK  VN HGNE
Sbjct: 121 TRYGLKVGAIMAPFVRVLLVLFFPISYPISKVLDWMLGKGHGVLLRRAELKTFVNFHGNE 180

Query: 181 AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRKLMNLVLAKGHSR 240
           AGKGG+LT DET+II GALELTEKTA DAMTPIS  F ++++  L+ + +N +++ GHSR
Sbjct: 181 AGKGGDLTTDETSIITGALELTEKTAKDAMTPISNAFSLELDTPLNLETLNTIMSVGHSR 240

Query: 241 VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300
           VPVY+   T+IIGLILVKNLL +    E+P++ +++R+IPRV ET+PLYDILNEFQKGHS
Sbjct: 241 VPVYFRNPTHIIGLILVKNLLAVDARKEVPLRKMSMRKIPRVSETMPLYDILNEFQKGHS 300

Query: 301 HMAIVVKQCNKMNGKPD------EQPGDDSQKEVRIDVDGERPSP----SQEKAMKIKTS 360
           H+A+V K  ++    P+      E+  +   K+        +P      S+++  KI+T 
Sbjct: 301 HIAVVYKDLDEQEQSPETSENGIERRKNKKTKDELFKDSCRKPKAQFEVSEKEVFKIETG 360

Query: 361 FKKCKSFPANSSFRSGSSR-------SKKWTKDMYSDILKIDENPLPKL-VEEEAVGVIT 420
             K      N   + GS +       +KK  +     IL I+  P+P     EE VGVIT
Sbjct: 361 DAK-SGKSENGEEQQGSGKTSLLAAPAKKRHRGCSFCILDIENTPIPDFPTNEEVVGVIT 420

BLAST of Cp4.1LG04g07500 vs. ExPASy Swiss-Prot
Match: Q9LTD8 (DUF21 domain-containing protein At5g52790 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF5 PE=2 SV=2)

HSP 1 Score: 406.4 bits (1043), Expect = 4.1e-112
Identity = 233/424 (54.95%), Postives = 298/424 (70.28%), Query Frame = 0

Query: 2   AVDYECCSTNFFIHILIIAFLVIFAGLMSGLTLGLMSMSLVDLEVLAKSGTPKDRKYAAK 61
           A D  CC T F++++L+   LV+FAGLMSGLTLGLMS+S+V+LEV+ K+G P DRK A K
Sbjct: 3   ANDVPCCETMFWVYLLVCVALVVFAGLMSGLTLGLMSLSIVELEVMIKAGEPHDRKNAEK 62

Query: 62  IMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVCS 121
           I+P+VKNQHLLLCTLLI NA AMEALPIF+DSL+ AWGA+LISVTLIL FGEIIPQ+VCS
Sbjct: 63  ILPLVKNQHLLLCTLLIGNALAMEALPIFVDSLLPAWGAILISVTLILAFGEIIPQAVCS 122

Query: 122 RYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNMHGNEA 181
           RYGL+IG+ ++  VR+++ + FP++YPISKLLD LLG  H  L  RAELK+LV MHGNEA
Sbjct: 123 RYGLSIGAKLSFLVRLIIIVFFPLSYPISKLLDLLLGKRHSTLLGRAELKSLVYMHGNEA 182

Query: 182 GKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRKLMNLVLAKGHSRV 241
           GKGGELTHDETTII+GAL++++K+A DAMTP+S+ F +DIN KLD K M L+ + GHSR+
Sbjct: 183 GKGGELTHDETTIISGALDMSQKSAKDAMTPVSQIFSLDINFKLDEKTMGLIASAGHSRI 242

Query: 242 PVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHSH 301
           P+Y      IIG ILVKNL+ + P+DE  ++++ IRR+P+V   LPLYDILN FQ G SH
Sbjct: 243 PIYSVNPNVIIGFILVKNLIKVRPEDETSIRDLPIRRMPKVDLNLPLYDILNIFQTGRSH 302

Query: 302 MAIVVKQCNKMNGKPDEQPGDDSQKEVRIDVDGERPSPSQEKAMKIKTSFKKCKSFPA-N 361
           MA VV   N  N                +       SP+++  + +        S PA N
Sbjct: 303 MAAVVGTKNHTN------------TNTPVHEKSINGSPNKDANVFL--------SIPALN 362

Query: 362 SSFRSGSSRSKKWTKDMYSDILKIDENPLPKLVEEEAVGVITMEDVIEELLQEEIFDETD 421
           SS  S  S  +      Y D +  DE       +EE +G+IT+EDV+EEL+QEEI+DETD
Sbjct: 363 SSETSHQSPIR------YIDSIS-DE-------DEEVIGIITLEDVMEELIQEEIYDETD 392

Query: 422 HQTE 425
              E
Sbjct: 423 QYVE 392

BLAST of Cp4.1LG04g07500 vs. ExPASy Swiss-Prot
Match: Q67XQ0 (DUF21 domain-containing protein At4g14240 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF1 PE=1 SV=1)

HSP 1 Score: 377.9 bits (969), Expect = 1.5e-103
Identity = 216/411 (52.55%), Postives = 280/411 (68.13%), Query Frame = 0

Query: 18  IIAFLVIFAGLMSGLTLGLMSMSLVDLEVLAKSGTPKDRKYAAKIMPVVKNQHLLLCTLL 77
           I  FLV+FAG+MSGLTLGLMS+ LV+LE+L +SGTP ++K AA I PVV+ QH LL TLL
Sbjct: 41  ISCFLVLFAGIMSGLTLGLMSLGLVELEILQRSGTPNEKKQAAAIFPVVQKQHQLLVTLL 100

Query: 78  ICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVCSRYGLAIGSTVAPFVRV 137
           +CNA AME LPI+LD L   + A+++SVT +L FGE+IPQ++C+RYGLA+G+     VR+
Sbjct: 101 LCNAMAMEGLPIYLDKLFNEYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWLVRI 160

Query: 138 LVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNMHGNEAGKGGELTHDETTIIAG 197
           L+ +C+PIA+PI K+LD +LGH   ALFRRA+LKALV++H  EAGKGGELTHDETTII+G
Sbjct: 161 LMTLCYPIAFPIGKILDLVLGHND-ALFRRAQLKALVSIHSQEAGKGGELTHDETTIISG 220

Query: 198 ALELTEKTASDAMTPISETFVIDINAKLDRKLMNLVLAKGHSRVPVYYEEQTNIIGLILV 257
           AL+LTEKTA +AMTPI  TF +D+N+KLD + M  +LA+GHSRVPVY     N+IGL+LV
Sbjct: 221 ALDLTEKTAQEAMTPIESTFSLDVNSKLDWEAMGKILARGHSRVPVYSGNPKNVIGLLLV 280

Query: 258 KNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHSHMAIVVKQCNKMNGKP- 317
           K+LLT+ P+ E  V  V IRRIPRVP  +PLYDILNEFQKG SHMA VVK   K    P 
Sbjct: 281 KSLLTVRPETETLVSAVCIRRIPRVPADMPLYDILNEFQKGSSHMAAVVKVKGKSKVPPS 340

Query: 318 ---DEQPGDDSQKEVRIDVDGERPSPSQEKAMKIKTSFKKCKSFPANSSFRSGSSRSKKW 377
              +E   + +  ++   +  +R        + I  +  +   F  N S   G S + + 
Sbjct: 341 TLLEEHTDESNDSDLTAPLLLKREGNHDNVIVTIDKANGQ-SFFQNNESGPHGFSHTSEA 400

Query: 378 TKDMYSDILKIDENPLPKLVEEEAVGVITMEDVIEELLQEEIFDETDHQTE 425
            +D                   E +G+IT+EDV EELLQEEI DETD   +
Sbjct: 401 IED------------------GEVIGIITLEDVFEELLQEEIVDETDEYVD 431

BLAST of Cp4.1LG04g07500 vs. NCBI nr
Match: XP_023529505.1 (DUF21 domain-containing protein At2g14520 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 811 bits (2095), Expect = 1.18e-295
Identity = 426/426 (100.00%), Postives = 426/426 (100.00%), Query Frame = 0

Query: 1   MAVDYECCSTNFFIHILIIAFLVIFAGLMSGLTLGLMSMSLVDLEVLAKSGTPKDRKYAA 60
           MAVDYECCSTNFFIHILIIAFLVIFAGLMSGLTLGLMSMSLVDLEVLAKSGTPKDRKYAA
Sbjct: 1   MAVDYECCSTNFFIHILIIAFLVIFAGLMSGLTLGLMSMSLVDLEVLAKSGTPKDRKYAA 60

Query: 61  KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVC 120
           KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVC
Sbjct: 61  KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVC 120

Query: 121 SRYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNMHGNE 180
           SRYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNMHGNE
Sbjct: 121 SRYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNMHGNE 180

Query: 181 AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRKLMNLVLAKGHSR 240
           AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRKLMNLVLAKGHSR
Sbjct: 181 AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRKLMNLVLAKGHSR 240

Query: 241 VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300
           VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS
Sbjct: 241 VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300

Query: 301 HMAIVVKQCNKMNGKPDEQPGDDSQKEVRIDVDGERPSPSQEKAMKIKTSFKKCKSFPAN 360
           HMAIVVKQCNKMNGKPDEQPGDDSQKEVRIDVDGERPSPSQEKAMKIKTSFKKCKSFPAN
Sbjct: 301 HMAIVVKQCNKMNGKPDEQPGDDSQKEVRIDVDGERPSPSQEKAMKIKTSFKKCKSFPAN 360

Query: 361 SSFRSGSSRSKKWTKDMYSDILKIDENPLPKLVEEEAVGVITMEDVIEELLQEEIFDETD 420
           SSFRSGSSRSKKWTKDMYSDILKIDENPLPKLVEEEAVGVITMEDVIEELLQEEIFDETD
Sbjct: 361 SSFRSGSSRSKKWTKDMYSDILKIDENPLPKLVEEEAVGVITMEDVIEELLQEEIFDETD 420

Query: 421 HQTEDS 426
           HQTEDS
Sbjct: 421 HQTEDS 426

BLAST of Cp4.1LG04g07500 vs. NCBI nr
Match: XP_022927831.1 (DUF21 domain-containing protein At2g14520 isoform X2 [Cucurbita moschata])

HSP 1 Score: 799 bits (2064), Expect = 6.25e-291
Identity = 416/426 (97.65%), Postives = 423/426 (99.30%), Query Frame = 0

Query: 1   MAVDYECCSTNFFIHILIIAFLVIFAGLMSGLTLGLMSMSLVDLEVLAKSGTPKDRKYAA 60
           MAV+YECCSTNFFIH+LIIAFLVIFAGLMSGLTLGLMSMS+VDLEVLAKSGTPKDRKYAA
Sbjct: 1   MAVEYECCSTNFFIHVLIIAFLVIFAGLMSGLTLGLMSMSIVDLEVLAKSGTPKDRKYAA 60

Query: 61  KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVC 120
           KIMPVVKNQHLLLCTLLICNAAAMEALPIFLD LVTAWGAVLISVTLILLFGEIIPQSVC
Sbjct: 61  KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDRLVTAWGAVLISVTLILLFGEIIPQSVC 120

Query: 121 SRYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNMHGNE 180
           SRYGLAIGST+APFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVN+HGNE
Sbjct: 121 SRYGLAIGSTIAPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNLHGNE 180

Query: 181 AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRKLMNLVLAKGHSR 240
           AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDR LMNLVLAKGHSR
Sbjct: 181 AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRNLMNLVLAKGHSR 240

Query: 241 VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300
           VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS
Sbjct: 241 VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300

Query: 301 HMAIVVKQCNKMNGKPDEQPGDDSQKEVRIDVDGERPSPSQEKAMKIKTSFKKCKSFPAN 360
           HMAIVVKQCNKMNGKPDEQPG+DSQKEVRIDVDGERPSPSQEK MKIKTSFKKCKSFPAN
Sbjct: 301 HMAIVVKQCNKMNGKPDEQPGNDSQKEVRIDVDGERPSPSQEKTMKIKTSFKKCKSFPAN 360

Query: 361 SSFRSGSSRSKKWTKDMYSDILKIDENPLPKLVEEEAVGVITMEDVIEELLQEEIFDETD 420
           SSFRSGSSRSKKWTKDMYSDILKIDENPLPKLVEEEAVG+ITMEDVIEELLQEEIFDETD
Sbjct: 361 SSFRSGSSRSKKWTKDMYSDILKIDENPLPKLVEEEAVGIITMEDVIEELLQEEIFDETD 420

Query: 421 HQTEDS 426
           HQTEDS
Sbjct: 421 HQTEDS 426

BLAST of Cp4.1LG04g07500 vs. NCBI nr
Match: KAG6588726.1 (DUF21 domain-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 799 bits (2064), Expect = 6.25e-291
Identity = 417/426 (97.89%), Postives = 422/426 (99.06%), Query Frame = 0

Query: 1   MAVDYECCSTNFFIHILIIAFLVIFAGLMSGLTLGLMSMSLVDLEVLAKSGTPKDRKYAA 60
           MAV+YECCSTNFFIH+LIIAFLVIFAGLMSGLTLGLMSMS+VDLEVLAKSGTPKDRKYAA
Sbjct: 1   MAVEYECCSTNFFIHVLIIAFLVIFAGLMSGLTLGLMSMSIVDLEVLAKSGTPKDRKYAA 60

Query: 61  KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVC 120
           KIMPVVKNQHLLLCTLLICNAAAMEALPIFLD LVTAWGAVLISVTLILLFGEIIPQSVC
Sbjct: 61  KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDRLVTAWGAVLISVTLILLFGEIIPQSVC 120

Query: 121 SRYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNMHGNE 180
           SRYGLAIGST+APFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVN+HGNE
Sbjct: 121 SRYGLAIGSTIAPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNLHGNE 180

Query: 181 AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRKLMNLVLAKGHSR 240
           AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDR LMNLVLAKGHSR
Sbjct: 181 AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRNLMNLVLAKGHSR 240

Query: 241 VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300
           VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS
Sbjct: 241 VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300

Query: 301 HMAIVVKQCNKMNGKPDEQPGDDSQKEVRIDVDGERPSPSQEKAMKIKTSFKKCKSFPAN 360
           HMAIVVKQCNKMNGKPDEQPGDDSQKEVRIDVDGERPSPSQEK MKIKTSFKKCKSFPAN
Sbjct: 301 HMAIVVKQCNKMNGKPDEQPGDDSQKEVRIDVDGERPSPSQEKTMKIKTSFKKCKSFPAN 360

Query: 361 SSFRSGSSRSKKWTKDMYSDILKIDENPLPKLVEEEAVGVITMEDVIEELLQEEIFDETD 420
           SS RSGSSRSKKWTKDMYSDILKIDENPLPKLVEEEAVGVITMEDVIEELLQEEIFDETD
Sbjct: 361 SSLRSGSSRSKKWTKDMYSDILKIDENPLPKLVEEEAVGVITMEDVIEELLQEEIFDETD 420

Query: 421 HQTEDS 426
           HQTEDS
Sbjct: 421 HQTEDS 426

BLAST of Cp4.1LG04g07500 vs. NCBI nr
Match: KAG7022511.1 (DUF21 domain-containing protein [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 798 bits (2062), Expect = 1.26e-290
Identity = 418/426 (98.12%), Postives = 421/426 (98.83%), Query Frame = 0

Query: 1   MAVDYECCSTNFFIHILIIAFLVIFAGLMSGLTLGLMSMSLVDLEVLAKSGTPKDRKYAA 60
           MAV+YECCSTNFFIHILIIAFLVIFAGLMSGLTLGLMSMS+VDLEVLAKSGTPKDRKYAA
Sbjct: 1   MAVEYECCSTNFFIHILIIAFLVIFAGLMSGLTLGLMSMSIVDLEVLAKSGTPKDRKYAA 60

Query: 61  KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVC 120
           KIMPVVKNQHLLLCTLLICNAAAMEALPIFLD LVTAWGAVLISVTLILLFGEIIPQSVC
Sbjct: 61  KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDRLVTAWGAVLISVTLILLFGEIIPQSVC 120

Query: 121 SRYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNMHGNE 180
           SRYGLAIGSTV PFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVN+HGNE
Sbjct: 121 SRYGLAIGSTVTPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNLHGNE 180

Query: 181 AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRKLMNLVLAKGHSR 240
           AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDR LMNLVLAKGHSR
Sbjct: 181 AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRNLMNLVLAKGHSR 240

Query: 241 VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300
           VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS
Sbjct: 241 VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300

Query: 301 HMAIVVKQCNKMNGKPDEQPGDDSQKEVRIDVDGERPSPSQEKAMKIKTSFKKCKSFPAN 360
           HMAIVVKQCNKMNGKPDEQPGDDSQKEVRIDVDGERPSPSQEK MKIKTSFKKCKSFPAN
Sbjct: 301 HMAIVVKQCNKMNGKPDEQPGDDSQKEVRIDVDGERPSPSQEKTMKIKTSFKKCKSFPAN 360

Query: 361 SSFRSGSSRSKKWTKDMYSDILKIDENPLPKLVEEEAVGVITMEDVIEELLQEEIFDETD 420
           SS RSGSSRSKKWTKDMYSDILKIDENPLPKLVEEEAVGVITMEDVIEELLQEEIFDETD
Sbjct: 361 SSLRSGSSRSKKWTKDMYSDILKIDENPLPKLVEEEAVGVITMEDVIEELLQEEIFDETD 420

Query: 421 HQTEDS 426
           HQTEDS
Sbjct: 421 HQTEDS 426

BLAST of Cp4.1LG04g07500 vs. NCBI nr
Match: XP_022988772.1 (DUF21 domain-containing protein At2g14520 isoform X2 [Cucurbita maxima])

HSP 1 Score: 778 bits (2008), Expect = 2.22e-282
Identity = 411/427 (96.25%), Postives = 419/427 (98.13%), Query Frame = 0

Query: 1   MAVDYECCSTNFFIHILIIAFLVIFAGLMSGLTLGLMSMSLVDLEVLAKSGTPKDRKYAA 60
           MAV+YECCSTNFFIHILIIAFLVIFAG+MSGLTLGLMSMS+VDLEVLAKSGTPKDR+YAA
Sbjct: 1   MAVEYECCSTNFFIHILIIAFLVIFAGMMSGLTLGLMSMSIVDLEVLAKSGTPKDRRYAA 60

Query: 61  KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVC 120
           KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVC
Sbjct: 61  KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVC 120

Query: 121 SRYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNMHGNE 180
           SRYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLG+GHVALFRRAELKALVN+HGNE
Sbjct: 121 SRYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLGYGHVALFRRAELKALVNLHGNE 180

Query: 181 AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRKLMNLVLAKGHSR 240
           AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDR LMNLVLAKGHSR
Sbjct: 181 AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRNLMNLVLAKGHSR 240

Query: 241 VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300
           VPVYYEEQTNIIGLILVKNLLTIHPDDEI VKNVTIRRIPRVPETLPLYDILNEFQKGHS
Sbjct: 241 VPVYYEEQTNIIGLILVKNLLTIHPDDEILVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300

Query: 301 HMAIVVKQCNKMNGKPDEQPGDDSQKEVRIDVDGERPSPSQEKAMKIKTSFKKCKSFPAN 360
           HMAIVVKQCNKMNGK DEQ G+D QKEVRIDVDGERPSPSQEK MKIKTSFKKCKSFPAN
Sbjct: 301 HMAIVVKQCNKMNGKHDEQLGNDPQKEVRIDVDGERPSPSQEKTMKIKTSFKKCKSFPAN 360

Query: 361 SSFRSGSSRSKKWTKDMYSDILKIDENPLPKLVEEE-AVGVITMEDVIEELLQEEIFDET 420
           SSFRSG SRSKKWTKDMY+DILKIDENPLPKLVEEE AVGVITMEDVIEELLQEEIFDET
Sbjct: 361 SSFRSGGSRSKKWTKDMYTDILKIDENPLPKLVEEEEAVGVITMEDVIEELLQEEIFDET 420

Query: 421 DHQTEDS 426
           DHQTEDS
Sbjct: 421 DHQTEDS 427

BLAST of Cp4.1LG04g07500 vs. ExPASy TrEMBL
Match: A0A6J1EIM9 (DUF21 domain-containing protein At2g14520 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111434602 PE=4 SV=1)

HSP 1 Score: 799 bits (2064), Expect = 3.03e-291
Identity = 416/426 (97.65%), Postives = 423/426 (99.30%), Query Frame = 0

Query: 1   MAVDYECCSTNFFIHILIIAFLVIFAGLMSGLTLGLMSMSLVDLEVLAKSGTPKDRKYAA 60
           MAV+YECCSTNFFIH+LIIAFLVIFAGLMSGLTLGLMSMS+VDLEVLAKSGTPKDRKYAA
Sbjct: 1   MAVEYECCSTNFFIHVLIIAFLVIFAGLMSGLTLGLMSMSIVDLEVLAKSGTPKDRKYAA 60

Query: 61  KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVC 120
           KIMPVVKNQHLLLCTLLICNAAAMEALPIFLD LVTAWGAVLISVTLILLFGEIIPQSVC
Sbjct: 61  KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDRLVTAWGAVLISVTLILLFGEIIPQSVC 120

Query: 121 SRYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNMHGNE 180
           SRYGLAIGST+APFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVN+HGNE
Sbjct: 121 SRYGLAIGSTIAPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNLHGNE 180

Query: 181 AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRKLMNLVLAKGHSR 240
           AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDR LMNLVLAKGHSR
Sbjct: 181 AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRNLMNLVLAKGHSR 240

Query: 241 VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300
           VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS
Sbjct: 241 VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300

Query: 301 HMAIVVKQCNKMNGKPDEQPGDDSQKEVRIDVDGERPSPSQEKAMKIKTSFKKCKSFPAN 360
           HMAIVVKQCNKMNGKPDEQPG+DSQKEVRIDVDGERPSPSQEK MKIKTSFKKCKSFPAN
Sbjct: 301 HMAIVVKQCNKMNGKPDEQPGNDSQKEVRIDVDGERPSPSQEKTMKIKTSFKKCKSFPAN 360

Query: 361 SSFRSGSSRSKKWTKDMYSDILKIDENPLPKLVEEEAVGVITMEDVIEELLQEEIFDETD 420
           SSFRSGSSRSKKWTKDMYSDILKIDENPLPKLVEEEAVG+ITMEDVIEELLQEEIFDETD
Sbjct: 361 SSFRSGSSRSKKWTKDMYSDILKIDENPLPKLVEEEAVGIITMEDVIEELLQEEIFDETD 420

Query: 421 HQTEDS 426
           HQTEDS
Sbjct: 421 HQTEDS 426

BLAST of Cp4.1LG04g07500 vs. ExPASy TrEMBL
Match: A0A6J1JDZ6 (DUF21 domain-containing protein At2g14520 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111486014 PE=4 SV=1)

HSP 1 Score: 778 bits (2008), Expect = 1.07e-282
Identity = 411/427 (96.25%), Postives = 419/427 (98.13%), Query Frame = 0

Query: 1   MAVDYECCSTNFFIHILIIAFLVIFAGLMSGLTLGLMSMSLVDLEVLAKSGTPKDRKYAA 60
           MAV+YECCSTNFFIHILIIAFLVIFAG+MSGLTLGLMSMS+VDLEVLAKSGTPKDR+YAA
Sbjct: 1   MAVEYECCSTNFFIHILIIAFLVIFAGMMSGLTLGLMSMSIVDLEVLAKSGTPKDRRYAA 60

Query: 61  KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVC 120
           KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVC
Sbjct: 61  KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVC 120

Query: 121 SRYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNMHGNE 180
           SRYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLG+GHVALFRRAELKALVN+HGNE
Sbjct: 121 SRYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLGYGHVALFRRAELKALVNLHGNE 180

Query: 181 AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRKLMNLVLAKGHSR 240
           AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDR LMNLVLAKGHSR
Sbjct: 181 AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRNLMNLVLAKGHSR 240

Query: 241 VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300
           VPVYYEEQTNIIGLILVKNLLTIHPDDEI VKNVTIRRIPRVPETLPLYDILNEFQKGHS
Sbjct: 241 VPVYYEEQTNIIGLILVKNLLTIHPDDEILVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300

Query: 301 HMAIVVKQCNKMNGKPDEQPGDDSQKEVRIDVDGERPSPSQEKAMKIKTSFKKCKSFPAN 360
           HMAIVVKQCNKMNGK DEQ G+D QKEVRIDVDGERPSPSQEK MKIKTSFKKCKSFPAN
Sbjct: 301 HMAIVVKQCNKMNGKHDEQLGNDPQKEVRIDVDGERPSPSQEKTMKIKTSFKKCKSFPAN 360

Query: 361 SSFRSGSSRSKKWTKDMYSDILKIDENPLPKLVEEE-AVGVITMEDVIEELLQEEIFDET 420
           SSFRSG SRSKKWTKDMY+DILKIDENPLPKLVEEE AVGVITMEDVIEELLQEEIFDET
Sbjct: 361 SSFRSGGSRSKKWTKDMYTDILKIDENPLPKLVEEEEAVGVITMEDVIEELLQEEIFDET 420

Query: 421 DHQTEDS 426
           DHQTEDS
Sbjct: 421 DHQTEDS 427

BLAST of Cp4.1LG04g07500 vs. ExPASy TrEMBL
Match: A0A6J1EJ35 (DUF21 domain-containing protein At2g14520 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111434602 PE=4 SV=1)

HSP 1 Score: 774 bits (1998), Expect = 3.87e-281
Identity = 402/412 (97.57%), Postives = 409/412 (99.27%), Query Frame = 0

Query: 1   MAVDYECCSTNFFIHILIIAFLVIFAGLMSGLTLGLMSMSLVDLEVLAKSGTPKDRKYAA 60
           MAV+YECCSTNFFIH+LIIAFLVIFAGLMSGLTLGLMSMS+VDLEVLAKSGTPKDRKYAA
Sbjct: 1   MAVEYECCSTNFFIHVLIIAFLVIFAGLMSGLTLGLMSMSIVDLEVLAKSGTPKDRKYAA 60

Query: 61  KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVC 120
           KIMPVVKNQHLLLCTLLICNAAAMEALPIFLD LVTAWGAVLISVTLILLFGEIIPQSVC
Sbjct: 61  KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDRLVTAWGAVLISVTLILLFGEIIPQSVC 120

Query: 121 SRYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNMHGNE 180
           SRYGLAIGST+APFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVN+HGNE
Sbjct: 121 SRYGLAIGSTIAPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNLHGNE 180

Query: 181 AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRKLMNLVLAKGHSR 240
           AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDR LMNLVLAKGHSR
Sbjct: 181 AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRNLMNLVLAKGHSR 240

Query: 241 VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300
           VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS
Sbjct: 241 VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300

Query: 301 HMAIVVKQCNKMNGKPDEQPGDDSQKEVRIDVDGERPSPSQEKAMKIKTSFKKCKSFPAN 360
           HMAIVVKQCNKMNGKPDEQPG+DSQKEVRIDVDGERPSPSQEK MKIKTSFKKCKSFPAN
Sbjct: 301 HMAIVVKQCNKMNGKPDEQPGNDSQKEVRIDVDGERPSPSQEKTMKIKTSFKKCKSFPAN 360

Query: 361 SSFRSGSSRSKKWTKDMYSDILKIDENPLPKLVEEEAVGVITMEDVIEELLQ 412
           SSFRSGSSRSKKWTKDMYSDILKIDENPLPKLVEEEAVG+ITMEDVIEELLQ
Sbjct: 361 SSFRSGSSRSKKWTKDMYSDILKIDENPLPKLVEEEAVGIITMEDVIEELLQ 412

BLAST of Cp4.1LG04g07500 vs. ExPASy TrEMBL
Match: A0A6J1JI68 (DUF21 domain-containing protein At2g14520 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111486014 PE=4 SV=1)

HSP 1 Score: 752 bits (1942), Expect = 4.19e-272
Identity = 397/413 (96.13%), Postives = 405/413 (98.06%), Query Frame = 0

Query: 1   MAVDYECCSTNFFIHILIIAFLVIFAGLMSGLTLGLMSMSLVDLEVLAKSGTPKDRKYAA 60
           MAV+YECCSTNFFIHILIIAFLVIFAG+MSGLTLGLMSMS+VDLEVLAKSGTPKDR+YAA
Sbjct: 1   MAVEYECCSTNFFIHILIIAFLVIFAGMMSGLTLGLMSMSIVDLEVLAKSGTPKDRRYAA 60

Query: 61  KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVC 120
           KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVC
Sbjct: 61  KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVC 120

Query: 121 SRYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNMHGNE 180
           SRYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLG+GHVALFRRAELKALVN+HGNE
Sbjct: 121 SRYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLGYGHVALFRRAELKALVNLHGNE 180

Query: 181 AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRKLMNLVLAKGHSR 240
           AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDR LMNLVLAKGHSR
Sbjct: 181 AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRNLMNLVLAKGHSR 240

Query: 241 VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300
           VPVYYEEQTNIIGLILVKNLLTIHPDDEI VKNVTIRRIPRVPETLPLYDILNEFQKGHS
Sbjct: 241 VPVYYEEQTNIIGLILVKNLLTIHPDDEILVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300

Query: 301 HMAIVVKQCNKMNGKPDEQPGDDSQKEVRIDVDGERPSPSQEKAMKIKTSFKKCKSFPAN 360
           HMAIVVKQCNKMNGK DEQ G+D QKEVRIDVDGERPSPSQEK MKIKTSFKKCKSFPAN
Sbjct: 301 HMAIVVKQCNKMNGKHDEQLGNDPQKEVRIDVDGERPSPSQEKTMKIKTSFKKCKSFPAN 360

Query: 361 SSFRSGSSRSKKWTKDMYSDILKIDENPLPKLVEEE-AVGVITMEDVIEELLQ 412
           SSFRSG SRSKKWTKDMY+DILKIDENPLPKLVEEE AVGVITMEDVIEELLQ
Sbjct: 361 SSFRSGGSRSKKWTKDMYTDILKIDENPLPKLVEEEEAVGVITMEDVIEELLQ 413

BLAST of Cp4.1LG04g07500 vs. ExPASy TrEMBL
Match: A0A1S4E167 (DUF21 domain-containing protein At2g14520-like OS=Cucumis melo OX=3656 GN=LOC103495339 PE=4 SV=1)

HSP 1 Score: 704 bits (1817), Expect = 1.24e-253
Identity = 376/428 (87.85%), Postives = 398/428 (92.99%), Query Frame = 0

Query: 1   MAVDYECCSTNFFIHILIIAFLVIFAGLMSGLTLGLMSMSLVDLEVLAKSGTPKDRKYAA 60
           MAV+Y+CCS NFFIHILII  LV+FAGLMSGLTLGLMSMSLVDLEVLAKSGTPKDR +AA
Sbjct: 1   MAVEYQCCSPNFFIHILIIVLLVLFAGLMSGLTLGLMSMSLVDLEVLAKSGTPKDRIHAA 60

Query: 61  KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVC 120
           KI+PVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGA+LISVTLILLFGEIIPQSVC
Sbjct: 61  KILPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAILISVTLILLFGEIIPQSVC 120

Query: 121 SRYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNMHGNE 180
           SRYGLAIGSTVAPFVRVLVWICFP+AYPISKLLDFLLGHG VALFRRAELK LVN+HGNE
Sbjct: 121 SRYGLAIGSTVAPFVRVLVWICFPVAYPISKLLDFLLGHGRVALFRRAELKTLVNLHGNE 180

Query: 181 AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRKLMNLVLAKGHSR 240
           AGKGGELTHDETTIIAGALEL+EKTA DAMTPISETF IDINAKLDR LMNLVL KGHSR
Sbjct: 181 AGKGGELTHDETTIIAGALELSEKTAGDAMTPISETFAIDINAKLDRNLMNLVLEKGHSR 240

Query: 241 VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300
           VPVYYEE TNIIGLILVKNLLTIHPDDE+PVK+VTIRRIPRVPET+PLYDILNEFQKGHS
Sbjct: 241 VPVYYEEPTNIIGLILVKNLLTIHPDDEVPVKSVTIRRIPRVPETMPLYDILNEFQKGHS 300

Query: 301 HMAIVVKQCNKMNGKPDEQPGDDSQKEVRIDVDGERPSPSQEKAMKIKTSFKKCKSFPA- 360
           HMAIVVKQCNKMNGK D++ GDDSQ++VRIDVDGE+P   QEK +K K   +K KSFP  
Sbjct: 301 HMAIVVKQCNKMNGKSDDKTGDDSQRDVRIDVDGEKPP--QEKTLKNKRPLQKWKSFPTT 360

Query: 361 NSSFRSGSSRSKKWTKDMYSDILKIDENPLPKLVEEE-AVGVITMEDVIEELLQEEIFDE 420
           N+SFRSGS RSKKWTKDMYSDIL+ID +PLPKL EEE AVGVITMEDVIEELLQEEIFDE
Sbjct: 361 NNSFRSGS-RSKKWTKDMYSDILQIDGSPLPKLAEEEEAVGVITMEDVIEELLQEEIFDE 420

Query: 421 TDHQTEDS 426
           TDH  EDS
Sbjct: 421 TDHHFEDS 425

BLAST of Cp4.1LG04g07500 vs. TAIR 10
Match: AT2G14520.1 (CBS domain-containing protein with a domain of unknown function (DUF21) )

HSP 1 Score: 631.3 bits (1627), Expect = 5.5e-181
Identity = 337/426 (79.11%), Postives = 382/426 (89.67%), Query Frame = 0

Query: 1   MAVDYECCSTNFFIHILIIAFLVIFAGLMSGLTLGLMSMSLVDLEVLAKSGTPKDRKYAA 60
           MAV+YECC T+FFIHI +I  LV+FAGLMSGLTLGLMSMSLVDLEVLAKSGTP+DR +AA
Sbjct: 1   MAVEYECCGTSFFIHIAVIVLLVLFAGLMSGLTLGLMSMSLVDLEVLAKSGTPRDRIHAA 60

Query: 61  KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVC 120
           KI+PVVKNQHLLLCTLLICNAAAMEALPIFLD+LVTAWGA+LISVTLILLFGEIIPQSVC
Sbjct: 61  KILPVVKNQHLLLCTLLICNAAAMEALPIFLDALVTAWGAILISVTLILLFGEIIPQSVC 120

Query: 121 SRYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNMHGNE 180
           SR+GLAIG+TVAPFVRVLVWIC P+A+PISKLLDFLLGHG VALFRRAELK LV++HGNE
Sbjct: 121 SRHGLAIGATVAPFVRVLVWICLPVAWPISKLLDFLLGHGRVALFRRAELKTLVDLHGNE 180

Query: 181 AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRKLMNLVLAKGHSR 240
           AGKGGELTHDETTIIAGALEL+EK A DAMTPIS+TFVIDINAKLDR LMNL+L KGHSR
Sbjct: 181 AGKGGELTHDETTIIAGALELSEKMAKDAMTPISDTFVIDINAKLDRDLMNLILDKGHSR 240

Query: 241 VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300
           VPVYYE++TNIIGL+LVKNLLTI+PD+EI VKNVTIRRIPRVPETLPLYDILNEFQKGHS
Sbjct: 241 VPVYYEQRTNIIGLVLVKNLLTINPDEEIQVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300

Query: 301 HMAIVVKQCNKMNGKPDEQPGDDSQKEVRIDVDGERPSPSQEKAMKIKTSFKKCKSFPAN 360
           HMA+VV+QC+K++        +++  EVR+DVD ER SP QE  +K + S +K KSFP  
Sbjct: 301 HMAVVVRQCDKIHPLQSNDAANETVNEVRVDVDYER-SP-QETKLKRRRSLQKWKSFPNR 360

Query: 361 SSFRSGSSRSKKWTKDMYSDILKIDENPLPKLVEEE-AVGVITMEDVIEELLQEEIFDET 420
           ++  S  SRSK+W+KD  +DIL+++E+PLPKL EEE AVG+ITMEDVIEELLQEEIFDET
Sbjct: 361 AN--SLGSRSKRWSKDNDADILQLNEHPLPKLDEEEDAVGIITMEDVIEELLQEEIFDET 420

Query: 421 DHQTED 426
           DH  ED
Sbjct: 421 DHHFED 422

BLAST of Cp4.1LG04g07500 vs. TAIR 10
Match: AT4G33700.1 (CBS domain-containing protein with a domain of unknown function (DUF21) )

HSP 1 Score: 622.9 bits (1605), Expect = 2.0e-178
Identity = 331/427 (77.52%), Postives = 371/427 (86.89%), Query Frame = 0

Query: 1   MAVDYECCSTNFFIHILIIAFLVIFAGLMSGLTLGLMSMSLVDLEVLAKSGTPKDRKYAA 60
           MAV+Y CCS NFFIHI +I FLV+FAGLMSGLTLGLMS+SLVDLEVLAKSGTP+ RKYAA
Sbjct: 1   MAVEYVCCSPNFFIHIAVIVFLVLFAGLMSGLTLGLMSLSLVDLEVLAKSGTPEHRKYAA 60

Query: 61  KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVC 120
           KI+PVVKNQHLLL TLLICNAAAME LPIFLD LVTAWGA+LISVTLILLFGEIIPQS+C
Sbjct: 61  KILPVVKNQHLLLVTLLICNAAAMETLPIFLDGLVTAWGAILISVTLILLFGEIIPQSIC 120

Query: 121 SRYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNMHGNE 180
           SRYGLAIG+TVAPFVRVLV+IC P+A+PISKLLDFLLGH   ALFRRAELK LV+ HGNE
Sbjct: 121 SRYGLAIGATVAPFVRVLVFICLPVAWPISKLLDFLLGHRRAALFRRAELKTLVDFHGNE 180

Query: 181 AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRKLMNLVLAKGHSR 240
           AGKGGELTHDETTIIAGALEL+EK   DAMTPIS+ FVIDINAKLDR LMNL+L KGHSR
Sbjct: 181 AGKGGELTHDETTIIAGALELSEKMVKDAMTPISDIFVIDINAKLDRDLMNLILEKGHSR 240

Query: 241 VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300
           VPVYYE+ TNIIGL+LVKNLLTI+PD+EIPVKNVTIRRIPRVPE LPLYDILNEFQKG S
Sbjct: 241 VPVYYEQPTNIIGLVLVKNLLTINPDEEIPVKNVTIRRIPRVPEILPLYDILNEFQKGLS 300

Query: 301 HMAIVVKQCNKMNGKPDEQPGDDSQKEVRIDVDGERPSPSQEKAMKIKTSFKKCKSFPAN 360
           HMA+VV+QC+K++  P +   + S KE R+DVD E     QE+ ++ K S +K KSFP  
Sbjct: 301 HMAVVVRQCDKIHPLPSK---NGSVKEARVDVDSEGTPTPQERMLRTKRSLQKWKSFPNR 360

Query: 361 SSFRSGSSRSKKWTKDMYSDILKIDENPLPKLV-EEEAVGVITMEDVIEELLQEEIFDET 420
           +S   G S+SKKW+KD  +DIL+++ NPLPKL  EEEAVG+ITMEDVIEELLQEEIFDET
Sbjct: 361 ASSFKGGSKSKKWSKDNDADILQLNGNPLPKLAEEEEAVGIITMEDVIEELLQEEIFDET 420

Query: 421 DHQTEDS 427
           DH  EDS
Sbjct: 421 DHHFEDS 424

BLAST of Cp4.1LG04g07500 vs. TAIR 10
Match: AT1G47330.1 (CBS domain-containing protein with a domain of unknown function (DUF21) )

HSP 1 Score: 430.3 bits (1105), Expect = 1.9e-120
Identity = 244/438 (55.71%), Postives = 311/438 (71.00%), Query Frame = 0

Query: 1   MAVDYECCSTNFFIHILIIAFLVIFAGLMSGLTLGLMSMSLVDLEVLAKSGTPKDRKYAA 60
           M+ D  CC T F ++++II  LV FAGLM+GLTLGLMS+ LVDLEVL KSG P+DR  A 
Sbjct: 1   MSSDIPCCGTTFSLYVVIIIALVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRINAG 60

Query: 61  KIMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVC 120
           KI PVVKNQHLLLCTLLI N+ AMEALPIFLD +V  W A+L+SVTLIL+FGEI+PQ+VC
Sbjct: 61  KIFPVVKNQHLLLCTLLIGNSMAMEALPIFLDKIVPPWLAILLSVTLILVFGEIMPQAVC 120

Query: 121 SRYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNMHGNE 180
           +RYGL +G+ +APFVRVL+ + FPI+YPISK+LD++LG GH  L RRAELK  VN HGNE
Sbjct: 121 TRYGLKVGAIMAPFVRVLLVLFFPISYPISKVLDWMLGKGHGVLLRRAELKTFVNFHGNE 180

Query: 181 AGKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRKLMNLVLAKGHSR 240
           AGKGG+LT DET+II GALELTEKTA DAMTPIS  F ++++  L+ + +N +++ GHSR
Sbjct: 181 AGKGGDLTTDETSIITGALELTEKTAKDAMTPISNAFSLELDTPLNLETLNTIMSVGHSR 240

Query: 241 VPVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHS 300
           VPVY+   T+IIGLILVKNLL +    E+P++ +++R+IPRV ET+PLYDILNEFQKGHS
Sbjct: 241 VPVYFRNPTHIIGLILVKNLLAVDARKEVPLRKMSMRKIPRVSETMPLYDILNEFQKGHS 300

Query: 301 HMAIVVKQCNKMNGKPD------EQPGDDSQKEVRIDVDGERPSP----SQEKAMKIKTS 360
           H+A+V K  ++    P+      E+  +   K+        +P      S+++  KI+T 
Sbjct: 301 HIAVVYKDLDEQEQSPETSENGIERRKNKKTKDELFKDSCRKPKAQFEVSEKEVFKIETG 360

Query: 361 FKKCKSFPANSSFRSGSSR-------SKKWTKDMYSDILKIDENPLPKL-VEEEAVGVIT 420
             K      N   + GS +       +KK  +     IL I+  P+P     EE VGVIT
Sbjct: 361 DAK-SGKSENGEEQQGSGKTSLLAAPAKKRHRGCSFCILDIENTPIPDFPTNEEVVGVIT 420

BLAST of Cp4.1LG04g07500 vs. TAIR 10
Match: AT5G52790.1 (CBS domain-containing protein with a domain of unknown function (DUF21) )

HSP 1 Score: 406.4 bits (1043), Expect = 2.9e-113
Identity = 233/424 (54.95%), Postives = 298/424 (70.28%), Query Frame = 0

Query: 2   AVDYECCSTNFFIHILIIAFLVIFAGLMSGLTLGLMSMSLVDLEVLAKSGTPKDRKYAAK 61
           A D  CC T F++++L+   LV+FAGLMSGLTLGLMS+S+V+LEV+ K+G P DRK A K
Sbjct: 3   ANDVPCCETMFWVYLLVCVALVVFAGLMSGLTLGLMSLSIVELEVMIKAGEPHDRKNAEK 62

Query: 62  IMPVVKNQHLLLCTLLICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVCS 121
           I+P+VKNQHLLLCTLLI NA AMEALPIF+DSL+ AWGA+LISVTLIL FGEIIPQ+VCS
Sbjct: 63  ILPLVKNQHLLLCTLLIGNALAMEALPIFVDSLLPAWGAILISVTLILAFGEIIPQAVCS 122

Query: 122 RYGLAIGSTVAPFVRVLVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNMHGNEA 181
           RYGL+IG+ ++  VR+++ + FP++YPISKLLD LLG  H  L  RAELK+LV MHGNEA
Sbjct: 123 RYGLSIGAKLSFLVRLIIIVFFPLSYPISKLLDLLLGKRHSTLLGRAELKSLVYMHGNEA 182

Query: 182 GKGGELTHDETTIIAGALELTEKTASDAMTPISETFVIDINAKLDRKLMNLVLAKGHSRV 241
           GKGGELTHDETTII+GAL++++K+A DAMTP+S+ F +DIN KLD K M L+ + GHSR+
Sbjct: 183 GKGGELTHDETTIISGALDMSQKSAKDAMTPVSQIFSLDINFKLDEKTMGLIASAGHSRI 242

Query: 242 PVYYEEQTNIIGLILVKNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHSH 301
           P+Y      IIG ILVKNL+ + P+DE  ++++ IRR+P+V   LPLYDILN FQ G SH
Sbjct: 243 PIYSVNPNVIIGFILVKNLIKVRPEDETSIRDLPIRRMPKVDLNLPLYDILNIFQTGRSH 302

Query: 302 MAIVVKQCNKMNGKPDEQPGDDSQKEVRIDVDGERPSPSQEKAMKIKTSFKKCKSFPA-N 361
           MA VV   N  N                +       SP+++  + +        S PA N
Sbjct: 303 MAAVVGTKNHTN------------TNTPVHEKSINGSPNKDANVFL--------SIPALN 362

Query: 362 SSFRSGSSRSKKWTKDMYSDILKIDENPLPKLVEEEAVGVITMEDVIEELLQEEIFDETD 421
           SS  S  S  +      Y D +  DE       +EE +G+IT+EDV+EEL+QEEI+DETD
Sbjct: 363 SSETSHQSPIR------YIDSIS-DE-------DEEVIGIITLEDVMEELIQEEIYDETD 392

Query: 422 HQTE 425
              E
Sbjct: 423 QYVE 392

BLAST of Cp4.1LG04g07500 vs. TAIR 10
Match: AT4G14240.1 (CBS domain-containing protein with a domain of unknown function (DUF21) )

HSP 1 Score: 377.9 bits (969), Expect = 1.1e-104
Identity = 216/411 (52.55%), Postives = 280/411 (68.13%), Query Frame = 0

Query: 18  IIAFLVIFAGLMSGLTLGLMSMSLVDLEVLAKSGTPKDRKYAAKIMPVVKNQHLLLCTLL 77
           I  FLV+FAG+MSGLTLGLMS+ LV+LE+L +SGTP ++K AA I PVV+ QH LL TLL
Sbjct: 41  ISCFLVLFAGIMSGLTLGLMSLGLVELEILQRSGTPNEKKQAAAIFPVVQKQHQLLVTLL 100

Query: 78  ICNAAAMEALPIFLDSLVTAWGAVLISVTLILLFGEIIPQSVCSRYGLAIGSTVAPFVRV 137
           +CNA AME LPI+LD L   + A+++SVT +L FGE+IPQ++C+RYGLA+G+     VR+
Sbjct: 101 LCNAMAMEGLPIYLDKLFNEYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWLVRI 160

Query: 138 LVWICFPIAYPISKLLDFLLGHGHVALFRRAELKALVNMHGNEAGKGGELTHDETTIIAG 197
           L+ +C+PIA+PI K+LD +LGH   ALFRRA+LKALV++H  EAGKGGELTHDETTII+G
Sbjct: 161 LMTLCYPIAFPIGKILDLVLGHND-ALFRRAQLKALVSIHSQEAGKGGELTHDETTIISG 220

Query: 198 ALELTEKTASDAMTPISETFVIDINAKLDRKLMNLVLAKGHSRVPVYYEEQTNIIGLILV 257
           AL+LTEKTA +AMTPI  TF +D+N+KLD + M  +LA+GHSRVPVY     N+IGL+LV
Sbjct: 221 ALDLTEKTAQEAMTPIESTFSLDVNSKLDWEAMGKILARGHSRVPVYSGNPKNVIGLLLV 280

Query: 258 KNLLTIHPDDEIPVKNVTIRRIPRVPETLPLYDILNEFQKGHSHMAIVVKQCNKMNGKP- 317
           K+LLT+ P+ E  V  V IRRIPRVP  +PLYDILNEFQKG SHMA VVK   K    P 
Sbjct: 281 KSLLTVRPETETLVSAVCIRRIPRVPADMPLYDILNEFQKGSSHMAAVVKVKGKSKVPPS 340

Query: 318 ---DEQPGDDSQKEVRIDVDGERPSPSQEKAMKIKTSFKKCKSFPANSSFRSGSSRSKKW 377
              +E   + +  ++   +  +R        + I  +  +   F  N S   G S + + 
Sbjct: 341 TLLEEHTDESNDSDLTAPLLLKREGNHDNVIVTIDKANGQ-SFFQNNESGPHGFSHTSEA 400

Query: 378 TKDMYSDILKIDENPLPKLVEEEAVGVITMEDVIEELLQEEIFDETDHQTE 425
            +D                   E +G+IT+EDV EELLQEEI DETD   +
Sbjct: 401 IED------------------GEVIGIITLEDVFEELLQEEIVDETDEYVD 431

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9ZQR47.8e-18079.11DUF21 domain-containing protein At2g14520 OS=Arabidopsis thaliana OX=3702 GN=CBS... [more]
Q8VZI22.8e-17777.52DUF21 domain-containing protein At4g33700 OS=Arabidopsis thaliana OX=3702 GN=CBS... [more]
Q8RY602.6e-11955.71DUF21 domain-containing protein At1g47330 OS=Arabidopsis thaliana OX=3702 GN=CBS... [more]
Q9LTD84.1e-11254.95DUF21 domain-containing protein At5g52790 OS=Arabidopsis thaliana OX=3702 GN=CBS... [more]
Q67XQ01.5e-10352.55DUF21 domain-containing protein At4g14240 OS=Arabidopsis thaliana OX=3702 GN=CBS... [more]
Match NameE-valueIdentityDescription
XP_023529505.11.18e-295100.00DUF21 domain-containing protein At2g14520 isoform X1 [Cucurbita pepo subsp. pepo... [more]
XP_022927831.16.25e-29197.65DUF21 domain-containing protein At2g14520 isoform X2 [Cucurbita moschata][more]
KAG6588726.16.25e-29197.89DUF21 domain-containing protein, partial [Cucurbita argyrosperma subsp. sororia][more]
KAG7022511.11.26e-29098.12DUF21 domain-containing protein [Cucurbita argyrosperma subsp. argyrosperma][more]
XP_022988772.12.22e-28296.25DUF21 domain-containing protein At2g14520 isoform X2 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
A0A6J1EIM93.03e-29197.65DUF21 domain-containing protein At2g14520 isoform X2 OS=Cucurbita moschata OX=36... [more]
A0A6J1JDZ61.07e-28296.25DUF21 domain-containing protein At2g14520 isoform X2 OS=Cucurbita maxima OX=3661... [more]
A0A6J1EJ353.87e-28197.57DUF21 domain-containing protein At2g14520 isoform X1 OS=Cucurbita moschata OX=36... [more]
A0A6J1JI684.19e-27296.13DUF21 domain-containing protein At2g14520 isoform X1 OS=Cucurbita maxima OX=3661... [more]
A0A1S4E1671.24e-25387.85DUF21 domain-containing protein At2g14520-like OS=Cucumis melo OX=3656 GN=LOC103... [more]
Match NameE-valueIdentityDescription
AT2G14520.15.5e-18179.11CBS domain-containing protein with a domain of unknown function (DUF21) [more]
AT4G33700.12.0e-17877.52CBS domain-containing protein with a domain of unknown function (DUF21) [more]
AT1G47330.11.9e-12055.71CBS domain-containing protein with a domain of unknown function (DUF21) [more]
AT5G52790.12.9e-11354.95CBS domain-containing protein with a domain of unknown function (DUF21) [more]
AT4G14240.11.1e-10452.55CBS domain-containing protein with a domain of unknown function (DUF21) [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002550CNNM, transmembrane domainPFAMPF01595DUF21coord: 17..189
e-value: 1.5E-37
score: 129.0
IPR002550CNNM, transmembrane domainPROSITEPS51846CNNMcoord: 8..191
score: 57.917782
NoneNo IPR availableGENE3D3.10.580.10coord: 191..322
e-value: 4.2E-43
score: 148.8
NoneNo IPR availablePANTHERPTHR12064:SF79AND COBALT EFFLUX PROTEIN CORC, PUTATIVE-RELATEDcoord: 1..426
NoneNo IPR availableSUPERFAMILY54631CBS-domain paircoord: 194..412
IPR045095Ancient conserved domain protein familyPANTHERPTHR12064ANCIENT CONSERVED DOMAIN PROTEIN-RELATEDcoord: 1..426
IPR000644CBS domainPROSITEPS51371CBScoord: 210..271
score: 8.757091
IPR044751Ion transporter-like, CBS domainCDDcd04590CBS_pair_CorC_HlyC_assoccoord: 205..306
e-value: 2.36691E-26
score: 100.648

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG04g07500.1Cp4.1LG04g07500.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0010960 magnesium ion homeostasis
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0043231 intracellular membrane-bounded organelle