CmaCh19G006420 (gene) Cucurbita maxima (Rimu)

Name: CmaCh19G006420
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Electron transporter, putative isoform 2
Location: Cma_Chr19 : 6828123 .. 6835704 (+)
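
The location field can be split into its parts programmatically. The sketch below is illustrative only: the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("Cma_Chr19 : 6828123 .. 6835704 (+)")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the feature spans 7582 bp on the plus strand of Cma_Chr19.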
The following sequences are available for this feature:

Gene sequence (with intron)

TCGTACTGCATTTAATACTGTTTCACTGCTTGTTTTTTCGACATTGATTCAGAACCCATTTTTTGTCTCTGTAATGGCTTCTTAAAGACATCAAATTTTTTACAAACCCCTCATCGTCATTCTTCTTGACCCATGTTTCGATTCAAGAAAACCCCATCTTCTTTTTGATGGTCATTTGAAAAGGATTCGTACGGACACTAAATGGTGAGAACATACAATGTTTCTATGAGAAGATTTGGTCTGTATGTCTTGTCTGCCACGGTTGTCTCTCATAGCAAACATTTAGCTTTTTGGGGATGAAAGTTTATTAGAATGTTTTTGTTGTAATGATTTTTGCTGTTCTACGGGAGATGTTTATGGATTTTTGTTTTGATTTTAGGAGGAAATGGGCAATACAAAGCAAGTTCTTGATGGGGATGTTCAGAATTCCTTGAAGCAGGAGGTATGGACATTCTTCTAGGCCTGTAGTGGCTTTGTTTTTGTAGAACTTCCATTGAGTTCAATCTGAAAAGGGTATTAAAATCATGGAGAAACTGTTGATAGCTTATCGAAAGGGCGTAATTCTGCAACGGGATAAGTAATGTCGAGAAGTTCGACGTTCTTTCGAACTTCTGACTCTATAAACTTTAGTGTTTCACGGGGTGCTTAGGTTGTGGATTATTCAGGAGAAGTGTTCTTAAGCGAAACTGTTTTGATATGTAATGCAACCAAACTCAATATGCAGTCTTTGAAAAGTAGTTTTGTGTTTGATGTTCTATAGCTTTTACTCTTTGTTTAGAATATGCTGTTTGCTTCGGATTCGGATGGTGTGTGTGAGATCCCATATCGGTTAGGGAGGAAATGAAACATTCTTTATAAGAGTGTGGAAACCTCTTCCTAGTAGACATGTTTTGAAAACTTTGAGGGGAAGCCCGAAAGGGAAAGTTTAAAGAGGATAATATATGCTAGCGGTGGGCTTGGGCTCTTTCAAATGGTATTAGAGCATGATATTGGGCAATATGCCAGTGAGGAGGTTGAGCCTTAAAAGGGGGTGAGCACGAGGCGGTGTGCTAGCAAAGACGCTGGGCTCTGAAAGGGGTGGATAGTGAGATACCACATCGGTTGGGGAGGAGAACGAAACATTCTTTATAAGGGTGTGGAAACCTCTCCCTAGCAGACGCGTTTTAAAAACCTTGAAGGAAAGCTCGAAAGAGAAAGCCTGAAGAGGATAATATTTGCTAGGAGTGAGTTTGGGCTGTTACAAATAATATCAGAGCTAGACATCGGGAGATGTGCTAGTGAGAAGGCTGAGCCCCGAAGGGGGGTGGAGATGATGCGGTGTGCTAGCAAGGACGCTGGCCCCAAAAGGGGTGGATTGGGGGTCCCACATCAATTGGAGAAGGGAACAAATGCTAGCGAGGACGTTGGGCCTTGAAGGACCTTGAGGGGAAGCCCGAAAGTGAAAGTCCAAAGAGGACAATATTTGCTAGTAGTGGGCTTGAACGGAACACTCTTGCTAGTAGCGGGCTTGAACGGAACACTCTTGCTAACGCCTGATAGAAAGACTATGAAATCTTTGAATCAAGCTCATTTCTGATATTCTTATACATTCTCCTTAGCCTGCTCCCTGCTCTTTTTTCTGCAGATTCTACAGCTTCAAGAACAATTACAAAGCCAGTTCGTCATTCGCCACGCCTTGGAGAAGGCAATGAATTTTCAGCCTCTCTCACTCTGTTCCGCTACCGAAAACTCGATTCCAAAGGTAACGTCATGGCCATACTTCCCATACATACATGTAATGTTATCTGGTTGTATTGTTGTACATAGTCTTACTAGTTCTCATCAATGTTAGGCTGAGATGGATCTGATTAAACAAATAGCCGTCTTGGAGTTAGAAGTCGTTTATTTGGAGAAATATCTTCTCTCGTTATACCGACGAACTTTCGAGCAACAAGTATCCTCTTTTTCTACCATGGATGATAGCCAAGAATCATTTGGCAATCAATGTAAAGAACAAAATG
AAGTTGAGGAACCGGAGAAGTCGTTACACGTTCATCGCAGCTACTCGTCTCTTTCGCAGAGATCACCCGGTTCATCTCGAAGCTATCCATTATCAAAGTATATGGCTAAAGCAGTAGATTCATACCATTCCCTTCCATTATCAATGCTGGAGGTAACTTGATGTTGATCCATTTTGGACAAGGTAGAACTCCTCTCATTTGGGAACTTTTACATTGTTTTTTCAGCAATCTCGAGTTGATGCTTCGAATTCTACGAGCCTCGGGGAGCACCTCGGTTCCTGGAAATCGGATCAAGCAGACGAGTCGCCTAACTGGATTGCTGAGGAGATGATCAAGTCCATCTCTGCAATATACCGCGAACTTACCGAACCTCCTTCGACGAATCATAAGAATCGTTCTCCTATCTCACCTTTGTCATCCATGTACGAACTTTCTTCACAAGATTTGCGCAGTATGAGGAACTACGAAAAGTCGTTTAACTCGAATTTCGAGAACTCATTTCACACCGGAGAATTTAGTGCACCATACGACACGATGTTGAAGGTGCAATGGATTTCTAGAGAAAGAATGAAGGACATAGATATCAACCGCATGCTACAAGGCTTCAGGTGATAGATACCTCATTCAAAGAAGCTATCTGTTTCGTATGCCATGACTCTGATATTGTCACTGCACTGTTGGCAGGTCACTTACTTCTCGGCTCAAAGATGTCGATCTGAAAGTCATGAAACATGACGAAAAACTAGCGTTTTGGATTAATGTACACAACACGCTTGTAATGCATGTAAGTTCAGCACAAACTCTGTGTTTTTCCTTTGATAATGATAAAGCCTGCTAGTAAATGTTCACTATACTTTGTTCTATAGGCTTATTTGCAATATGGTATCCCCAAGAACAGTTTAAAGACTTTGATACTCAAGGTGAGGATAGCTTCTGTTGTATCCATTAACCAAAATAAATTCTCTAAGTACTTCTTTCATTAGCTGTTGTACTTTCCAGGCCGCGTATAATGTTGGGGGACACATAATAAGTGTCGATATGATACAAGGCTCGATTCTCGGGTGTCGTTTGCCTCGTCCGGGACAGGTTAGTATCATCTTTGTAACAGCCCAAGCCCACCGCTAACAAATATTGTCGTGTTTGGGCTTCCCAAGGTTTTAAAACACTTTTGGTAGGGAGAGGTTTCGACACTCTTGTAAAGAATGTTTCGCTTTCCTCCCCAATTGATGTGGGATCTCACAATCCACCTCCCTTTGAGGCTCAGCGTCCTCGCGGGCACTCATTCCCTTCTTTCATCGATGTGGGACCCCGCAATCCACCCCCCTTCGAGGCCCAGCGTCCTTGCTGGCACACTGCCTCGTGTTCACCCCCCTTCAAGGCTCAACCTCCTTGCTGGCACATCGCCCGGTGTCTGACTTTGATATTATTTGTAACAGCCCAAGCCCACTGCTAATAGATATTGTCCAGGCTTTTTTCATTTGAGCTTCCCCTCAAGGTTTTAAAACGTGTCTATTAGGGAGAGGTTTCCACACCCTTACAAAGAAAATGTTTCATTCTCTTCTCTAACCGATGTGGAATCTCACAATCTCAAAATTATGTTTATGTTCTCGGAATTTTGTGTCACCAATGTATATCAGTACATTCAATGTCTGACTGATTTGTTGCTTGATTAGTGGCTGCACTTGTTTCTCTCGTCGAAAACAAAACTTAAGGTCAACTATGCACAGAAATCCTTTCGAATCAACCACCCCGAACCTCGGTTATACTTTGCTCTATGTTGCGGGAGCCATTCCGATCCAGCAGTATGTTATAACATCTGATTCTAGTATTGATGTCAATCAAAGCTTCGGATGAAACCGGTTTTGATATCGCTATTTGTTGTTTTACGTAGGTTCGTGTCTATACGGCGAAGAGGCTGAACGAGGAGTTGGAGGTTGCAAAAGAAGACTACATCCTTTCGAATTTGAGGACACACAAAGGGCAGAGAATTCTACTCCCCA
AGATTGTTGAGTCCTTTGCCAAGGATTCTGGCTTATGCCTGGAAGATTTGGAGGACGTTGTCGAGTGTCTAAGGCTCGACGGGCGGATAAACGACGGTCGGCAGCTGCAACGAAAGAAGTTGTGGAAGAGTATTGGGTGGATACCTCACAACTTCACCTTCAGCTTTCTGCTATCCAAAGAATTGGGATGCCACCAGTCCCTGTCTTGATAGTTTCATACTAAAATCTGAAACATTTTTGTCGAATCGAGATTCGATCATTTGGCGTTGAAGGACGGCTCGTGATTGCAGCTTGGAATGACAGATGAATCATAGCAATGGCTTACGAAGCAAGGTCATGATGTTGAATCCAAAAGGGGAAGAAAAAGGTGCGTTTTGGACCAAAAATTGAGCTGTTTTATCGGTCAAAAACAGGAACAGCAGGTTATGATTCAACTGTCATCCTAAGATGTCTTGTCTGATGTCTGTTATTGTTGATGCAATGTCTCAAAATGCTGCAGTTTTTGTAGACTTCCTTTGATTTTGTCAATCTGTTCTAAGGAAGTTTGTCTAATGGTTGTTTCAAATATGAGCTTTACCTACCGTACTATTGTTAGATCCTACATCGATTGGAGAGGTGAACGAAACATTTTTATAGTGTAGAAACCTCTTCTTATTATACGCGTTTTAAAAATCGTGAGGCTGAGGACAATACATAACGGGCCAAAACGGACAATATTTGCAAGCGGTTGATTTGAACAGCTACAAATGGTATCAGAGCCAGACACCGAACGTTGGGCCCCCAAGGAAGGTGAATTGTGAGATCCCACATCAATCAGAAAGGGGAACGAAACATTCCTTACAAGTGTGTGGAAACCTTTCTCTAGCAGACGCGTTTTAAAAACAATGAGGCTGACGACGATACGTAACGATTCCTTATAAGAGTGGAAACCTCTCCTTAGCAAACGCTTTTTAAAAAACAGTGAGGTTGATGACAACAAGTATCGGTTCCTTGTAAGGGTGTAGAAACCTCTCCCTAGCAGATGCAATATATAATAAGTCAAAACGGACAATATCTATTAGTGGTGGGTTTGGTACATATATGTAGCAATTTTCAAGTTTTTTAGTCGTCCAATCCTTAGAATCTCTTTCGGGTGGGGTCTTGGTTTATTTATGTATATCTCATTTATTCTGGGTCACAAAAATAATAAATAATTTTTTAATGCACGTCGATTTTAATCAATTTTAAATGAATTTAATAGTGTTGACAATCAAATTAAAAAAAAAAAATAATTTTTAAGTAAATAAATAAAAATGATTAAATGTTTAAATTTTTTGAAGCGTATAAACTACGATAACTGTTTTTTTTTTTTTCATAAAGTATTATTATAATATAGATTATATATATGACAAAGAAGCACCTGCCACAGACCTCATAGCTTGGCGGTTGTTTGAGACCTTTTCCTCCATTGTCATCTTCACTTTCAGAAGAAACCCTAATCCGCCATTTTTGTTCCTCGAAAACCCTCTCGGAGGGTTTGGTTTGGTGTGGGGAGAGATGGAAGAACAAAGCACGGCTATAATTTTAGCAAGAGCGACAGAGTTGAGGCTGAAGATTAGAAGCTCTGTTAACACCACCACCACGAGTTCGGCGGTAACTTCCTGGGAGAATCGGGATGATCGGTTCGCCGTGGATGAAAATAATGGCGTTGGTTCGCGGCGGAGTGTGGCCGACGCGATTGAGGTAACGGAAGAAGACGAGGAAGCGGTGAGGCTTTTGAATATCTGCGGTGCGCTCGAGTCTCTTGAGAATCAACTCTCTTCGTTTCAGGTTTGATTGTTCTTTCCTTGTTTTATGAATGAAGATTGAAGAATATGTTTCGATTTAGTGACTTATGTTGAAAATTGTTGTGCGTTATATGATCGATGTAGGCCTGTTTGGATCTGATAGTTGTCGTGATTGCTCAAAGTTGCATAGTCTAGTCCAAGTACTTACTGTATAAAGATCGATTTGAATCAGTT
TAGGGTATAAATTGAATTCATTCGAGCATCATTTTAGATATGGAATTGTTTTCTGTTCCTAAACATATGCTAGTAGTCTGAAAGCTTGCATTTATATGTTGGTGCATTGTTGCACTTACTACCTCTCTGGTACTTGGATAAAAATATGAATGGAGTGTGAGATCCACGTCGGTTGGGGAGGAGAACTAAACATTCTTTATAAGAGTGGAAACCTCTCCCTATCCCTATTTGACATGTTATAGAACCTTGAGGGGAAGCTTGGAAGGGAAAGCCTAAAGATGATAATATCTGCTAGTAGTGGACTTGGGTGGTTACAAATAGTATTAGAGCCGGACATTGGGCTATGTGCCAATGAGAAGGTTGAGCCCTGAAGGGGGTGGACATGAGACGGTGTGCCATCAAGGACTTTGGGCCCTGAAGGAGGTGGATTGGGGGTCCCACATCGATTGGAGAAGGGAATGAGTGCCAGCAAGGACCGCTGGGCCTTGAAGGGGTAGATTGTGAGATCCCAAATCGGTTGGGGAGGAGAACAAACCATTCTTTATAAGGGTGTGAAAGCCTCTCTCTAGCAGACTTGTTTTAAAACTTTGAGAGAAAGTCCGAAAGGGAAAGCCCATATAATACAATATCTGCTAGCGGTGGACTTGGGTCGTTACATGTGGAGCATGATTCTCCCTCAATTTGAACATTTTATTGCATTTATTTGCTAATACTAAAAGTATCAGGCAGTTGAAAGCTGATGGTTTTAGTGCTTCAGCATATACAACTTATGAGGTTGTTAAAGCCTTCAACGCCTCACTTCACAATCCACCCCACCCTTTGGGCCCAGCGTCTTCGCTGGCACACTACTCGGTATCTACCTCTGATATCATTTGTAACAGCCCAAGTTCACCGCTAGCAAATATTATCATTTTTGAATTTTTCGTTTCAGGCTTCCCCTCAAGATTTTCAAAACACGTCTGCCAAGGAGAGGTTTCACACCCTTATAAGGAATGCTTATTGTCTTCTATATTATTTTATAGAACATTCTTATATTTTATCTTCTAATTTGCAGGATTTACAACAACTGCAAAGGTACGAGAAGGAAGTAGCCCTTTCCGAGATCGAGCATAGCCGCGAGATGTTACTGGATAAGCTGAAGAAGTACGAAGGAGAGGATTTGGAAGTGATACACGAGGCGTCAGCTTTTGTTGGGGAGACCGTGCAGCACAACCAGGATCTCATGCTTCCTCCATATACAAGCCATCCTGGTAATGGCTACTTACATCCCTTCCCTTCTGCACTCAAGTTTGTGAGGGCTGCTACAAATAAAGCTACAAAGGAACTTAACGAATCAGAACGGAAACACACGAAATTGAATTCGAGGAACTCGAGGAATAGATTAGGATCCTTCATTAGTATAGCTGCAAAATCCGTGGTTACCATTGTTGGCATAGTATCCCTACTGCACTTGGCTGGTTTTCGACCGAAGTTTGCAGCAAAAATTGCTGCTTTGAAGGCTTTGGACTGTTTTCGACGGTCTGCAGCTGTAAATAAAGAATCACGCCATGGATGCCCTCCGGGAAAGCTCAATGCGTCGTGA

mRNA sequence

TCGTACTGCATTTAATACTGTTTCACTGCTTGTTTTTTCGACATTGATTCAGAACCCATTTTTTGTCTCTGTAATGGCTTCTTAAAGACATCAAATTTTTTACAAACCCCTCATCGTCATTCTTCTTGACCCATGTTTCGATTCAAGAAAACCCCATCTTCTTTTTGATGGTCATTTGAAAAGGATTCGTACGGACACTAAATGGAGGAAATGGGCAATACAAAGCAAGTTCTTGATGGGGATGTTCAGAATTCCTTGAAGCAGGAGATTCTACAGCTTCAAGAACAATTACAAAGCCAGTTCGTCATTCGCCACGCCTTGGAGAAGGCAATGAATTTTCAGCCTCTCTCACTCTGTTCCGCTACCGAAAACTCGATTCCAAAGGCTGAGATGGATCTGATTAAACAAATAGCCGTCTTGGAGTTAGAAGTCGTTTATTTGGAGAAATATCTTCTCTCGTTATACCGACGAACTTTCGAGCAACAAGTATCCTCTTTTTCTACCATGGATGATAGCCAAGAATCATTTGGCAATCAATGTAAAGAACAAAATGAAGTTGAGGAACCGGAGAAGTCGTTACACGTTCATCGCAGCTACTCGTCTCTTTCGCAGAGATCACCCGGTTCATCTCGAAGCTATCCATTATCAAAGTATATGGCTAAAGCAGTAGATTCATACCATTCCCTTCCATTATCAATGCTGGAGCAATCTCGAGTTGATGCTTCGAATTCTACGAGCCTCGGGGAGCACCTCGGTTCCTGGAAATCGGATCAAGCAGACGAGTCGCCTAACTGGATTGCTGAGGAGATGATCAAGTCCATCTCTGCAATATACCGCGAACTTACCGAACCTCCTTCGACGAATCATAAGAATCGTTCTCCTATCTCACCTTTGTCATCCATGTACGAACTTTCTTCACAAGATTTGCGCAGTATGAGGAACTACGAAAAGTCGTTTAACTCGAATTTCGAGAACTCATTTCACACCGGAGAATTTAGTGCACCATACGACACGATGTTGAAGGTGCAATGGATTTCTAGAGAAAGAATGAAGGACATAGATATCAACCGCATGCTACAAGGCTTCAGGTCACTTACTTCTCGGCTCAAAGATGTCGATCTGAAAGTCATGAAACATGACGAAAAACTAGCGTTTTGGATTAATGTACACAACACGCTTGTAATGCATGCTTATTTGCAATATGGTATCCCCAAGAACAGTTTAAAGACTTTGATACTCAAGGCCGCGTATAATGTTGGGGGACACATAATAAGTGTCGATATGATACAAGGCTCGATTCTCGGGTGTCGTTTGCCTCGTCCGGGACAGTGGCTGCACTTGTTTCTCTCGTCGAAAACAAAACTTAAGGTCAACTATGCACAGAAATCCTTTCGAATCAACCACCCCGAACCTCGGTTATACTTTGCTCTATGTTGCGGGAGCCATTCCGATCCAGCAGTTCGTGTCTATACGGCGAAGAGGCTGAACGAGGAGTTGGAGGTTGCAAAAGAAGACTACATCCTTTCGAATTTGAGGACACACAAAGGGCAGAGAATTCTACTCCCCAAGATTGTTGAGTCCTTTGCCAAGGATTCTGGCTTATGCCTGGAAGATTTGGAGGACGTTGTCGAGTGTCTAAGGCTCGACGGGCGGATAAACGACGGTCGGCAGCTGCAACGAAAGAAGTTGTGGAAGAGTATTGGAAGAAACCCTAATCCGCCATTTTTGTTCCTCGAAAACCCTCTCGGAGGGTTTGGTTTGGTGTGGGGAGAGATGGAAGAACAAAGCACGGCTATAATTTTAGCAAGAGCGACAGAGTTGAGGCTGAAGATTAGAAGCTCTGTTAACACCACCACCACGAGTTCGGCGGTAACTTCCTGGGAGAATCGGGATGATCGGTTCGCCGTGGATGAAAATAATGGCGTTGGTTCGCGGCGGAGTGTGGCCGACGCGATTGAGGTAACGGAAGAAGACGAGGAAGCGGTGAGGCTTTTGAATAT
CTGCGGTGCGCTCGAGTCTCTTGAGAATCAACTCTCTTCGTTTCAGGATTTACAACAACTGCAAAGGTACGAGAAGGAAGTAGCCCTTTCCGAGATCGAGCATAGCCGCGAGATGTTACTGGATAAGCTGAAGAAGTACGAAGGAGAGGATTTGGAAGTGATACACGAGGCGTCAGCTTTTGTTGGGGAGACCGTGCAGCACAACCAGGATCTCATGCTTCCTCCATATACAAGCCATCCTGGTAATGGCTACTTACATCCCTTCCCTTCTGCACTCAAGTTTGTGAGGGCTGCTACAAATAAAGCTACAAAGGAACTTAACGAATCAGAACGGAAACACACGAAATTGAATTCGAGGAACTCGAGGAATAGATTAGGATCCTTCATTAGTATAGCTGCAAAATCCGTGGTTACCATTGTTGGCATAGTATCCCTACTGCACTTGGCTGGTTTTCGACCGAAGTTTGCAGCAAAAATTGCTGCTTTGAAGGCTTTGGACTGTTTTCGACGGTCTGCAGCTGTAAATAAAGAATCACGCCATGGATGCCCTCCGGGAAAGCTCAATGCGTCGTGA

Coding sequence (CDS)

ATGGAGGAAATGGGCAATACAAAGCAAGTTCTTGATGGGGATGTTCAGAATTCCTTGAAGCAGGAGATTCTACAGCTTCAAGAACAATTACAAAGCCAGTTCGTCATTCGCCACGCCTTGGAGAAGGCAATGAATTTTCAGCCTCTCTCACTCTGTTCCGCTACCGAAAACTCGATTCCAAAGGCTGAGATGGATCTGATTAAACAAATAGCCGTCTTGGAGTTAGAAGTCGTTTATTTGGAGAAATATCTTCTCTCGTTATACCGACGAACTTTCGAGCAACAAGTATCCTCTTTTTCTACCATGGATGATAGCCAAGAATCATTTGGCAATCAATGTAAAGAACAAAATGAAGTTGAGGAACCGGAGAAGTCGTTACACGTTCATCGCAGCTACTCGTCTCTTTCGCAGAGATCACCCGGTTCATCTCGAAGCTATCCATTATCAAAGTATATGGCTAAAGCAGTAGATTCATACCATTCCCTTCCATTATCAATGCTGGAGCAATCTCGAGTTGATGCTTCGAATTCTACGAGCCTCGGGGAGCACCTCGGTTCCTGGAAATCGGATCAAGCAGACGAGTCGCCTAACTGGATTGCTGAGGAGATGATCAAGTCCATCTCTGCAATATACCGCGAACTTACCGAACCTCCTTCGACGAATCATAAGAATCGTTCTCCTATCTCACCTTTGTCATCCATGTACGAACTTTCTTCACAAGATTTGCGCAGTATGAGGAACTACGAAAAGTCGTTTAACTCGAATTTCGAGAACTCATTTCACACCGGAGAATTTAGTGCACCATACGACACGATGTTGAAGGTGCAATGGATTTCTAGAGAAAGAATGAAGGACATAGATATCAACCGCATGCTACAAGGCTTCAGGTCACTTACTTCTCGGCTCAAAGATGTCGATCTGAAAGTCATGAAACATGACGAAAAACTAGCGTTTTGGATTAATGTACACAACACGCTTGTAATGCATGCTTATTTGCAATATGGTATCCCCAAGAACAGTTTAAAGACTTTGATACTCAAGGCCGCGTATAATGTTGGGGGACACATAATAAGTGTCGATATGATACAAGGCTCGATTCTCGGGTGTCGTTTGCCTCGTCCGGGACAGTGGCTGCACTTGTTTCTCTCGTCGAAAACAAAACTTAAGGTCAACTATGCACAGAAATCCTTTCGAATCAACCACCCCGAACCTCGGTTATACTTTGCTCTATGTTGCGGGAGCCATTCCGATCCAGCAGTTCGTGTCTATACGGCGAAGAGGCTGAACGAGGAGTTGGAGGTTGCAAAAGAAGACTACATCCTTTCGAATTTGAGGACACACAAAGGGCAGAGAATTCTACTCCCCAAGATTGTTGAGTCCTTTGCCAAGGATTCTGGCTTATGCCTGGAAGATTTGGAGGACGTTGTCGAGTGTCTAAGGCTCGACGGGCGGATAAACGACGGTCGGCAGCTGCAACGAAAGAAGTTGTGGAAGAGTATTGGAAGAAACCCTAATCCGCCATTTTTGTTCCTCGAAAACCCTCTCGGAGGGTTTGGTTTGGTGTGGGGAGAGATGGAAGAACAAAGCACGGCTATAATTTTAGCAAGAGCGACAGAGTTGAGGCTGAAGATTAGAAGCTCTGTTAACACCACCACCACGAGTTCGGCGGTAACTTCCTGGGAGAATCGGGATGATCGGTTCGCCGTGGATGAAAATAATGGCGTTGGTTCGCGGCGGAGTGTGGCCGACGCGATTGAGGTAACGGAAGAAGACGAGGAAGCGGTGAGGCTTTTGAATATCTGCGGTGCGCTCGAGTCTCTTGAGAATCAACTCTCTTCGTTTCAGGATTTACAACAACTGCAAAGGTACGAGAAGGAAGTAGCCCTTTCCGAGATCGAGCATAGCCGCGAGATGTTACTGGATAAGCTGAAGAAGTACGAAGGAGAGGATTTGGAAGTGATACACGAGGCGTCAGCTTTTGTTGGGGAGACCGTGCAGCA
CAACCAGGATCTCATGCTTCCTCCATATACAAGCCATCCTGGTAATGGCTACTTACATCCCTTCCCTTCTGCACTCAAGTTTGTGAGGGCTGCTACAAATAAAGCTACAAAGGAACTTAACGAATCAGAACGGAAACACACGAAATTGAATTCGAGGAACTCGAGGAATAGATTAGGATCCTTCATTAGTATAGCTGCAAAATCCGTGGTTACCATTGTTGGCATAGTATCCCTACTGCACTTGGCTGGTTTTCGACCGAAGTTTGCAGCAAAAATTGCTGCTTTGAAGGCTTTGGACTGTTTTCGACGGTCTGCAGCTGTAAATAAAGAATCACGCCATGGATGCCCTCCGGGAAAGCTCAATGCGTCGTGA

Protein sequence

MEEMGNTKQVLDGDVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLCSATENSIPKAEMDLIKQIAVLELEVVYLEKYLLSLYRRTFEQQVSSFSTMDDSQESFGNQCKEQNEVEEPEKSLHVHRSYSSLSQRSPGSSRSYPLSKYMAKAVDSYHSLPLSMLEQSRVDASNSTSLGEHLGSWKSDQADESPNWIAEEMIKSISAIYRELTEPPSTNHKNRSPISPLSSMYELSSQDLRSMRNYEKSFNSNFENSFHTGEFSAPYDTMLKVQWISRERMKDIDINRMLQGFRSLTSRLKDVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNSLKTLILKAAYNVGGHIISVDMIQGSILGCRLPRPGQWLHLFLSSKTKLKVNYAQKSFRINHPEPRLYFALCCGSHSDPAVRVYTAKRLNEELEVAKEDYILSNLRTHKGQRILLPKIVESFAKDSGLCLEDLEDVVECLRLDGRINDGRQLQRKKLWKSIGRNPNPPFLFLENPLGGFGLVWGEMEEQSTAIILARATELRLKIRSSVNTTTTSSAVTSWENRDDRFAVDENNGVGSRRSVADAIEVTEEDEEAVRLLNICGALESLENQLSSFQDLQQLQRYEKEVALSEIEHSREMLLDKLKKYEGEDLEVIHEASAFVGETVQHNQDLMLPPYTSHPGNGYLHPFPSALKFVRAATNKATKELNESERKHTKLNSRNSRNRLGSFISIAAKSVVTIVGIVSLLHLAGFRPKFAAKIAALKALDCFRRSAAVNKESRHGCPPGKLNAS
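
The relationship between the coding sequence and the protein sequence above can be checked with a small translation routine. This is a minimal sketch that assumes the standard genetic code; the function name `translate` is hypothetical, and only the first and last few codons of the CDS are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons of the CDS listed above.
print(translate("ATGGAGGAAATGGGCAATACAAAGCAA"))
# Last four codons of the CDS, including the TGA stop.
print(translate("AATGCGTCGTGA"))
```

The two fragments translate to `MEEMGNTKQ` and `NAS`, which match the start and end of the protein sequence listed above.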

BLAST of CmaCh19G006420 vs. Swiss-Prot
Match: PDV2_ARATH (Plastid division protein PDV2 OS=Arabidopsis thaliana GN=PDV2 PE=1 SV=1)

HSP 1 Score: 158.7 bits (400), Expect = 2.6e-37
Identity = 107/276 (38.77%), Positives = 158/276 (57.25%), Query Frame = 1

Query: 526 EEQSTAIILARATELRLKIRSSVNTTTTSSAVTSWENRDDRFAVDENNG-----VGSRRS 585
           +E+   +ILARATELRLKI   ++ ++T+ +    +N D    +    G     +G++  
Sbjct: 3   DEEGIGLILARATELRLKISDCIDNSSTTVS----DNGDGNEDLSPGEGRKSEIIGNQDK 62

Query: 586 VADAIEVTEEDE-EAVRLLNICGALESLENQLSSFQDLQQLQRYEKEVALSEIEHSREML 645
             D+I   + DE EA RLL I  ALE+LE+QL+S Q+L+Q Q+YEK++ALSEI++SR+ML
Sbjct: 63  DFDSISSEDVDEAEAERLLRIRDALEALESQLASLQNLRQRQQYEKQLALSEIDYSRKML 122

Query: 646 LDKLKKYEGEDLEVIHEASAFVGETVQHNQDLMLPPYTSHP--------GNGYLHPFPSA 705
           L+KLK+Y+G+D EV+ E + F GE V +  DL+LPPY  HP         NGYL   PS 
Sbjct: 123 LEKLKEYKGKDFEVLRETTTFAGERVDYENDLLLPPYPVHPPLSLGLDNNNGYLSHLPSK 182

Query: 706 LKFVRAATNKATKELNESERKHTKLNSRNSRNRLGSFISIAAKSVVTIVGIVSLLHLAGF 765
            K             NE+E K     S  S + +  F+   AK V+ I+G++SLL  +G+
Sbjct: 183 KKSDANGFGSGHVR-NEAEAKSPNGGSGGSSHGVIRFLGSVAKIVLPIIGVISLLSASGY 242

Query: 766 RPKFAAKIAALKALDCFRRSAAVNKESRHGCPPGKL 788
            P+   + A+L         A   K + + CPPGK+
Sbjct: 243 GPEMRKRGASLNLFGLLPHRATRGKRTPNQCPPGKV 273
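
The percentages in an HSP header are the match counts divided by the alignment length. A brief sketch, using a hypothetical helper `percent` and the counts reported for the PDV2_ARATH hit above:

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the PDV2_ARATH HSP header: 107 identical and
# 158 positive (identical or conservatively substituted) columns out of 276.
identity = percent(107, 276)
positives = percent(158, 276)
print(identity, positives)
```

These reproduce the 38.77% identity and 57.25% positives shown in the header.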

BLAST of CmaCh19G006420 vs. Swiss-Prot
Match: PDV1_ARATH (Plastid division protein PDV1 OS=Arabidopsis thaliana GN=PDV1 PE=1 SV=1)

HSP 1 Score: 54.3 bits (129), Expect = 7.0e-06
Identity = 48/162 (29.63%), Positives = 82/162 (50.62%), Query Frame = 1

Query: 523 GEMEEQSTAIILARATELRLKIRSSVNTTTTSSAVTSWE--NRDDRFAVDENNGVGSRRS 582
           GEME +    +L +  +L  K+   ++  + S  + S +  NR ++      N    +R 
Sbjct: 2   GEMEIEEIEAVLEKIWDLHDKLSDEIHLISKSHFLKSVKPSNRSEKRKNPHGNSGEDKRP 61

Query: 583 ---VADAIEVTEED---EEAVRLLNICGALESLENQLSSFQDLQQLQRYEKEVALSEIEH 642
                    V + D   +EA  L  I  ALE+LE+QL  F  +   QR EK+VA++ +E 
Sbjct: 62  GYVFIKGFAVDDNDSTIQEAKSLNAIRTALENLEDQLEFFHTIHTQQRTEKDVAIARLEQ 121

Query: 643 SREMLLDKLKKYEGEDLEVIHEASAFVGETVQHNQDLMLPPY 677
           SR +L  +L ++ G++  V+ EA AFVG +++ N   + P +
Sbjct: 122 SRILLAMRLAEHHGKNYGVLEEALAFVG-SIKSNSHYVSPDH 162

BLAST of CmaCh19G006420 vs. TrEMBL
Match: A0A0A0K861_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G075610 PE=4 SV=1)

HSP 1 Score: 764.2 bits (1972), Expect = 1.5e-217
Identity = 411/536 (76.68%), Positives = 439/536 (81.90%), Query Frame = 1

Query: 5   GNTKQVLDGDVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLCSATENSIPKAEM 64
           GN +Q+ DGD Q SLKQEILQL+EQLQSQF  RHALEKA+NFQPLSL SATE++IP+AEM
Sbjct: 6   GNKQQISDGDAQISLKQEILQLEEQLQSQFATRHALEKAINFQPLSLYSATEDAIPEAEM 65

Query: 65  DLIKQIAVLELEVVYLEKYLLSLYRRTFEQQVSSFSTMDDSQES---------------- 124
           +LIKQIAVLELEVVYLEKYLLSLYRRTF QQVSSFSTMDD  ES                
Sbjct: 66  ELIKQIAVLELEVVYLEKYLLSLYRRTFNQQVSSFSTMDDRLESYIEPNNVIEGEHSCIH 125

Query: 125 ----------FGNQCKEQNEVEEPEKSLHVHRSYSSLSQRSPGSSRSYPLSKYMAKAVDS 184
                     F NQ K +N VEEPE   H+HRS SSLSQRS GSSR+Y LSK MAKAVDS
Sbjct: 126 SDHIGSPETLFDNQSKGRNVVEEPENLSHLHRSNSSLSQRSLGSSRNYSLSKSMAKAVDS 185

Query: 185 YHSLPLSMLEQSRVDASNSTSLGEHLGSWKSDQADESPNWIAEEMIKSISAIYRELTEPP 244
           YHS PLSMLEQSR+D  +STSLGEHLG+  S + DESPNW++EEMIKSISAIYREL EPP
Sbjct: 186 YHSFPLSMLEQSRIDVPSSTSLGEHLGACLSIRVDESPNWLSEEMIKSISAIYRELAEPP 245

Query: 245 STNHKNRSPISPLSSMYELSSQDLRSMRNYEKSFNSNFENSFHTGEFSAPYDTMLKVQWI 304
             NH N SPISPLSSMYELSSQD  SMRNYEKS NS+FEN FHT EF APYDTMLKVQWI
Sbjct: 246 LMNHNNPSPISPLSSMYELSSQDFGSMRNYEKSLNSHFENPFHTEEFIAPYDTMLKVQWI 305

Query: 305 SRERMKDIDINRMLQGFRSLTSRLKDVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPK 364
           SRER  D DIN MLQGFRSL  RLK+V LK MKHDEKLAFWINVHNTLVMHAYLQYGI K
Sbjct: 306 SRERKNDSDINHMLQGFRSLIFRLKEVKLKAMKHDEKLAFWINVHNTLVMHAYLQYGISK 365

Query: 365 NSLK--TLILKAAYNVGGHIISVDMIQGSILGCRLPRPGQWLHLFLSSKTKLKVNYAQKS 424
           + LK  +LILKAAYN+GGHIISVD IQ SILGCRLPR GQWLHLFLSSKTK KVN  QKS
Sbjct: 366 HCLKRISLILKAAYNIGGHIISVDKIQSSILGCRLPRSGQWLHLFLSSKTKFKVNDVQKS 425

Query: 425 FRINHPEPRLYFALCCGSHSDPAVRVYTAKRLNEELEVAKEDYILSNLRTHKGQRILLPK 484
           F INHPEPRLYFALCCGSHSDPAVR+YTAKR+NEELEVAKE+YILSNLR HKGQ+ILLPK
Sbjct: 426 FPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEEYILSNLRVHKGQKILLPK 485

Query: 485 IVESFAKDSGLCLEDLEDVVECLRLDGRINDGRQLQRKKLWKSIGRNP-NPPFLFL 512
           IVESFAKDSGLCLEDLE+ VECLR   RIND +Q QRKKLWKSIG  P N  F FL
Sbjct: 486 IVESFAKDSGLCLEDLENTVECLRSKRRINDIQQRQRKKLWKSIGWIPHNFTFSFL 541

BLAST of CmaCh19G006420 vs. TrEMBL
Match: M5VP81_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa020192mg PE=4 SV=1)

HSP 1 Score: 493.0 bits (1268), Expect = 6.6e-136
Identity = 284/524 (54.20%), Positives = 357/524 (68.13%), Query Frame = 1

Query: 8   KQVLDGDVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLCSATENSIPKAEMDLI 67
           +Q  D DVQ+SL QEILQLQ+QLQ QF++RHAL+KA++++PLS  S  E S+PK+  +++
Sbjct: 14  RQSTDSDVQSSLNQEILQLQKQLQDQFLVRHALQKALSYRPLSPDSTIETSLPKSTKEVV 73

Query: 68  KQIAVLELEVVYLEKYLLSLYRRTFEQQVSSFSTMDDSQE-------------------- 127
           K+IAVLELEVVYLE+YLLSLYR+TF+QQ+SS  ++   Q+                    
Sbjct: 74  KEIAVLELEVVYLERYLLSLYRKTFDQQISSELSVAPRQDATPENSNTVTYSDDLMSPRN 133

Query: 128 SFGNQCKEQNEVEEPEKSLH--VHRSYSSLSQRSPGSSRSYPLSKYMAKAVDSYHSLPLS 187
           S     KE N++ E +K L   +HR+YSSLSQRS  S+R+ P +K  AKAVDSYHSLP S
Sbjct: 134 SIVKPLKECNDIVEQQKLLDSSIHRTYSSLSQRSTCSTRTSPRTKSRAKAVDSYHSLPFS 193

Query: 188 MLEQSRVDASNSTSLGEHLGSWKSDQADESPNWIAEEMIKSISAIYRELTEPPSTNHKNR 247
           MLEQ++  A+ +    E L ++ SDQ  ++PN I+EEMIK I+ I+ EL +PP  +H   
Sbjct: 194 MLEQAQ-SATTNVYPTEQLETYFSDQVPDTPNCISEEMIKCIATIFCELADPPVISHDYS 253

Query: 248 SPISPLSSMYELSS----QDLRSMRNYEKSFNSNFENSFHT---GEFSAPYDTMLKVQWI 307
           S     SS Y LSS    +   S R     FNS+F+  FH     E S PY  MLKV  I
Sbjct: 254 SSPITSSSTYNLSSHSQGEKWSSKRTKVPFFNSHFDKPFHIEGPDEMSGPYCRMLKVHSI 313

Query: 308 SRERMKDIDINRMLQGFRSLTSRLKDVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPK 367
            R+  K  D+   L+ FRSL  RL++VDL+ MKH+EKLAFWINVHN LVMHA+L YGIP+
Sbjct: 314 RRDAEKLGDVEHALKKFRSLIYRLEEVDLRTMKHEEKLAFWINVHNALVMHAFLVYGIPQ 373

Query: 368 NSLK--TLILKAAYNVGGHIISVDMIQGSILGCRLPRPGQWLHLFLSSKTKLKVNYAQKS 427
           N+LK  +L+LKAAYNVGGH ISVDMIQ SILGCRLPRPGQWL L  S KTK KV  A+K+
Sbjct: 374 NNLKRVSLLLKAAYNVGGHAISVDMIQRSILGCRLPRPGQWLRLLFSMKTKFKVGDARKA 433

Query: 428 FRINHPEPRLYFALCCGSHSDPAVRVYTAKRLNEELEVAKEDYILSNLRTHKGQRILLPK 487
           + I HPEP L+FALC GSHSDPAVRVYT+KR+ EELE AK +YI S    HK Q+ILLPK
Sbjct: 434 YSIEHPEPLLHFALCSGSHSDPAVRVYTSKRVFEELETAKHEYIQSTFLVHKEQKILLPK 493

Query: 488 IVESFAKDSGLCLEDLEDVVECLRLDGRINDGRQLQRKKLWKSI 501
           IVESFAKD+GLC  DL  ++E    D +    +  Q K+ WK I
Sbjct: 494 IVESFAKDTGLCSADLMGMIEHFMPDFQRKSNKPFQHKRTWKGI 536

BLAST of CmaCh19G006420 vs. TrEMBL
Match: W9RBN6_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_026965 PE=4 SV=1)

HSP 1 Score: 487.3 bits (1253), Expect = 3.6e-134
Identity = 294/555 (52.97%), Positives = 373/555 (67.21%), Query Frame = 1

Query: 14  DVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLCSATENSIPKAEMDLIKQIAVL 73
           DVQ+SL QEI++LQ QLQ QF++RHALEKA+++QPLS   A ++S+PK   +LIK+I VL
Sbjct: 80  DVQSSLNQEIMKLQRQLQDQFLVRHALEKALSYQPLSHEVAIDDSVPKPTKELIKEITVL 139

Query: 74  ELEVVYLEKYLLSLYRRTFEQQVSSFSTMD------------DSQESFG----------- 133
           ELE++YLE+YLLSLYR+TF+QQ+SS ST D             S E  G           
Sbjct: 140 ELEILYLERYLLSLYRKTFDQQMSSASTADGRLDSALVTEKGTSSEVSGQADIMPHRENS 199

Query: 134 ---------------NQCKEQNEVEEPEKSL--HVHRSYSSLSQRSPGSSRSYPLSKYMA 193
                             +E +++ EPEK L   +HRS+SSLSQRS  S R+ P  K++ 
Sbjct: 200 VLHHSRNLMPPRNSLGSLQECDDMWEPEKLLDSSIHRSHSSLSQRSFCSFRTSP-RKFLN 259

Query: 194 KAVDSYHSLPLSMLEQSRVDASNSTSLGEHLGSWKSDQADESPNWIAEEMIKSISAIYRE 253
            AV+SYHSLPLSMLEQ+     N+ SL EHLG+   D   E+PN ++EEMI+ ISAIY E
Sbjct: 260 NAVNSYHSLPLSMLEQAESSTPNA-SLAEHLGASLYDHVPETPNCLSEEMIRCISAIYCE 319

Query: 254 LTEPPSTNHKN-RSPISPLSSMYELS----SQDLRSMRNYEKSFNSNFENSFHTGE---F 313
           L++PP  N  +  SP++  SS+YE S    S+   S+      FNS+ +N FH  E    
Sbjct: 320 LSDPPLINQDSPSSPVAFSSSIYEASIHSHSEKWSSISRKIPFFNSHLDNPFHLDESEGL 379

Query: 314 SAPYDTMLKVQWISRERMKDIDINRMLQGFRSLTSRLKDVDLKVMKHDEKLAFWINVHNT 373
           + PY  MLKVQWI R+  K  +I+ MLQ FRSL  +L++VDL+ MKH+EKLAFWINVHN 
Sbjct: 380 TRPYFRMLKVQWICRDSEKLREIDNMLQKFRSLVYQLEEVDLRKMKHEEKLAFWINVHNA 439

Query: 374 LVMHAYLQYGIPKNSLK--TLILKAAYNVGGHIISVDMIQGSILGCRLPRPGQWLHLFLS 433
           LVMHA+L YGIP+N+LK  +L+LKAAYNVGGH ISVDMIQ SILGCRLPRPGQWL L  S
Sbjct: 440 LVMHAFLVYGIPQNNLKRVSLVLKAAYNVGGHTISVDMIQNSILGCRLPRPGQWLRLLFS 499

Query: 434 SKTKLKVNYAQKSFRINHPEPRLYFALCCGSHSDPAVRVYTAKRLNEELEVAKEDYILSN 493
           SKTK KV  A+K+F I+HPEP L+FALC GS  DPAVRVYT KR+ EELE AKE+Y+ SN
Sbjct: 500 SKTKFKVGDARKAFGIDHPEPLLHFALCSGSQYDPAVRVYTPKRVFEELETAKEEYVQSN 559

Query: 494 LRTHKGQRILLPKIVESFAKDSGLCLEDLEDVVECLRLDGRIND--GRQLQRKKLWKSIG 517
              HK Q+ILLPKIVESFAKDSGL L DL ++V+    D +  +      Q KK+ + + 
Sbjct: 560 FIVHKEQKILLPKIVESFAKDSGLSLIDLVEMVDHFMPDSQRKNISIHHCQHKKICQLMN 619

BLAST of CmaCh19G006420 vs. TrEMBL
Match: A0A061DWR5_THECC (Uncharacterized protein isoform 3 OS=Theobroma cacao GN=TCM_006317 PE=4 SV=1)

HSP 1 Score: 463.8 bits (1192), Expect = 4.3e-127
Identity = 281/552 (50.91%), Positives = 359/552 (65.04%), Query Frame = 1

Query: 2   EEMGNTK--------QVLDGDVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLCS 61
           +EMG TK        Q  + +VQNSLKQEIL LQE+L  QFV+R ALEKA++ +P +   
Sbjct: 49  KEMGQTKGRAEAKKSQTSNTEVQNSLKQEILHLQERLLDQFVVRRALEKALSHRPFTHDV 108

Query: 62  ATENSIPKAEMDLIKQIAVLELEVVYLEKYLLSLYRRTFEQQVSSFSTMDD--------- 121
           A EN IPKA M++IK+IAVLELEV YLEKYLLSLYR+ F+++ SS +T+ +         
Sbjct: 109 AVENLIPKAAMEVIKEIAVLELEVAYLEKYLLSLYRKNFDKRFSSLTTVGEVLRRTSVAH 168

Query: 122 --------------SQESFGNQCKE----QNEVEEPEKSL------------HVHRSYSS 181
                          +E+   Q  +    +N +  P K               +HRS+SS
Sbjct: 169 KEMFPEVQAHYIMSDKENLATQSSDLETSRNSIGNPPKECSDIWGAEKLLDSSIHRSHSS 228

Query: 182 LSQRSPGSSRSYPLSKYMAKAVDSYHSLPLSMLEQSRVDASNSTSLGEHLGSWKSDQADE 241
           LSQRS  S  S    K +AKAVD YHSLPLSMLEQ+++  S+  SL EHLGS  S    E
Sbjct: 229 LSQRSAFSVTS--PQKTVAKAVDLYHSLPLSMLEQAQIGTSDGFSLAEHLGSSISHHVPE 288

Query: 242 SPNWIAEEMIKSISAIYRELTEPPSTNHKNRSPISPLSSMYELSSQDLRSMR-NYEKSFN 301
           +PNW++EEMIK+ISAIY EL +PP  NH   S  SP+S+       D+ S +     SFN
Sbjct: 289 TPNWLSEEMIKTISAIYCELADPPLINHGYLS--SPVSNSSSQGQGDMWSPQCGKFSSFN 348

Query: 302 SNFENSFHTG---EFSAPYDTMLKVQWISRERMKDIDINRMLQGFRSLTSRLKDVDLKVM 361
           S+F++ F  G   EFS PY +M+KVQWI R+  K  DI   LQ +RSL  RL++VD++ M
Sbjct: 349 SHFDSPFGIGESKEFSGPYCSMVKVQWICRDSKKLQDIEHKLQYYRSLVCRLEEVDVRRM 408

Query: 362 KHDEKLAFWINVHNTLVMHAYLQYGIPKNSLK--TLILKAAYNVGGHIISVDMIQGSILG 421
           KH+EKLAFWINVHN LVMHA+L YGIPKN+LK  +L+LKAAYNVGG  IS+D IQ SILG
Sbjct: 409 KHEEKLAFWINVHNALVMHAFLVYGIPKNNLKRLSLLLKAAYNVGGQTISIDTIQSSILG 468

Query: 422 CRLPRPGQWLHLFLSSKTKLKVNYAQKSFRINHPEPRLYFALCCGSHSDPAVRVYTAKRL 481
           CRLPRPGQWL     SKTK KV  A++++ I  PEP L+FALC GS+SDPAVR+YT K++
Sbjct: 469 CRLPRPGQWLRFLFPSKTKFKVVDARRAYAIESPEPLLHFALCSGSYSDPAVRIYTPKKV 528

Query: 482 NEELEVAKEDYILSNLRTHKGQRILLPKIVESFAKDSGLCLEDLEDVVECLRLDGRINDG 501
            +ELEVAKE+YI SNL  +K Q+ILLPK++E FA+DS +C   L  +VE    D    + 
Sbjct: 529 FQELEVAKEEYIQSNLSVNKEQKILLPKVMEYFARDSDVCSAGLLQMVEQFMPDSLRKNL 588

BLAST of CmaCh19G006420 vs. TrEMBL
Match: A0A061E4R4_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_006317 PE=4 SV=1)

HSP 1 Score: 462.2 bits (1188), Expect = 1.2e-126
Identity = 280/550 (50.91%), Positives = 357/550 (64.91%), Query Frame = 1

Query: 4   MGNTK--------QVLDGDVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLCSAT 63
           MG TK        Q  + +VQNSLKQEIL LQE+L  QFV+R ALEKA++ +P +   A 
Sbjct: 1   MGQTKGRAEAKKSQTSNTEVQNSLKQEILHLQERLLDQFVVRRALEKALSHRPFTHDVAV 60

Query: 64  ENSIPKAEMDLIKQIAVLELEVVYLEKYLLSLYRRTFEQQVSSFSTMDD----------- 123
           EN IPKA M++IK+IAVLELEV YLEKYLLSLYR+ F+++ SS +T+ +           
Sbjct: 61  ENLIPKAAMEVIKEIAVLELEVAYLEKYLLSLYRKNFDKRFSSLTTVGEVLRRTSVAHKE 120

Query: 124 ------------SQESFGNQCKE----QNEVEEPEKSL------------HVHRSYSSLS 183
                        +E+   Q  +    +N +  P K               +HRS+SSLS
Sbjct: 121 MFPEVQAHYIMSDKENLATQSSDLETSRNSIGNPPKECSDIWGAEKLLDSSIHRSHSSLS 180

Query: 184 QRSPGSSRSYPLSKYMAKAVDSYHSLPLSMLEQSRVDASNSTSLGEHLGSWKSDQADESP 243
           QRS  S  S    K +AKAVD YHSLPLSMLEQ+++  S+  SL EHLGS  S    E+P
Sbjct: 181 QRSAFSVTS--PQKTVAKAVDLYHSLPLSMLEQAQIGTSDGFSLAEHLGSSISHHVPETP 240

Query: 244 NWIAEEMIKSISAIYRELTEPPSTNHKNRSPISPLSSMYELSSQDLRSMR-NYEKSFNSN 303
           NW++EEMIK+ISAIY EL +PP  NH   S  SP+S+       D+ S +     SFNS+
Sbjct: 241 NWLSEEMIKTISAIYCELADPPLINHGYLS--SPVSNSSSQGQGDMWSPQCGKFSSFNSH 300

Query: 304 FENSFHTG---EFSAPYDTMLKVQWISRERMKDIDINRMLQGFRSLTSRLKDVDLKVMKH 363
           F++ F  G   EFS PY +M+KVQWI R+  K  DI   LQ +RSL  RL++VD++ MKH
Sbjct: 301 FDSPFGIGESKEFSGPYCSMVKVQWICRDSKKLQDIEHKLQYYRSLVCRLEEVDVRRMKH 360

Query: 364 DEKLAFWINVHNTLVMHAYLQYGIPKNSLK--TLILKAAYNVGGHIISVDMIQGSILGCR 423
           +EKLAFWINVHN LVMHA+L YGIPKN+LK  +L+LKAAYNVGG  IS+D IQ SILGCR
Sbjct: 361 EEKLAFWINVHNALVMHAFLVYGIPKNNLKRLSLLLKAAYNVGGQTISIDTIQSSILGCR 420

Query: 424 LPRPGQWLHLFLSSKTKLKVNYAQKSFRINHPEPRLYFALCCGSHSDPAVRVYTAKRLNE 483
           LPRPGQWL     SKTK KV  A++++ I  PEP L+FALC GS+SDPAVR+YT K++ +
Sbjct: 421 LPRPGQWLRFLFPSKTKFKVVDARRAYAIESPEPLLHFALCSGSYSDPAVRIYTPKKVFQ 480

Query: 484 ELEVAKEDYILSNLRTHKGQRILLPKIVESFAKDSGLCLEDLEDVVECLRLDGRINDGRQ 501
           ELEVAKE+YI SNL  +K Q+ILLPK++E FA+DS +C   L  +VE    D    + +Q
Sbjct: 481 ELEVAKEEYIQSNLSVNKEQKILLPKVMEYFARDSDVCSAGLLQMVEQFMPDSLRKNLQQ 540

BLAST of CmaCh19G006420 vs. TAIR10
Match: AT5G66600.4 (AT5G66600.4 Protein of unknown function, DUF547)

HSP 1 Score: 297.4 bits (760), Expect = 2.7e-80
Identity = 184/390 (47.18%), Positives = 243/390 (62.31%), Query Frame = 1

Query: 102 MDDSQESFGNQCKEQN--EVEEPEKSLHVHRSYSSLSQRSPGSSRSYPLSKYMAKAVDSY 161
           +DD+Q    NQ K+     V+  +      RS+S   QRS   SR         KA  S 
Sbjct: 213 LDDNQ----NQSKKTEIAAVDRDQMDPSFRRSHS---QRSAFGSRKASPEDSWGKASRSC 272

Query: 162 HSLPLSMLEQSRVDASNSTSLGEHLGSWKSDQADESPNWIAEEMIKSISAIYRELTEPPS 221
           HS PL +      +  N  SL EHLG+  SD   E+PN ++E M+K +S IY +L EPPS
Sbjct: 273 HSQPLYVQ-----NGDNLISLAEHLGTRISDHVPETPNKLSEGMVKCMSEIYCKLAEPPS 332

Query: 222 TNHKN-RSPISPLSS-------MYELSSQDLRSMRNYEKSFNSNFENSFHTG---EFSAP 281
             H+   SP S LSS        Y+ SS    +      SF+   +NSFH     +FS P
Sbjct: 333 VLHRGLSSPNSSLSSSAFSPSDQYDTSSPGFGN----SSSFDVRLDNSFHVEGEKDFSGP 392

Query: 282 YDTMLKVQWISRERMKDIDINRMLQGFRSLTSRLKDVDLKVMKHDEKLAFWINVHNTLVM 341
           Y ++++V  I R+  K  ++  +LQ F+SL SRL++VD + +KH+EKLAFWINVHN LVM
Sbjct: 393 YSSIVEVLCIYRDAKKASEVEDLLQNFKSLISRLEEVDPRKLKHEEKLAFWINVHNALVM 452

Query: 342 HAYLQYGIPKNSLK--TLILKAAYNVGGHIISVDMIQGSILGCRLPRPGQWLHLFLSSKT 401
           HA+L YGIP+N++K   L+LKAAYN+GGH IS + IQ SILGC++  PGQWL L  +S+ 
Sbjct: 453 HAFLAYGIPQNNVKRVLLLLKAAYNIGGHTISAEAIQSSILGCKMSHPGQWLRLLFASR- 512

Query: 402 KLKVNYAQKSFRINHPEPRLYFALCCGSHSDPAVRVYTAKRLNEELEVAKEDYILSNLRT 461
           K K    + ++ I+HPEP L+FAL  GSHSDPAVRVYT KR+ +ELE +KE+YI  NL  
Sbjct: 513 KFKAGDERLAYAIDHPEPLLHFALTSGSHSDPAVRVYTPKRIQQELETSKEEYIRMNLSI 572

Query: 462 HKGQRILLPKIVESFAKDSGLCLEDLEDVV 477
            K QRILLPK+VE+FAKDSGLC   L ++V
Sbjct: 573 RK-QRILLPKLVETFAKDSGLCPAGLTEMV 584

BLAST of CmaCh19G006420 vs. TAIR10
Match: AT2G23700.1 (AT2G23700.1 Protein of unknown function, DUF547)

HSP 1 Score: 283.5 bits (724), Expect = 4.0e-76
Identity = 179/416 (43.03%), Positives = 250/416 (60.10%), Query Frame = 1

Query: 90  RTFEQQVSSFSTMDDSQESFGNQCKEQNEVEEPEKSLHVHRSYSSLSQRSPGSSRSYPLS 149
           R F+Q+ S    +D    SF N+ K+Q  +E+ +    V R  SSL+QRS  ++R  P  
Sbjct: 288 RQFDQESSR---IDSRCFSFDNRLKDQCFIEKEDIDSCVRRCQSSLNQRSTFNNRISPPE 347

Query: 150 KYMAKAVDSYHSLPLSMLEQSRVDASNSTSLGEHLGSWKSDQADESPNWIAEEMIKSISA 209
                +V + HS PLS+ E  + + SN  SL EH+G+  SD    +PN ++EEMIK  SA
Sbjct: 348 D----SVFACHSQPLSIHEYIQ-NGSNDASLAEHMGTRISDHIFMTPNKLSEEMIKCASA 407

Query: 210 IYRELTEPPSTNHKNRSPISPLSSMYELSSQDLRSMRNYEKSFNSNFENSFHTGEFSAPY 269
           IY +L +PPS NH   SP S  SS  E S QD   M +     NS+F++ F   EFS PY
Sbjct: 408 IYSKLADPPSINHGFSSPSSSPSSTSEFSPQDQYDMWSPSFRKNSSFDDQF---EFSGPY 467

Query: 270 DTMLKVQWISRERMKDIDINRMLQGFRSLTSRLKDVDLKVMKHDEKLAFWINVHNTLVMH 329
            +M++V  I R R K  D++ M + F  L  +L+ VD + + H EKLAFWINVHN LVMH
Sbjct: 468 SSMIEVSHIHRNR-KRRDLDLMNRNFSLLLKQLESVDPRKLTHQEKLAFWINVHNALVMH 527

Query: 330 AYLQYGIPKNSLKTLIL--KAAYNVGGHIISVDMIQGSILGCRLPRPGQWLHLFLSSKTK 389
            +L  GIP+N+ K  +L  K AY +GG ++S++ IQ  IL  ++PRPGQWL L L  K K
Sbjct: 528 TFLANGIPQNNGKRFLLLSKPAYKIGGRMVSLEAIQSYILRIKMPRPGQWLKLLLIPK-K 587

Query: 390 LKVNYAQKSFRINHPEPRLYFALCCGSHSDPAVRVYTAKRLNEELEVAKEDYILSNLRTH 449
            +     + + + H EP LYFALC G+HSDPA+RV+T K + +ELE AKE+YI +     
Sbjct: 588 FRTGDEHQEYSLEHSEPLLYFALCSGNHSDPAIRVFTPKGIYQELETAKEEYIRATFGVK 647

Query: 450 KGQRILLPKIVESFAKDSGLCLEDLEDVV-ECL-----RLDGRINDGRQLQRKKLW 498
           K Q+++LPKI+ESF+KDSGL    L +++ ECL     +   ++N GR  +    W
Sbjct: 648 KDQKLVLPKIIESFSKDSGLGQAALMEMIQECLPETMKKTIKKLNSGRSRKSIVEW 690

BLAST of CmaCh19G006420 vs. TAIR10
Match: AT3G18900.2 (AT3G18900.2 FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 258.5 bits (659), Expect = 1.4e-68
Identity = 178/443 (40.18%), Positives = 258/443 (58.24%), Query Frame = 1

Query: 56  ENSIPKAEMDLIKQIAVLELEVVYLEKYLLSLYRRTFEQQVSSFSTMDDSQESFGN-QCK 115
           ++SIPK    L+++IA LEL+V+YLE YLL LYRR F  +++S    ++ + S    +C 
Sbjct: 99  DSSIPKEAKKLVEEIAGLELQVMYLETYLLLLYRRFFNNKITSKLESEEKERSEDLLECT 158

Query: 116 E-----QNEVEEPEKSLH---VHRSYSSLSQRSPGSSRSYPLSKYMAKAVDS-YH-SLPL 175
           +     +  V  P+K +    + RS+SSLS  S  S R  P       A+DS YH SLP 
Sbjct: 159 KLIDSPKKGVCSPQKLVEDSGIFRSHSSLSHCSGYSFRMSP------HAMDSSYHRSLPF 218

Query: 176 SMLEQSRVDASNSTSLGEHLGSWKSDQADESPNWIAEEMIKSISAIYRELTEPPSTNHKN 235
           SMLEQS +D        E +G++ S+   +SPN ++EEM+K IS + R+L +P S ++  
Sbjct: 219 SMLEQSDID--------ELIGTYVSENVHKSPNSLSEEMVKCISELCRQLVDPGSLDN-- 278

Query: 236 RSPISPLSSMYELSSQDLRSMRNYEKSFNSNFENSFHTGEFSAPYDTMLKVQWISRERMK 295
                           DL S        +S F         S PYD +L V+ ISR+  K
Sbjct: 279 ----------------DLES--------SSPFRGKEPLKIISRPYDKLLMVKSISRDSEK 338

Query: 296 DIDINRMLQGFRSLTSRLKDVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPKNSLKTL 355
              +   L+ FRSL ++L+ V+ + + H+EKLAFWIN+HN+LVMH+ L YG PKNS+K +
Sbjct: 339 LNAVEPALKHFRSLVNKLEGVNPRKLNHEEKLAFWINIHNSLVMHSILVYGNPKNSMKRV 398

Query: 356 --ILKAAYNVGGHIISVDMIQGSILGCRLPRPGQWLHLFLSSKTKLKVNYAQKSFRINHP 415
             +LKAAYNVGG  +++D IQ SILGCR+ R         +S++K +     + + I H 
Sbjct: 399 SGLLKAAYNVGGRSLNLDTIQTSILGCRVFR------FLFASRSKGRAGDLGRDYAITHR 458

Query: 416 EPRLYFALCCGSHSDPAVRVYTAKRLNEELEVAKEDYILSNLRTHKGQRILLPKIVESFA 475
           E  L+FALC GS SDP+VR+YT K +  ELE  +E+Y+ SNL   K  +ILLPK+VE +A
Sbjct: 459 ESLLHFALCSGSLSDPSVRIYTPKNVMMELECGREEYVRSNLGISKDNKILLPKLVEIYA 495

Query: 476 KDSGLCLEDLEDVV-ECLRLDGR 485
           KD+ LC   + D++ +CL  + R
Sbjct: 519 KDTELCNVGVLDMIGKCLPCEAR 495

BLAST of CmaCh19G006420 vs. TAIR10
Match: AT1G76620.1 (AT1G76620.1 Protein of unknown function, DUF547)

HSP 1 Score: 212.6 bits (540), Expect = 8.7e-55
Identity = 171/472 (36.23%), Positives = 250/472 (52.97%), Query Frame = 1

Query: 40  LEKAM-NFQPLSLCSATENSIPKAEMDLIKQIAVLELEVVYLEKYLLSLYRRTFEQQV-S 99
           +EK +   Q  SLC      IPK+  +L K+IA +E+E++++E+YLLSLYR++FEQQ+ +
Sbjct: 61  IEKTLIRHQNSSLCL-----IPKSSEELKKEIASIEIEILHMERYLLSLYRKSFEQQLPN 120

Query: 100 SFSTMDDSQESFGNQCKEQNEVEEPEKSLHVHRSYSSLSQRSPGSSRSYPLSKYMAKAVD 159
           SFS +  +             V     SL  +++Y    Q+     RS+  S    KA  
Sbjct: 121 SFSNLSVTTTL-------PRSVTTSPTSLTHYQAY----QKPISYPRSFNTS---LKA-- 180

Query: 160 SYHSLPLSMLEQSRVDASNSTSLGEHLGSWKSDQADES----PNWIAEEMIKSISAIYRE 219
                 LS  E +RV  S + SLGE LGS  S   D +    PN ++E++++ IS++Y  
Sbjct: 181 ------LSSREGTRV-VSGTHSLGELLGS--SHIVDHNNFINPNKLSEDIMRCISSVYCT 240

Query: 220 LTEPPSTNHKNRSPISPLSSMYELSSQDLRSMRNYEKSFNSNFENSFHTGEFSAPYDTML 279
           L+   ++      P SP+SS    +S    S  NYE  ++ N  +  H        D +L
Sbjct: 241 LSRGSTSTTSTCFPASPVSSN---ASTIFSSKFNYEDKWSLNGASEDHFLNHCQDQDNVL 300

Query: 280 KVQWISRERMK-DIDINR------MLQGFRSLTSRLKDVDLKVMKHDEKLAFWINVHNTL 339
               +  E ++  +D         MLQ FRSL   L+ VD   MK +EKLAFWIN+HN L
Sbjct: 301 PCGVVVIEALRVHLDDGSFGYAALMLQNFRSLVQNLEKVDPSRMKREEKLAFWINIHNAL 360

Query: 340 VMHAYLQYGIPKNSLKTLILKAAYNVGGHIISVDMIQGSILGCR--LPRPGQWLHLFLSS 399
           VMHAYL YG    +  T +LKAAY++GG+ I+  +IQ SILG R     P   L    S 
Sbjct: 361 VMHAYLAYGTHNRARNTSVLKAAYDIGGYRINPYIIQSSILGIRPHYTSPSPLLQTLFSP 420

Query: 400 KTKLKVNYAQKSFRINHPEPRLYFALCCGSHSDPAVRVYTAKRLNEELEVAKEDYILSNL 459
             K K    +  + + +PE   +FA+  G+ +DP VRVYTA R+  +L  AK++YI SN+
Sbjct: 421 SRKSKTCSVRHIYALEYPEALAHFAISSGAFTDPTVRVYTADRIFRDLRQAKQEYIRSNV 480

Query: 460 RTHKGQRILLPKIVESFAKDSGLCLEDL-EDVVECLRLDGRINDGRQLQRKK 496
           R +KG +ILLPKI + + KD  + +  L E   +CL  D R    + L+ KK
Sbjct: 481 RVYKGTKILLPKIFQHYVKDMSMDVSKLMEATSQCLPEDARKIAEKCLKEKK 499

BLAST of CmaCh19G006420 vs. TAIR10
Match: AT1G21060.1 (AT1G21060.1 Protein of unknown function, DUF547)

HSP 1 Score: 202.2 bits (513), Expect = 1.2e-51
Identity = 161/467 (34.48%), Positives = 236/467 (50.54%), Query Frame = 1

Query: 39  ALEKAMNFQPLSLCSATENSIPKAEMDLIKQIAVLELEVVYLEKYLLSLYRRTFEQQVSS 98
           ++EK +     S C+ T    PK+  DL K+IA LE E++  E+YLLSLYR  F++QVSS
Sbjct: 45  SIEKRLLTYQDSNCNVT----PKSSEDLRKEIASLEFEILRTEQYLLSLYRTAFDEQVSS 104

Query: 99  FSTMDDSQESFGNQCKEQNEVEEPEKSLHVHRSYSSLSQRSPGSSRSYPLSKYMAKAVDS 158
           FS   ++     NQ   ++E  +       H   S  S+     S   P   + A     
Sbjct: 105 FSPHTET-SLVSNQFLPKSEQSDVTSVFSYHYQASPASE----CSSLCPPRSFQASL--- 164

Query: 159 YHSLPLSMLEQSRVDASNSTSLGEHLGSWKSDQADESPNWIAEEMIKSISAIYRELTEPP 218
                LS  E+SR  +S+ T+LG+ LGS        +P+ ++E++++ I ++Y  L+   
Sbjct: 165 ---KALSAREKSRYVSSSHTTLGDLLGSTLIVDNIANPSRLSEDILRCICSVYCTLSSKA 224

Query: 219 STNH-KNRSPISPLSSMYELSSQDLRSMRNYEKSFNSNFENSFHTGEFSAPYDTMLKVQW 278
             N     SP SP SS+   ++ D  + R+ E+             E + P   +++   
Sbjct: 225 RINSCLQASPSSP-SSVSSKATFDSLNSRHEERK------------EANVPGVVVIESLE 284

Query: 279 ISRERMKDIDINRMLQGFRSLTSRLKDVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIP 338
           +  +         MLQ FRSL  +L+ VD   MK +EKLAFWIN+HN L MHAYL YG  
Sbjct: 285 LHLDDGSFNHAAVMLQNFRSLVQKLEKVDPSRMKREEKLAFWINIHNALTMHAYLAYGTH 344

Query: 339 KNSLKTLILKAAYNVGGHIISVDMIQGSILGCRLPRPGQWLHLFLSSKTKLKVNYAQKSF 398
             +  T +LKAAY+VGG+ ++  +IQ SILG R       L    S   K K    +  +
Sbjct: 345 NRARNTSVLKAAYDVGGYSVNPYIIQSSILGIRPHFSQPLLQTLFSPSRKSKTCNVKHIY 404

Query: 399 RINHPEPRLYFALCCGSHSDPAVRVYTAKRLNEELEVAKEDYILSNLRTHKGQRILLPKI 458
            + +PE   +FAL  G  +DP VRVYTA  +  +L  +KE++I +N+R H   +ILLPKI
Sbjct: 405 ALEYPEALAHFALSSGFSTDPPVRVYTADCVFRDLRKSKEEFIRNNVRIHNETKILLPKI 464

Query: 459 VESFAKDSGLCLEDL-EDVVECLRLDGRINDGRQLQRKKLWKSIGRN 504
           V  +AKD  L    L E  V+CL       D  +   +KL K   RN
Sbjct: 465 VHYYAKDMSLEPSALMETTVKCL------PDSTKRTAQKLLKKKSRN 477

BLAST of CmaCh19G006420 vs. NCBI nr
Match: gi|659109793|ref|XP_008454883.1| (PREDICTED: uncharacterized protein LOC103495193 [Cucumis melo])

HSP 1 Score: 767.3 bits (1980), Expect = 2.6e-218
Identity = 412/536 (76.87%), Positives = 443/536 (82.65%), Query Frame = 1

Query: 5   GNTKQVLDGDVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLCSATENSIPKAEM 64
           GN +Q+ DGDVQ SLKQEILQL+EQLQSQF  RHALEKA+NFQPLSL SATE++IP+AEM
Sbjct: 6   GNKQQISDGDVQISLKQEILQLEEQLQSQFATRHALEKAINFQPLSLYSATEDAIPEAEM 65

Query: 65  DLIKQIAVLELEVVYLEKYLLSLYRRTFEQQVSSFSTMDDSQES---------------- 124
           +LIKQIAVLELEVVYLEKYLLSLYRRTF QQVSSFSTMDD  ES                
Sbjct: 66  ELIKQIAVLELEVVYLEKYLLSLYRRTFNQQVSSFSTMDDRLESYIEPNNVIEGEHSCIH 125

Query: 125 ----------FGNQCKEQNEVEEPEKSLHVHRSYSSLSQRSPGSSRSYPLSKYMAKAVDS 184
                     F NQ K +N VEEPEK  H+HRS SSLSQRS GSSR+Y LSKYMAKAVDS
Sbjct: 126 SDHIVSPETLFDNQSKGRNVVEEPEKLSHLHRSNSSLSQRSLGSSRNYSLSKYMAKAVDS 185

Query: 185 YHSLPLSMLEQSRVDASNSTSLGEHLGSWKSDQADESPNWIAEEMIKSISAIYRELTEPP 244
           YHS PLSMLEQSR+D  +STSLGEHLG+  S + DESPNW++EEMIKSISAIYREL EPP
Sbjct: 186 YHSFPLSMLEQSRIDVPSSTSLGEHLGACLSIRVDESPNWLSEEMIKSISAIYRELAEPP 245

Query: 245 STNHKNRSPISPLSSMYELSSQDLRSMRNYEKSFNSNFENSFHTGEFSAPYDTMLKVQWI 304
             NH N SPISPLSSMYELSSQD  SMRNYEKS NS+FEN FH  EF APYDTMLKVQWI
Sbjct: 246 LMNHNNPSPISPLSSMYELSSQDFGSMRNYEKSLNSHFENPFHIEEFIAPYDTMLKVQWI 305

Query: 305 SRERMKDIDINRMLQGFRSLTSRLKDVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPK 364
           SRER KD DIN MLQGFRSL  RLK+V LKVMKHDEKLAFWINVHNTLVMHAYLQYGIPK
Sbjct: 306 SRERKKDSDINHMLQGFRSLIFRLKEVKLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPK 365

Query: 365 NSLK--TLILKAAYNVGGHIISVDMIQGSILGCRLPRPGQWLHLFLSSKTKLKVNYAQKS 424
           + LK  +LILKAAYN+GGHIISVD IQ SILGCRLPR GQWLHLFLSSKTK KVN  QKS
Sbjct: 366 HCLKRISLILKAAYNIGGHIISVDKIQSSILGCRLPRSGQWLHLFLSSKTKFKVNDVQKS 425

Query: 425 FRINHPEPRLYFALCCGSHSDPAVRVYTAKRLNEELEVAKEDYILSNLRTHKGQRILLPK 484
           F INHPEPRLYFALCCG+ SDPAVR+YTAKR+NE+LEVAK++YILSNLR HKGQRILLPK
Sbjct: 426 FPINHPEPRLYFALCCGNLSDPAVRLYTAKRVNEQLEVAKDEYILSNLRVHKGQRILLPK 485

Query: 485 IVESFAKDSGLCLEDLEDVVECLRLDGRINDGRQLQRKKLWKSIGRNP-NPPFLFL 512
           IVESFAKDSGLCLEDLE+ VECLR + RIND +Q QRKK WKSIG  P N  F FL
Sbjct: 486 IVESFAKDSGLCLEDLENTVECLRSNRRINDIQQRQRKKFWKSIGWIPHNFTFSFL 541

BLAST of CmaCh19G006420 vs. NCBI nr
Match: gi|778725281|ref|XP_011658927.1| (PREDICTED: uncharacterized protein LOC101203131 [Cucumis sativus])

HSP 1 Score: 764.2 bits (1972), Expect = 2.2e-217
Identity = 411/536 (76.68%), Positives = 439/536 (81.90%), Query Frame = 1

Query: 5   GNTKQVLDGDVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLCSATENSIPKAEM 64
           GN +Q+ DGD Q SLKQEILQL+EQLQSQF  RHALEKA+NFQPLSL SATE++IP+AEM
Sbjct: 6   GNKQQISDGDAQISLKQEILQLEEQLQSQFATRHALEKAINFQPLSLYSATEDAIPEAEM 65

Query: 65  DLIKQIAVLELEVVYLEKYLLSLYRRTFEQQVSSFSTMDDSQES---------------- 124
           +LIKQIAVLELEVVYLEKYLLSLYRRTF QQVSSFSTMDD  ES                
Sbjct: 66  ELIKQIAVLELEVVYLEKYLLSLYRRTFNQQVSSFSTMDDRLESYIEPNNVIEGEHSCIH 125

Query: 125 ----------FGNQCKEQNEVEEPEKSLHVHRSYSSLSQRSPGSSRSYPLSKYMAKAVDS 184
                     F NQ K +N VEEPE   H+HRS SSLSQRS GSSR+Y LSK MAKAVDS
Sbjct: 126 SDHIGSPETLFDNQSKGRNVVEEPENLSHLHRSNSSLSQRSLGSSRNYSLSKSMAKAVDS 185

Query: 185 YHSLPLSMLEQSRVDASNSTSLGEHLGSWKSDQADESPNWIAEEMIKSISAIYRELTEPP 244
           YHS PLSMLEQSR+D  +STSLGEHLG+  S + DESPNW++EEMIKSISAIYREL EPP
Sbjct: 186 YHSFPLSMLEQSRIDVPSSTSLGEHLGACLSIRVDESPNWLSEEMIKSISAIYRELAEPP 245

Query: 245 STNHKNRSPISPLSSMYELSSQDLRSMRNYEKSFNSNFENSFHTGEFSAPYDTMLKVQWI 304
             NH N SPISPLSSMYELSSQD  SMRNYEKS NS+FEN FHT EF APYDTMLKVQWI
Sbjct: 246 LMNHNNPSPISPLSSMYELSSQDFGSMRNYEKSLNSHFENPFHTEEFIAPYDTMLKVQWI 305

Query: 305 SRERMKDIDINRMLQGFRSLTSRLKDVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPK 364
           SRER  D DIN MLQGFRSL  RLK+V LK MKHDEKLAFWINVHNTLVMHAYLQYGI K
Sbjct: 306 SRERKNDSDINHMLQGFRSLIFRLKEVKLKAMKHDEKLAFWINVHNTLVMHAYLQYGISK 365

Query: 365 NSLK--TLILKAAYNVGGHIISVDMIQGSILGCRLPRPGQWLHLFLSSKTKLKVNYAQKS 424
           + LK  +LILKAAYN+GGHIISVD IQ SILGCRLPR GQWLHLFLSSKTK KVN  QKS
Sbjct: 366 HCLKRISLILKAAYNIGGHIISVDKIQSSILGCRLPRSGQWLHLFLSSKTKFKVNDVQKS 425

Query: 425 FRINHPEPRLYFALCCGSHSDPAVRVYTAKRLNEELEVAKEDYILSNLRTHKGQRILLPK 484
           F INHPEPRLYFALCCGSHSDPAVR+YTAKR+NEELEVAKE+YILSNLR HKGQ+ILLPK
Sbjct: 426 FPINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELEVAKEEYILSNLRVHKGQKILLPK 485

Query: 485 IVESFAKDSGLCLEDLEDVVECLRLDGRINDGRQLQRKKLWKSIGRNP-NPPFLFL 512
           IVESFAKDSGLCLEDLE+ VECLR   RIND +Q QRKKLWKSIG  P N  F FL
Sbjct: 486 IVESFAKDSGLCLEDLENTVECLRSKRRINDIQQRQRKKLWKSIGWIPHNFTFSFL 541

BLAST of CmaCh19G006420 vs. NCBI nr
Match: gi|595801682|ref|XP_007201863.1| (hypothetical protein PRUPE_ppa020192mg [Prunus persica])

HSP 1 Score: 493.0 bits (1268), Expect = 9.5e-136
Identity = 284/524 (54.20%), Positives = 357/524 (68.13%), Query Frame = 1

Query: 8   KQVLDGDVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLCSATENSIPKAEMDLI 67
           +Q  D DVQ+SL QEILQLQ+QLQ QF++RHAL+KA++++PLS  S  E S+PK+  +++
Sbjct: 14  RQSTDSDVQSSLNQEILQLQKQLQDQFLVRHALQKALSYRPLSPDSTIETSLPKSTKEVV 73

Query: 68  KQIAVLELEVVYLEKYLLSLYRRTFEQQVSSFSTMDDSQE-------------------- 127
           K+IAVLELEVVYLE+YLLSLYR+TF+QQ+SS  ++   Q+                    
Sbjct: 74  KEIAVLELEVVYLERYLLSLYRKTFDQQISSELSVAPRQDATPENSNTVTYSDDLMSPRN 133

Query: 128 SFGNQCKEQNEVEEPEKSLH--VHRSYSSLSQRSPGSSRSYPLSKYMAKAVDSYHSLPLS 187
           S     KE N++ E +K L   +HR+YSSLSQRS  S+R+ P +K  AKAVDSYHSLP S
Sbjct: 134 SIVKPLKECNDIVEQQKLLDSSIHRTYSSLSQRSTCSTRTSPRTKSRAKAVDSYHSLPFS 193

Query: 188 MLEQSRVDASNSTSLGEHLGSWKSDQADESPNWIAEEMIKSISAIYRELTEPPSTNHKNR 247
           MLEQ++  A+ +    E L ++ SDQ  ++PN I+EEMIK I+ I+ EL +PP  +H   
Sbjct: 194 MLEQAQ-SATTNVYPTEQLETYFSDQVPDTPNCISEEMIKCIATIFCELADPPVISHDYS 253

Query: 248 SPISPLSSMYELSS----QDLRSMRNYEKSFNSNFENSFHT---GEFSAPYDTMLKVQWI 307
           S     SS Y LSS    +   S R     FNS+F+  FH     E S PY  MLKV  I
Sbjct: 254 SSPITSSSTYNLSSHSQGEKWSSKRTKVPFFNSHFDKPFHIEGPDEMSGPYCRMLKVHSI 313

Query: 308 SRERMKDIDINRMLQGFRSLTSRLKDVDLKVMKHDEKLAFWINVHNTLVMHAYLQYGIPK 367
            R+  K  D+   L+ FRSL  RL++VDL+ MKH+EKLAFWINVHN LVMHA+L YGIP+
Sbjct: 314 RRDAEKLGDVEHALKKFRSLIYRLEEVDLRTMKHEEKLAFWINVHNALVMHAFLVYGIPQ 373

Query: 368 NSLK--TLILKAAYNVGGHIISVDMIQGSILGCRLPRPGQWLHLFLSSKTKLKVNYAQKS 427
           N+LK  +L+LKAAYNVGGH ISVDMIQ SILGCRLPRPGQWL L  S KTK KV  A+K+
Sbjct: 374 NNLKRVSLLLKAAYNVGGHAISVDMIQRSILGCRLPRPGQWLRLLFSMKTKFKVGDARKA 433

Query: 428 FRINHPEPRLYFALCCGSHSDPAVRVYTAKRLNEELEVAKEDYILSNLRTHKGQRILLPK 487
           + I HPEP L+FALC GSHSDPAVRVYT+KR+ EELE AK +YI S    HK Q+ILLPK
Sbjct: 434 YSIEHPEPLLHFALCSGSHSDPAVRVYTSKRVFEELETAKHEYIQSTFLVHKEQKILLPK 493

Query: 488 IVESFAKDSGLCLEDLEDVVECLRLDGRINDGRQLQRKKLWKSI 501
           IVESFAKD+GLC  DL  ++E    D +    +  Q K+ WK I
Sbjct: 494 IVESFAKDTGLCSADLMGMIEHFMPDFQRKSNKPFQHKRTWKGI 536

BLAST of CmaCh19G006420 vs. NCBI nr
Match: gi|703097866|ref|XP_010096229.1| (hypothetical protein L484_026965 [Morus notabilis])

HSP 1 Score: 487.3 bits (1253), Expect = 5.2e-134
Identity = 294/555 (52.97%), Positives = 373/555 (67.21%), Query Frame = 1

Query: 14  DVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLCSATENSIPKAEMDLIKQIAVL 73
           DVQ+SL QEI++LQ QLQ QF++RHALEKA+++QPLS   A ++S+PK   +LIK+I VL
Sbjct: 80  DVQSSLNQEIMKLQRQLQDQFLVRHALEKALSYQPLSHEVAIDDSVPKPTKELIKEITVL 139

Query: 74  ELEVVYLEKYLLSLYRRTFEQQVSSFSTMD------------DSQESFG----------- 133
           ELE++YLE+YLLSLYR+TF+QQ+SS ST D             S E  G           
Sbjct: 140 ELEILYLERYLLSLYRKTFDQQMSSASTADGRLDSALVTEKGTSSEVSGQADIMPHRENS 199

Query: 134 ---------------NQCKEQNEVEEPEKSL--HVHRSYSSLSQRSPGSSRSYPLSKYMA 193
                             +E +++ EPEK L   +HRS+SSLSQRS  S R+ P  K++ 
Sbjct: 200 VLHHSRNLMPPRNSLGSLQECDDMWEPEKLLDSSIHRSHSSLSQRSFCSFRTSP-RKFLN 259

Query: 194 KAVDSYHSLPLSMLEQSRVDASNSTSLGEHLGSWKSDQADESPNWIAEEMIKSISAIYRE 253
            AV+SYHSLPLSMLEQ+     N+ SL EHLG+   D   E+PN ++EEMI+ ISAIY E
Sbjct: 260 NAVNSYHSLPLSMLEQAESSTPNA-SLAEHLGASLYDHVPETPNCLSEEMIRCISAIYCE 319

Query: 254 LTEPPSTNHKN-RSPISPLSSMYELS----SQDLRSMRNYEKSFNSNFENSFHTGE---F 313
           L++PP  N  +  SP++  SS+YE S    S+   S+      FNS+ +N FH  E    
Sbjct: 320 LSDPPLINQDSPSSPVAFSSSIYEASIHSHSEKWSSISRKIPFFNSHLDNPFHLDESEGL 379

Query: 314 SAPYDTMLKVQWISRERMKDIDINRMLQGFRSLTSRLKDVDLKVMKHDEKLAFWINVHNT 373
           + PY  MLKVQWI R+  K  +I+ MLQ FRSL  +L++VDL+ MKH+EKLAFWINVHN 
Sbjct: 380 TRPYFRMLKVQWICRDSEKLREIDNMLQKFRSLVYQLEEVDLRKMKHEEKLAFWINVHNA 439

Query: 374 LVMHAYLQYGIPKNSLK--TLILKAAYNVGGHIISVDMIQGSILGCRLPRPGQWLHLFLS 433
           LVMHA+L YGIP+N+LK  +L+LKAAYNVGGH ISVDMIQ SILGCRLPRPGQWL L  S
Sbjct: 440 LVMHAFLVYGIPQNNLKRVSLVLKAAYNVGGHTISVDMIQNSILGCRLPRPGQWLRLLFS 499

Query: 434 SKTKLKVNYAQKSFRINHPEPRLYFALCCGSHSDPAVRVYTAKRLNEELEVAKEDYILSN 493
           SKTK KV  A+K+F I+HPEP L+FALC GS  DPAVRVYT KR+ EELE AKE+Y+ SN
Sbjct: 500 SKTKFKVGDARKAFGIDHPEPLLHFALCSGSQYDPAVRVYTPKRVFEELETAKEEYVQSN 559

Query: 494 LRTHKGQRILLPKIVESFAKDSGLCLEDLEDVVECLRLDGRIND--GRQLQRKKLWKSIG 517
              HK Q+ILLPKIVESFAKDSGL L DL ++V+    D +  +      Q KK+ + + 
Sbjct: 560 FIVHKEQKILLPKIVESFAKDSGLSLIDLVEMVDHFMPDSQRKNISIHHCQHKKICQLMN 619

BLAST of CmaCh19G006420 vs. NCBI nr
Match: gi|1009165030|ref|XP_015900820.1| (PREDICTED: uncharacterized protein LOC107433935 [Ziziphus jujuba])

HSP 1 Score: 484.6 bits (1246), Expect = 3.4e-133
Identity = 285/541 (52.68%), Positives = 361/541 (66.73%), Query Frame = 1

Query: 8   KQVLDGDVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLCSATENSIPKAEMDLI 67
           +Q      Q+SL QEIL+LQ+QLQ QF +RHA+EKA++++PLS  +  +NS+PK   +LI
Sbjct: 51  RQPTQTSAQSSLNQEILKLQKQLQDQFAVRHAMEKALSYRPLSHDATIDNSVPKPTKELI 110

Query: 68  KQIAVLELEVVYLEKYLLSLYRRTFEQQVSSFSTMDD----------------------- 127
           K++AVLELE+VYLE++LLSLYR+TF +Q+SS S MD                        
Sbjct: 111 KEVAVLELEIVYLERHLLSLYRKTFYEQMSSVSAMDGRSSSAIVTQKELSIVAPGQYIMP 170

Query: 128 --------------SQESFGNQCKEQNEVEEPEKSL--HVHRSYSSLSQRSPGSSRSYPL 187
                          Q S  N+ KE N++ EP+K L   +HRS+SSLSQRS  S R+ P 
Sbjct: 171 DKEKSVIQSSNHVYPQNSLANRLKECNDIWEPQKLLDSSIHRSHSSLSQRSACSIRTSPP 230

Query: 188 SKYMAKAVDSYHSLPLSMLEQSRVDASNSTSLGEHLGSWKSDQADESPNWIAEEMIKSIS 247
            K + K VDSYHSLPLSMLEQ++  A+ + +L EHL ++ +D   E+PN ++EEMIK IS
Sbjct: 231 RKSLNKVVDSYHSLPLSMLEQAQ-SATPNPNLEEHLDTYLTDNIPETPNCLSEEMIKCIS 290

Query: 248 AIYRELTEPPSTNHK-NRSPISPLSSMYELSSQ---DLRSMRNYEKSFNSNFENSFH--- 307
            IY EL +PP  N     SP++  SS+YE+SSQ   +  S R+ +  F +   N FH   
Sbjct: 291 VIYCELADPPLINQDYTSSPVAFSSSLYEVSSQGQHEKWSSRSRKLPFFNLNVNPFHFEG 350

Query: 308 TGEFSAPYDTMLKVQWISRERMKDIDINRMLQGFRSLTSRLKDVDLKVMKHDEKLAFWIN 367
           + E S PY  MLKV  I R+  K   + + LQ FRSL  RL+++DL+ MKH+EKLAFWIN
Sbjct: 351 SEELSGPYCRMLKVHLICRDTEKLEVVEQTLQKFRSLVCRLEELDLRKMKHEEKLAFWIN 410

Query: 368 VHNTLVMHAYLQYGIPKNSLK--TLILKAAYNVGGHIISVDMIQGSILGCRLPRPGQWLH 427
           VHN LVMHAYL YGIP+N+LK  +L+LKAAYNVGGH ISVDMIQ SILGCRLPRPGQWL 
Sbjct: 411 VHNALVMHAYLVYGIPQNNLKRVSLLLKAAYNVGGHTISVDMIQNSILGCRLPRPGQWLR 470

Query: 428 LFLSSKTKLKVNYAQKSFRINHPEPRLYFALCCGSHSDPAVRVYTAKRLNEELEVAKEDY 487
           L  SSKTK KV  A+K++ I HPEP L+FALC G  SDPAVR+YT KR+ EELE AKE+Y
Sbjct: 471 LLFSSKTKFKVGDARKAYAIEHPEPLLHFALCSGIQSDPAVRIYTPKRVFEELETAKEEY 530

Query: 488 ILSNLRTHKGQRILLPKIVESFAKDSGLCLEDLEDVVECLRLDGRINDGRQLQRKKLWKS 501
           I S L  HK Q+ILLPKIVE FAKDSG+C   L +++E L  D R    +Q Q KK WK 
Sbjct: 531 IQSTLILHKEQKILLPKIVECFAKDSGMCSVGLAEMIERLMPDYRRKSIQQCQHKKNWKG 590

The following BLAST results are available for this feature:
Match Name  E-value  Identity  Description
PDV2_ARATH  2.6e-37  38.77     Plastid division protein PDV2 OS=Arabidopsis thaliana GN=PDV2 PE=1 SV=1
PDV1_ARATH  7.0e-06  29.63     Plastid division protein PDV1 OS=Arabidopsis thaliana GN=PDV1 PE=1 SV=1

Match Name        E-value   Identity  Description
A0A0A0K861_CUCSA  1.5e-217  76.68     Uncharacterized protein OS=Cucumis sativus GN=Csa_7G075610 PE=4 SV=1
M5VP81_PRUPE      6.6e-136  54.20     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa020192mg PE=4 SV=1
W9RBN6_9ROSA      3.6e-134  52.97     Uncharacterized protein OS=Morus notabilis GN=L484_026965 PE=4 SV=1
A0A061DWR5_THECC  4.3e-127  50.91     Uncharacterized protein isoform 3 OS=Theobroma cacao GN=TCM_006317 PE=4 SV=1
A0A061E4R4_THECC  1.2e-126  50.91     Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_006317 PE=4 SV=1

Match Name   E-value  Identity  Description
AT5G66600.4  2.7e-80  47.18     Protein of unknown function, DUF547
AT2G23700.1  4.0e-76  43.03     Protein of unknown function, DUF547
AT3G18900.2  1.4e-68  40.18     FUNCTIONS IN: molecular_function unknown
AT1G76620.1  8.7e-55  36.23     Protein of unknown function, DUF547
AT1G21060.1  1.2e-51  34.48     Protein of unknown function, DUF547

Match Name                         E-value   Identity  Description
gi|659109793|ref|XP_008454883.1|   2.6e-218  76.87     PREDICTED: uncharacterized protein LOC103495193 [Cucumis melo]
gi|778725281|ref|XP_011658927.1|   2.2e-217  76.68     PREDICTED: uncharacterized protein LOC101203131 [Cucumis sativus]
gi|595801682|ref|XP_007201863.1|   9.5e-136  54.20     hypothetical protein PRUPE_ppa020192mg [Prunus persica]
gi|703097866|ref|XP_010096229.1|   5.2e-134  52.97     hypothetical protein L484_026965 [Morus notabilis]
gi|1009165030|ref|XP_015900820.1|  3.4e-133  52.68     PREDICTED: uncharacterized protein LOC107433935 [Ziziphus jujuba]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR006869  DUF547
IPR025757  MIP1_Leuzipper
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0010020 chloroplast fission
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh19G006420.1  CmaCh19G006420.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description                              Source   Source Term     Source Description   Alignment
IPR006869  Domain of unknown function DUF547            PFAM     PF04784         DUF547               coord: 309..440; score: 7.3
IPR025757  Ternary complex factor MIP1, leucine-zipper  PFAM     PF14389         Lzipper-MIP1         coord: 16..94; score: 1.8
None       No IPR available                             unknown  Coil            Coil                 coord: 692..719; score: -; coord: 629..649; scor
None       No IPR available                             PANTHER  PTHR23054       UNCHARACTERIZED      coord: 8..496; score: 1.8E
None       No IPR available                             PANTHER  PTHR23054:SF20  SUBFAMILY NOT NAMED  coord: 8..496; score: 1.8E

The following gene(s) are paralogous to this gene:
Gene            Paralogue       Organism                 Block
CmaCh19G006420  CmaCh11G017820  Cucurbita maxima (Rimu)  cmacmaB149
CmaCh19G006420  CmaCh02G004190  Cucurbita maxima (Rimu)  cmacmaB452
CmaCh19G006420  CmaCh07G007410  Cucurbita maxima (Rimu)  cmacmaB462
The following block(s) are covering this gene:
Gene            Organism                    Block
CmaCh19G006420  Cucumber (Chinese Long) v3  cmacucB0608
CmaCh19G006420  Cucurbita maxima (Rimu)     cmacmaB446
CmaCh19G006420  Cucumber (Gy14) v1          cgycmaB0252
CmaCh19G006420  Cucurbita moschata (Rifu)   cmacmoB518
CmaCh19G006420  Wild cucumber (PI 183967)   cmacpiB512
CmaCh19G006420  Cucurbita pepo (Zucchini)   cmacpeB526
CmaCh19G006420  Bottle gourd (USVL1VR-Ls)   cmalsiB469
CmaCh19G006420  Cucumber (Gy14) v2          cgybcmaB212