CmoCh13G002250 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh13G002250
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: CCHC-type domain-containing protein
Location: Cmo_Chr13: 1760710 .. 1762791 (+)
RNA-Seq Expression: CmoCh13G002250
Synteny: CmoCh13G002250
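
The Location field implies the length of the feature. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention, not stated on this page); the variable names are illustrative:

```python
# Coordinates copied from the Location field; names are illustrative.
start, end = 1760710, 1762791

# Assumption: 1-based, inclusive coordinates (GFF-style convention).
span_length = end - start + 1
print(span_length)  # 2082

# If the whole span were coding, it would hold this many codons,
# counting the stop codon; a zero remainder means a whole number of codons.
codons, remainder = divmod(span_length, 3)
print(codons, remainder)  # 694 0
```

The gene, mRNA and CDS sequences listed for this feature appear identical, which would be consistent with a single-exon gene whose span is entirely coding.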
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAAAGCTGGTTGGCAATAACTATAGTTATTGGAAGTTATGCATGGAAGCTTATCTACAAGGGCAAGATTTATGGGATTTAATTGAAGGTGATGACACAGAAATTCCAGCCGATACTCCACAAAATGCAAAATTACGTCAACAATGGAAGATCAAGTGCGGAAAAACCTTATTTACCTTGCGAACTTTGATTAGCAAGGAGTATATTGATCATGTTCGTGATTTAAAGTCACCAAAGCAAGTATGGGATACACTTCAAAAGTTGTTCACCAAGAAAAATACCGCTCGACTGCAATTTCTAGAGAATGAACTAGCTATGGTAACTCAAGGTAATTTTTCTGTTGAAGAATATTTTTTGAAGGTGAAAAATCTATGTTCTGAAATTTCAGAATTAGATGCTGAGGAGCCAGTGAGTGAGGCTCGATTACGGCGTTATCTTATTCGTGGATTACGAAAAGAGTTTATGCCCTTTGTTTCCTCAATACAAGGGTGGGCAAATCAACCTACGGTAATTGAATTGGAGAATCTTCTTTCCAATCAGGAAGCCTTGATTAAGCAAATGACTAGCAGCAACGAGTTTTCCCCAAAGTTAGAAGGTGTGTTGTATGTCAAAGATCAAAGGAGACAGAATTTTCACTCAAGGCCTTCATCTAGCAATGAGAATCAATTCAGAAGTGACGAGTCGTCAAAGAAGCCATTTAAAGCTTGTTACAGGTGTGGAAAACCAGGCCACTTTAAACGAGATTGTCAGGCGAAAGTGGTGTGTGATCATTGCGGAAAGCCAGGCCATATTAAACCAAATTGTCGAGTCAAGATGCAGGAATCAGAAGCAAATGCGGTACATGAAAATAAAAGTTCTCCAGATCCAATTTGGGAATATTGCTTAACCACTGAGGTTCTTGACCAGCCAACAAACGTGACTTCAGCCGTATATCAAGATGATGTTTCTACAGGTGATCAAAATTCTAATTCCACTACTTACGCTTCTGAGTTTGATTCTCTCCAAATTTCGCATCTGGACTCTCTTTTTTTCTCCGATTTCAATTCTATTGCCCCTGATGACCCTTCTGTTTACTCCACGGCCTTGGATTTGAGATTCAACGAAAATGAAGTTGTCGAGCTAACGTTTGACGATCTTGACGGTCTTTACCTCCCCTCTGAGGCTGACGATTTTCTCATCTTGGAGAATTTAGATCAAACTACCAATTTGCAGGATTCGGCTACTGATGCTCCGCCCCAGTCTGACCCAGAAATAGCTGCACTGCTTGATGATGATGATTTATCACGATTTGGTTCTGATGTTGAGAACTTGGAGGAGGATTTCGTAGTTCAGGTAAATTTATGTGAAGAAGGGGAGGATGGTTCAATGGATAATAAGTTTAGTGTGGCTAAGGATTCTGAGAAGGCGACAGGAAGCCAGAATATTATTAATAATAAATCTTTTGGCGATGACATTTTTGAGGATGCAGATGTTGACCATGTGGAGGATGATGTTGATAAGCCACGAACTCGACAGCTTTTGGATGATCGATTTGATACCCTCTTAAGTCAGGACTATGCATCTAGTGACAACGGTGACAGTGCTTGTGATGAGCATGATGGTTGGGTAGCTGAAGAAGATGAGTCCCTTGCTCAAAAGCTCAAGCATGCCCTTGATGATCATAGTAAGGATGACTTGGACCTTGACCAAGGATATAAAGCTTCTGCAGATCAAGAGCTTTTGCAGTCTACCAGTGATGTAATTCACCGGTGTATGGAGTATGCTGAGTCTGCACAAGTTGAGATGAATTATTACTCTTTCTCATTAGAATTGAATCGCTGCGATGTGGAACTTATTAAGAAGGAAATCATTGAACATGATGTGTCAGTGTCTTATTCAGGTGGTGGCGTGCCTAAGAAGTCCATACGCAAGCCTAACTGGAAGAAATTCTTCGTAGATTATTTTGTTCGAGAGAACGTGTTAACGCAGGATGTTGAATTGCAGGAAATACATACTAA
TGAGCAAGTTGCAGACATATTTACTAAGGCACTTGCCAAAGTGAAGTTTGGAGTTTTTCGTAGAAGCTCTCGGAGTTATTGA

mRNA sequence

ATGGAAAAGCTGGTTGGCAATAACTATAGTTATTGGAAGTTATGCATGGAAGCTTATCTACAAGGGCAAGATTTATGGGATTTAATTGAAGGTGATGACACAGAAATTCCAGCCGATACTCCACAAAATGCAAAATTACGTCAACAATGGAAGATCAAGTGCGGAAAAACCTTATTTACCTTGCGAACTTTGATTAGCAAGGAGTATATTGATCATGTTCGTGATTTAAAGTCACCAAAGCAAGTATGGGATACACTTCAAAAGTTGTTCACCAAGAAAAATACCGCTCGACTGCAATTTCTAGAGAATGAACTAGCTATGGTAACTCAAGGTAATTTTTCTGTTGAAGAATATTTTTTGAAGGTGAAAAATCTATGTTCTGAAATTTCAGAATTAGATGCTGAGGAGCCAGTGAGTGAGGCTCGATTACGGCGTTATCTTATTCGTGGATTACGAAAAGAGTTTATGCCCTTTGTTTCCTCAATACAAGGGTGGGCAAATCAACCTACGGTAATTGAATTGGAGAATCTTCTTTCCAATCAGGAAGCCTTGATTAAGCAAATGACTAGCAGCAACGAGTTTTCCCCAAAGTTAGAAGGTGTGTTGTATGTCAAAGATCAAAGGAGACAGAATTTTCACTCAAGGCCTTCATCTAGCAATGAGAATCAATTCAGAAGTGACGAGTCGTCAAAGAAGCCATTTAAAGCTTGTTACAGGTGTGGAAAACCAGGCCACTTTAAACGAGATTGTCAGGCGAAAGTGGTGTGTGATCATTGCGGAAAGCCAGGCCATATTAAACCAAATTGTCGAGTCAAGATGCAGGAATCAGAAGCAAATGCGGTACATGAAAATAAAAGTTCTCCAGATCCAATTTGGGAATATTGCTTAACCACTGAGGTTCTTGACCAGCCAACAAACGTGACTTCAGCCGTATATCAAGATGATGTTTCTACAGGTGATCAAAATTCTAATTCCACTACTTACGCTTCTGAGTTTGATTCTCTCCAAATTTCGCATCTGGACTCTCTTTTTTTCTCCGATTTCAATTCTATTGCCCCTGATGACCCTTCTGTTTACTCCACGGCCTTGGATTTGAGATTCAACGAAAATGAAGTTGTCGAGCTAACGTTTGACGATCTTGACGGTCTTTACCTCCCCTCTGAGGCTGACGATTTTCTCATCTTGGAGAATTTAGATCAAACTACCAATTTGCAGGATTCGGCTACTGATGCTCCGCCCCAGTCTGACCCAGAAATAGCTGCACTGCTTGATGATGATGATTTATCACGATTTGGTTCTGATGTTGAGAACTTGGAGGAGGATTTCGTAGTTCAGGTAAATTTATGTGAAGAAGGGGAGGATGGTTCAATGGATAATAAGTTTAGTGTGGCTAAGGATTCTGAGAAGGCGACAGGAAGCCAGAATATTATTAATAATAAATCTTTTGGCGATGACATTTTTGAGGATGCAGATGTTGACCATGTGGAGGATGATGTTGATAAGCCACGAACTCGACAGCTTTTGGATGATCGATTTGATACCCTCTTAAGTCAGGACTATGCATCTAGTGACAACGGTGACAGTGCTTGTGATGAGCATGATGGTTGGGTAGCTGAAGAAGATGAGTCCCTTGCTCAAAAGCTCAAGCATGCCCTTGATGATCATAGTAAGGATGACTTGGACCTTGACCAAGGATATAAAGCTTCTGCAGATCAAGAGCTTTTGCAGTCTACCAGTGATGTAATTCACCGGTGTATGGAGTATGCTGAGTCTGCACAAGTTGAGATGAATTATTACTCTTTCTCATTAGAATTGAATCGCTGCGATGTGGAACTTATTAAGAAGGAAATCATTGAACATGATGTGTCAGTGTCTTATTCAGGTGGTGGCGTGCCTAAGAAGTCCATACGCAAGCCTAACTGGAAGAAATTCTTCGTAGATTATTTTGTTCGAGAGAACGTGTTAACGCAGGATGTTGAATTGCAGGAAATACATACTAA
TGAGCAAGTTGCAGACATATTTACTAAGGCACTTGCCAAAGTGAAGTTTGGAGTTTTTCGTAGAAGCTCTCGGAGTTATTGA

Coding sequence (CDS)

ATGGAAAAGCTGGTTGGCAATAACTATAGTTATTGGAAGTTATGCATGGAAGCTTATCTACAAGGGCAAGATTTATGGGATTTAATTGAAGGTGATGACACAGAAATTCCAGCCGATACTCCACAAAATGCAAAATTACGTCAACAATGGAAGATCAAGTGCGGAAAAACCTTATTTACCTTGCGAACTTTGATTAGCAAGGAGTATATTGATCATGTTCGTGATTTAAAGTCACCAAAGCAAGTATGGGATACACTTCAAAAGTTGTTCACCAAGAAAAATACCGCTCGACTGCAATTTCTAGAGAATGAACTAGCTATGGTAACTCAAGGTAATTTTTCTGTTGAAGAATATTTTTTGAAGGTGAAAAATCTATGTTCTGAAATTTCAGAATTAGATGCTGAGGAGCCAGTGAGTGAGGCTCGATTACGGCGTTATCTTATTCGTGGATTACGAAAAGAGTTTATGCCCTTTGTTTCCTCAATACAAGGGTGGGCAAATCAACCTACGGTAATTGAATTGGAGAATCTTCTTTCCAATCAGGAAGCCTTGATTAAGCAAATGACTAGCAGCAACGAGTTTTCCCCAAAGTTAGAAGGTGTGTTGTATGTCAAAGATCAAAGGAGACAGAATTTTCACTCAAGGCCTTCATCTAGCAATGAGAATCAATTCAGAAGTGACGAGTCGTCAAAGAAGCCATTTAAAGCTTGTTACAGGTGTGGAAAACCAGGCCACTTTAAACGAGATTGTCAGGCGAAAGTGGTGTGTGATCATTGCGGAAAGCCAGGCCATATTAAACCAAATTGTCGAGTCAAGATGCAGGAATCAGAAGCAAATGCGGTACATGAAAATAAAAGTTCTCCAGATCCAATTTGGGAATATTGCTTAACCACTGAGGTTCTTGACCAGCCAACAAACGTGACTTCAGCCGTATATCAAGATGATGTTTCTACAGGTGATCAAAATTCTAATTCCACTACTTACGCTTCTGAGTTTGATTCTCTCCAAATTTCGCATCTGGACTCTCTTTTTTTCTCCGATTTCAATTCTATTGCCCCTGATGACCCTTCTGTTTACTCCACGGCCTTGGATTTGAGATTCAACGAAAATGAAGTTGTCGAGCTAACGTTTGACGATCTTGACGGTCTTTACCTCCCCTCTGAGGCTGACGATTTTCTCATCTTGGAGAATTTAGATCAAACTACCAATTTGCAGGATTCGGCTACTGATGCTCCGCCCCAGTCTGACCCAGAAATAGCTGCACTGCTTGATGATGATGATTTATCACGATTTGGTTCTGATGTTGAGAACTTGGAGGAGGATTTCGTAGTTCAGGTAAATTTATGTGAAGAAGGGGAGGATGGTTCAATGGATAATAAGTTTAGTGTGGCTAAGGATTCTGAGAAGGCGACAGGAAGCCAGAATATTATTAATAATAAATCTTTTGGCGATGACATTTTTGAGGATGCAGATGTTGACCATGTGGAGGATGATGTTGATAAGCCACGAACTCGACAGCTTTTGGATGATCGATTTGATACCCTCTTAAGTCAGGACTATGCATCTAGTGACAACGGTGACAGTGCTTGTGATGAGCATGATGGTTGGGTAGCTGAAGAAGATGAGTCCCTTGCTCAAAAGCTCAAGCATGCCCTTGATGATCATAGTAAGGATGACTTGGACCTTGACCAAGGATATAAAGCTTCTGCAGATCAAGAGCTTTTGCAGTCTACCAGTGATGTAATTCACCGGTGTATGGAGTATGCTGAGTCTGCACAAGTTGAGATGAATTATTACTCTTTCTCATTAGAATTGAATCGCTGCGATGTGGAACTTATTAAGAAGGAAATCATTGAACATGATGTGTCAGTGTCTTATTCAGGTGGTGGCGTGCCTAAGAAGTCCATACGCAAGCCTAACTGGAAGAAATTCTTCGTAGATTATTTTGTTCGAGAGAACGTGTTAACGCAGGATGTTGAATTGCAGGAAATACATACTAA
TGAGCAAGTTGCAGACATATTTACTAAGGCACTTGCCAAAGTGAAGTTTGGAGTTTTTCGTAGAAGCTCTCGGAGTTATTGA

Protein sequence

MEKLVGNNYSYWKLCMEAYLQGQDLWDLIEGDDTEIPADTPQNAKLRQQWKIKCGKTLFTLRTLISKEYIDHVRDLKSPKQVWDTLQKLFTKKNTARLQFLENELAMVTQGNFSVEEYFLKVKNLCSEISELDAEEPVSEARLRRYLIRGLRKEFMPFVSSIQGWANQPTVIELENLLSNQEALIKQMTSSNEFSPKLEGVLYVKDQRRQNFHSRPSSSNENQFRSDESSKKPFKACYRCGKPGHFKRDCQAKVVCDHCGKPGHIKPNCRVKMQESEANAVHENKSSPDPIWEYCLTTEVLDQPTNVTSAVYQDDVSTGDQNSNSTTYASEFDSLQISHLDSLFFSDFNSIAPDDPSVYSTALDLRFNENEVVELTFDDLDGLYLPSEADDFLILENLDQTTNLQDSATDAPPQSDPEIAALLDDDDLSRFGSDVENLEEDFVVQVNLCEEGEDGSMDNKFSVAKDSEKATGSQNIINNKSFGDDIFEDADVDHVEDDVDKPRTRQLLDDRFDTLLSQDYASSDNGDSACDEHDGWVAEEDESLAQKLKHALDDHSKDDLDLDQGYKASADQELLQSTSDVIHRCMEYAESAQVEMNYYSFSLELNRCDVELIKKEIIEHDVSVSYSGGGVPKKSIRKPNWKKFFVDYFVRENVLTQDVELQEIHTNEQVADIFTKALAKVKFGVFRRSSRSY
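
The protein sequence above is the conceptual translation of the CDS. The sketch below shows one way to reproduce that step with the standard genetic code (NCBI translation table 1); the `translate` helper is hypothetical and is not the tool the database used. It is checked here only against the first 30 and last 9 bases of the CDS.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 and last 9 bases of the CDS listed above.
print(translate("ATGGAAAAGCTGGTTGGCAATAACTATAGT"))  # MEKLVGNNYS
print(translate("AGTTATTGA"))                       # SY
```

Both results agree with the start (MEKLVGNNYS) and end (SY) of the listed protein sequence.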
Homology
BLAST of CmoCh13G002250 vs. ExPASy Swiss-Prot
Match: P03370 (Gag-Pol polyprotein OS=Maedi visna virus (strain 1514) OX=11742 GN=pol PE=1 SV=2)

HSP 1 Score: 60.8 bits (146), Expect = 6.8e-08
Identity = 21/41 (51.22%), Positives = 29/41 (70.73%), Query Frame = 0

Query: 237 CYRCGKPGHFKRDCQAKVVCDHCGKPGHIKPNCRVKMQESE 278
           CY CGKPGH  R C+  ++C HCGK GH++ +CR K Q+ +
Sbjct: 387 CYNCGKPGHLARQCRQGIICHHCGKRGHMQKDCRQKKQQGK 427
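
The identity and positive counts reported for an HSP can be recovered from its match line (the row between Query and Sbjct): a letter marks an identical residue, and `+` marks a conservative substitution. A small sketch using the alignment above; the variable names are illustrative:

```python
# Query and match-line rows copied from the HSP above.
query   = "CYRCGKPGHFKRDCQAKVVCDHCGKPGHIKPNCRVKMQESE"
midline = "CY CGKPGH  R C+  ++C HCGK GH++ +CR K Q+ +"

assert len(query) == len(midline)
length = len(midline)

# Letters in the match line are identities; '+' adds the remaining positives.
identities = sum(ch.isalpha() for ch in midline)
positives = identities + midline.count("+")

print(f"Identity = {identities}/{length} ({100 * identities / length:.2f}%)")
print(f"Positives = {positives}/{length} ({100 * positives / length:.2f}%)")
# Identity = 21/41 (51.22%)
# Positives = 29/41 (70.73%)
```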

BLAST of CmoCh13G002250 vs. ExPASy Swiss-Prot
Match: P23426 (Gag-Pol polyprotein OS=Maedi visna virus (strain 1514 / clone LV1-1KS1) OX=11743 GN=pol PE=3 SV=2)

HSP 1 Score: 60.8 bits (146), Expect = 6.8e-08
Identity = 21/41 (51.22%), Positives = 29/41 (70.73%), Query Frame = 0

Query: 237 CYRCGKPGHFKRDCQAKVVCDHCGKPGHIKPNCRVKMQESE 278
           CY CGKPGH  R C+  ++C HCGK GH++ +CR K Q+ +
Sbjct: 387 CYNCGKPGHLARQCRQGIICHHCGKRGHMQKDCRQKKQQGK 427

BLAST of CmoCh13G002250 vs. ExPASy Swiss-Prot
Match: P23427 (Gag-Pol polyprotein OS=Maedi visna virus (strain 1514 / clone LV1-1KS2) OX=11744 GN=pol PE=3 SV=2)

HSP 1 Score: 60.8 bits (146), Expect = 6.8e-08
Identity = 21/41 (51.22%), Positives = 29/41 (70.73%), Query Frame = 0

Query: 237 CYRCGKPGHFKRDCQAKVVCDHCGKPGHIKPNCRVKMQESE 278
           CY CGKPGH  R C+  ++C HCGK GH++ +CR K Q+ +
Sbjct: 387 CYNCGKPGHLARQCRQGIICHHCGKRGHMQKDCRQKKQQGK 427

BLAST of CmoCh13G002250 vs. ExPASy Swiss-Prot
Match: P35956 (Gag-Pol polyprotein OS=Maedi visna virus (strain KV1772) OX=36374 GN=pol PE=1 SV=2)

HSP 1 Score: 60.8 bits (146), Expect = 6.8e-08
Identity = 21/41 (51.22%), Positives = 29/41 (70.73%), Query Frame = 0

Query: 237 CYRCGKPGHFKRDCQAKVVCDHCGKPGHIKPNCRVKMQESE 278
           CY CGKPGH  R C+  ++C HCGK GH++ +CR K Q+ +
Sbjct: 387 CYNCGKPGHLARQCRQGIICHHCGKRGHMQKDCRQKKQQGK 427

BLAST of CmoCh13G002250 vs. ExPASy Swiss-Prot
Match: P03352 (Gag polyprotein OS=Maedi visna virus (strain 1514) OX=11742 GN=gag PE=3 SV=1)

HSP 1 Score: 60.5 bits (145), Expect = 8.9e-08
Identity = 21/39 (53.85%), Positives = 28/39 (71.79%), Query Frame = 0

Query: 237 CYRCGKPGHFKRDCQAKVVCDHCGKPGHIKPNCRVKMQE 276
           CY CGKPGH  R C+  ++C HCGK GH++ +CR K Q+
Sbjct: 387 CYNCGKPGHLARQCRQGIICHHCGKRGHMQKDCRQKKQQ 425

BLAST of CmoCh13G002250 vs. ExPASy TrEMBL
Match: A0A6J1HJ50 (uncharacterized protein LOC111464046 OS=Cucurbita moschata OX=3662 GN=LOC111464046 PE=4 SV=1)

HSP 1 Score: 604.4 bits (1557), Expect = 6.1e-169
Identity = 320/369 (86.72%), Positives = 323/369 (87.53%), Query Frame = 0

Query: 325 STTYASEFDSLQISHLDSLFFSDFNSIAPDDPSVYSTALDLRFNENEVVELTFDDLDGLY 384
           STTYASEFDSLQISHLDSLFFSDFNSIAPDDPSVYSTALDLRF+ENEVVELTFDDLDGLY
Sbjct: 11  STTYASEFDSLQISHLDSLFFSDFNSIAPDDPSVYSTALDLRFDENEVVELTFDDLDGLY 70

Query: 385 LPSEADDFLILENLDQTTNLQDSATDAPPQSDPEIAALLDDDDLSRFGSDVENLEEDFVV 444
           LPSEADDFLILENLDQTTNLQDSATDAPPQSDPEIAALLDDDDLSRFGSDVE+LEEDFVV
Sbjct: 71  LPSEADDFLILENLDQTTNLQDSATDAPPQSDPEIAALLDDDDLSRFGSDVEDLEEDFVV 130

Query: 445 QVNLCEEGEDGSMDNKFSVAKDSEKATGSQNIINNKSFGDDIFEDADVDHVEDDVDKPRT 504
           Q                                           DADVDHVEDDVDKPRT
Sbjct: 131 Q-------------------------------------------DADVDHVEDDVDKPRT 190

Query: 505 RQLLDDRFDTLLSQDYASSDNGDSACDEHDGWVAEEDESLAQKLKHALDDHSKDDLDLDQ 564
           RQLLDDRFD LLSQDYASSDNGDSACDEHDGWVAEEDESLAQKLKHALDDHSKDDLDLDQ
Sbjct: 191 RQLLDDRFDILLSQDYASSDNGDSACDEHDGWVAEEDESLAQKLKHALDDHSKDDLDLDQ 250

Query: 565 GYKASADQELLQSTSDVIHRCMEYAESAQVEMNYYSFSLELNRCDVELIKKEIIEHDVSV 624
           GYKAS DQELLQSTSDVIHRCMEYAESAQVEMNYYSFSLELNRC+VELIKKEIIEHDVSV
Sbjct: 251 GYKASTDQELLQSTSDVIHRCMEYAESAQVEMNYYSFSLELNRCNVELIKKEIIEHDVSV 310

Query: 625 SYSGGGVPKKSIRKPNWKKFFVDYFVRENVLTQDVELQEIHTNEQVADIFTKALAKVKFG 684
           SYSGGGVPKKSIRKPNWKKFFVDYFVRENVLTQDVELQEIHTNEQVADIFTKALAKVKF 
Sbjct: 311 SYSGGGVPKKSIRKPNWKKFFVDYFVRENVLTQDVELQEIHTNEQVADIFTKALAKVKFE 336

Query: 685 VFRRSSRSY 694
           VFRRSSRSY
Sbjct: 371 VFRRSSRSY 336

BLAST of CmoCh13G002250 vs. ExPASy TrEMBL
Match: A0A5J5BCB3 (Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_024753 PE=4 SV=1)

HSP 1 Score: 436.0 bits (1120), Expect = 2.9e-118
Identity = 213/319 (66.77%), Positives = 253/319 (79.31%), Query Frame = 0

Query: 1   MEKLVGNNYSYWKLCMEAYLQGQDLWDLIEGDDTEIPADTPQNAKLRQQWKIKCGKTLFT 60
           ++KLVGNNYSYWKLCMEAYLQGQDLWDLI GDD  IP DTPQNA+LR++WKIKCGK LF 
Sbjct: 9   IDKLVGNNYSYWKLCMEAYLQGQDLWDLISGDDVVIPEDTPQNAELRRKWKIKCGKALFA 68

Query: 61  LRTLISKEYIDHVRDLKSPKQVWDTLQKLFTKKNTARLQFLENELAMVTQGNFSVEEYFL 120
           LRT IS+EYI HVRD KSPKQVW+TL++LFT+KNT RLQFLEN+LA +TQ N S+ EYFL
Sbjct: 69  LRTSISQEYIQHVRDGKSPKQVWETLERLFTQKNTMRLQFLENQLAGMTQDNLSISEYFL 128

Query: 121 KVKNLCSEISELDAEEPVSEARLRRYLIRGLRKEFMPFVSSIQGWANQPTVIELENLLSN 180
           K+K LCSEISELD EEPVS+ARLRRYLIRGLRKEFMPF+SSIQGWANQP++IELENLLSN
Sbjct: 129 KIKTLCSEISELDTEEPVSDARLRRYLIRGLRKEFMPFISSIQGWANQPSIIELENLLSN 188

Query: 181 QEALIKQMTSSNEFSP-KLEGVLYVKDQRRQNFHSRPSSSNENQFRSDESSKKPFKACYR 240
           QEAL+KQM SSN+ SP ++E  LY KD+ + N  S+ SS +  Q +++  S+   ++CYR
Sbjct: 189 QEALMKQMASSNKQSPSQVEDALYTKDKAKSNSFSKHSSGDNKQSKTEGQSRGNSRSCYR 248

Query: 241 CGKPGHFKRDCQAKVVCDHCGKPGHIKPNCRVKMQESEANAVHENKSSPDPIWEYCLTTE 300
           CGK GH KRDC+ KVVC+ CGK GHIK NCRV +  + AN  HE        WE CL+ E
Sbjct: 249 CGKLGHLKRDCRVKVVCNRCGKSGHIKQNCRVNL--TGANVAHETSEFEQLKWEQCLSIE 308

Query: 301 VLDQPTNVTSAVYQDDVST 319
            +DQP  + S V Q +V T
Sbjct: 309 AVDQPVILNSVVQQTNVET 325

BLAST of CmoCh13G002250 vs. ExPASy TrEMBL
Match: A0A6J1GDV3 (uncharacterized protein LOC111453101 OS=Cucurbita moschata OX=3662 GN=LOC111453101 PE=4 SV=1)

HSP 1 Score: 423.3 bits (1087), Expect = 1.9e-114
Identity = 209/212 (98.58%), Positives = 211/212 (99.53%), Query Frame = 0

Query: 133 DAEEPVSEARLRRYLIRGLRKEFMPFVSSIQGWANQPTVIELENLLSNQEALIKQMTSSN 192
           DAEEPVSEARLRRYLIRGLRKEFMPFVSSIQGWANQPTVIELENLLSNQEALI+QMT+SN
Sbjct: 5   DAEEPVSEARLRRYLIRGLRKEFMPFVSSIQGWANQPTVIELENLLSNQEALIEQMTTSN 64

Query: 193 EFSPKLEGVLYVKDQRRQNFHSRPSSSNENQFRSDESSKKPFKACYRCGKPGHFKRDCQA 252
           EFSPKLEGVLYVKDQRRQNFHSRPSSSNENQFRSDESSKKPFKACYRCGK GHFKRDCQA
Sbjct: 65  EFSPKLEGVLYVKDQRRQNFHSRPSSSNENQFRSDESSKKPFKACYRCGKSGHFKRDCQA 124

Query: 253 KVVCDHCGKPGHIKPNCRVKMQESEANAVHENKSSPDPIWEYCLTTEVLDQPTNVTSAVY 312
           KVVCDHCGKPGHIKPNCRVKMQESEANAVHENKSSPDPIWEYCLTTEVLDQPTNVTSAVY
Sbjct: 125 KVVCDHCGKPGHIKPNCRVKMQESEANAVHENKSSPDPIWEYCLTTEVLDQPTNVTSAVY 184

Query: 313 QDDVSTGDQNSNSTTYASEFDSLQISHLDSLF 345
           QDDVSTGDQNSNSTTYASEFDSLQISHLDSLF
Sbjct: 185 QDDVSTGDQNSNSTTYASEFDSLQISHLDSLF 216

BLAST of CmoCh13G002250 vs. ExPASy TrEMBL
Match: A0A443N8T5 (Integrase, catalytic core OS=Cinnamomum micranthum f. kanehirae OX=337451 GN=CKAN_00329300 PE=4 SV=1)

HSP 1 Score: 399.4 bits (1025), Expect = 3.0e-107
Identity = 205/323 (63.47%), Positives = 245/323 (75.85%), Query Frame = 0

Query: 1   MEKLVGNNYSYWKLCMEAYLQGQDLWDLIEGDDTEIPADTPQNAKLRQQWKIKCGKTLFT 60
           ++KLVGNNY+YWKLCMEAYLQGQDLWDLI GD+  IP DT QNA L ++WKIKCGK LF 
Sbjct: 9   IDKLVGNNYNYWKLCMEAYLQGQDLWDLISGDNAVIPEDTSQNADLWRKWKIKCGKALFA 68

Query: 61  LRTLISKEYIDHVRDLKSPKQVWDTLQKLFTKKNTARLQFLENELAMVTQGNFSVEEYFL 120
           LRT IS++YI  VRD+ SPKQVW+ L++LFT+KNT RLQ+LENELA +TQG  S+ EYFL
Sbjct: 69  LRTSISQDYIARVRDVSSPKQVWEILERLFTQKNTMRLQYLENELAGMTQGTLSIPEYFL 128

Query: 121 KVKNLCSEISELDAEEPVSEARLRRYLIRGLRKEFMPFVSSIQGWANQPTVIELENLLSN 180
           KVK LC+EISELD EEPVS+ARL RYLIRGLRKEFMPF+SSIQGWA QP++IELENLLSN
Sbjct: 129 KVKTLCAEISELDTEEPVSDARLHRYLIRGLRKEFMPFISSIQGWATQPSIIELENLLSN 188

Query: 181 QEALIKQMTSSNEFSPKL-EGVLYVKDQRRQNFHSRPS-----SSNENQFRSDESSKKPF 240
           QEAL+KQMTS+++ S  L E  LY KDQ  +NF  + S     S+NE +FR +       
Sbjct: 189 QEALVKQMTSNDKKSLSLVEDALYTKDQGNKNFFKQGSDDTEQSNNEGKFRGNS------ 248

Query: 241 KACYRCGKPGHFKRDCQAKVVCDHCGKPGHIKPNCRVKMQESEANAVHENKSSPDPIWEY 300
           K C+RCG+ GH KRDC A+VVC+ CGK GHIK NCRVK+ E+ AN   E   S    WE+
Sbjct: 249 KGCFRCGQLGHIKRDCHARVVCNRCGKSGHIKANCRVKLMEAGANVAQEKDESEQSTWEH 308

Query: 301 CLTTEVLDQPTNVTSAVYQDDVS 318
            L+    +Q T VTSA  Q DV+
Sbjct: 309 GLSI-TANQSTIVTSA--QTDVN 322

BLAST of CmoCh13G002250 vs. ExPASy TrEMBL
Match: A0A443N8T5 (Integrase, catalytic core OS=Cinnamomum micranthum f. kanehirae OX=337451 GN=CKAN_00329300 PE=4 SV=1)

HSP 1 Score: 58.5 bits (140), Expect = 1.2e-04
Identity = 28/40 (70.00%), Positives = 33/40 (82.50%), Query Frame = 0

Query: 648  YFVRENVLTQDVELQEIHTNEQVADIFTKALAKVKFGVFR 688
            +F RE VLTQD++LQ+I T+ QVADIFTKAL K KF VFR
Sbjct: 1237 HFAREKVLTQDIQLQKIRTDVQVADIFTKALGKAKFEVFR 1276


HSP 2 Score: 394.0 bits (1011), Expect = 1.3e-105
Identity = 201/319 (63.01%), Positives = 240/319 (75.24%), Query Frame = 0

Query: 1   MEKLVGNNYSYWKLCMEAYLQGQDLWDLIEGDDTEIPADTPQNAKLRQQWKIKCGKTLFT 60
           ++KLVGNNYSY KLCMEAYLQGQ+LWDLI GDD  I  DTPQN +LR++WKIK GK LF 
Sbjct: 9   VDKLVGNNYSYRKLCMEAYLQGQNLWDLISGDDVVILEDTPQNVELRRKWKIKYGKALFA 68

Query: 61  LRTLISKEYIDHVRDLKSPKQVWDTLQKLFTKKNTARLQFLENELAMVTQGNFSVEEYFL 120
           LRT IS+EYI HVRD KSPKQVW TL++LFT+KNT RLQFL+NELA +TQ N S+ EYFL
Sbjct: 69  LRTSISQEYIQHVRDGKSPKQVWKTLERLFTQKNTMRLQFLKNELAGMTQDNLSILEYFL 128

Query: 121 KVKNLCSEISELDAEEPVSEARLRRYLIRGLRKEFMPFVSSIQGWANQPTVIELENLLSN 180
           K+K LCSEISELD EEPVS+ARL RYLI GLRKEFMPF+SSIQGWANQP +IELENLLSN
Sbjct: 129 KIKTLCSEISELDTEEPVSDARLHRYLILGLRKEFMPFISSIQGWANQPFIIELENLLSN 188

Query: 181 QEALIKQMTSSNEFSP-KLEGVLYVKDQRRQNFHSRPSSSNENQFRSDESSKKPFKACYR 240
           QEAL+KQ+ S+N+ SP ++E  LY KD+ + N  S+ SS++  Q ++   S+   K+ YR
Sbjct: 189 QEALMKQIASNNKQSPSQVEDALYTKDKAKSNSFSKHSSADSKQSKTKGQSRGNSKSYYR 248

Query: 241 CGKPGHFKRDCQAKVVCDHCGKPGHIKPNCRVKMQESEANAVHENKSSPDPIWEYCLTTE 300
           CGK GH KRDC  KVVC+ C K  HIK NCRV +    AN  H+        WE CL+ E
Sbjct: 249 CGKLGHLKRDCHVKVVCNRCEKSVHIKQNCRVNL--IGANVAHKTSKFEQLKWEQCLSIE 308

Query: 301 VLDQPTNVTSAVYQDDVST 319
            +DQP  + S V Q +V T
Sbjct: 309 AVDQPDILNSVVQQTNVET 325

BLAST of CmoCh13G002250 vs. NCBI nr
Match: XP_023542088.1 (uncharacterized protein LOC111802075 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1052.7 bits (2721), Expect = 1.3e-303
Identity = 533/538 (99.07%), Positives = 535/538 (99.44%), Query Frame = 0

Query: 156 MPFVSSIQGWANQPTVIELENLLSNQEALIKQMTSSNEFSPKLEGVLYVKDQRRQNFHSR 215
           MPFVSSIQGWANQPTVIELENLLSNQEALIKQMTSSNEFSPKLEGVLYVKDQRRQNFHSR
Sbjct: 1   MPFVSSIQGWANQPTVIELENLLSNQEALIKQMTSSNEFSPKLEGVLYVKDQRRQNFHSR 60

Query: 216 PSSSNENQFRSDESSKKPFKACYRCGKPGHFKRDCQAKVVCDHCGKPGHIKPNCRVKMQE 275
           PSSSNENQFRSDESSKKPFKACYRCGKPGHFKRDCQAKVVCDHCGKPGHIKPNCRVKMQE
Sbjct: 61  PSSSNENQFRSDESSKKPFKACYRCGKPGHFKRDCQAKVVCDHCGKPGHIKPNCRVKMQE 120

Query: 276 SEANAVHENKSSPDPIWEYCLTTEVLDQPTNVTSAVYQDDVSTGDQNSNSTTYASEFDSL 335
           SEANAVHENKSSPDPIWEYCLTTEVLDQPTNVTSAVYQDDVSTGDQNSNSTTYASEFDSL
Sbjct: 121 SEANAVHENKSSPDPIWEYCLTTEVLDQPTNVTSAVYQDDVSTGDQNSNSTTYASEFDSL 180

Query: 336 QISHLDSLFFSDFNSIAPDDPSVYSTALDLRFNENEVVELTFDDLDGLYLPSEADDFLIL 395
           QISHLDSLFFSDFNSIAPDDPSVYSTALDLRF+ENEVVELTFDDLDGLYLPSEADDFLIL
Sbjct: 181 QISHLDSLFFSDFNSIAPDDPSVYSTALDLRFDENEVVELTFDDLDGLYLPSEADDFLIL 240

Query: 396 ENLDQTTNLQDSATDAPPQSDPEIAALLDDDDLSRFGSDVENLEEDFVVQVNLCEEGEDG 455
           ENLDQTTN QDSATDAPPQSDPEIAALLDDDDLSRFGSDVE+LEEDFVVQVNLCEEGEDG
Sbjct: 241 ENLDQTTNSQDSATDAPPQSDPEIAALLDDDDLSRFGSDVEDLEEDFVVQVNLCEEGEDG 300

Query: 456 SMDNKFSVAKDSEKATGSQNIINNKSFGDDIFEDADVDHVEDDVDKPRTRQLLDDRFDTL 515
           SMDNKFSVAKDSEKATGSQNIINNKSFGDDIFEDADVDHVEDDVDKPRTRQLLDDRFDTL
Sbjct: 301 SMDNKFSVAKDSEKATGSQNIINNKSFGDDIFEDADVDHVEDDVDKPRTRQLLDDRFDTL 360

Query: 516 LSQDYASSDNGDSACDEHDGWVAEEDESLAQKLKHALDDHSKDDLDLDQGYKASADQELL 575
           LSQDYASSDNGDSACDEHDGWVAEEDESLAQKLKHALDDHSKDDLDLDQGYKASADQELL
Sbjct: 361 LSQDYASSDNGDSACDEHDGWVAEEDESLAQKLKHALDDHSKDDLDLDQGYKASADQELL 420

Query: 576 QSTSDVIHRCMEYAESAQVEMNYYSFSLELNRCDVELIKKEIIEHDVSVSYSGGGVPKKS 635
           QSTSDVIHRCMEYAESAQVEMNYYSFSLELNRCDVELIKKEIIEHDVSVSYSGGGVPKKS
Sbjct: 421 QSTSDVIHRCMEYAESAQVEMNYYSFSLELNRCDVELIKKEIIEHDVSVSYSGGGVPKKS 480

Query: 636 IRKPNWKKFFVDYFVRENVLTQDVELQEIHTNEQVADIFTKALAKVKFGVFRRSSRSY 694
           IRKPNWKKFFVDYFVRENVLTQDVELQEIHTNEQVADIFTK LAKVKF VFRRSSRSY
Sbjct: 481 IRKPNWKKFFVDYFVRENVLTQDVELQEIHTNEQVADIFTKVLAKVKFEVFRRSSRSY 538

BLAST of CmoCh13G002250 vs. NCBI nr
Match: XP_023542089.1 (uncharacterized protein LOC111802075 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 990.7 bits (2560), Expect = 6.3e-285
Identity = 501/506 (99.01%), Positives = 503/506 (99.41%), Query Frame = 0

Query: 188 MTSSNEFSPKLEGVLYVKDQRRQNFHSRPSSSNENQFRSDESSKKPFKACYRCGKPGHFK 247
           MTSSNEFSPKLEGVLYVKDQRRQNFHSRPSSSNENQFRSDESSKKPFKACYRCGKPGHFK
Sbjct: 1   MTSSNEFSPKLEGVLYVKDQRRQNFHSRPSSSNENQFRSDESSKKPFKACYRCGKPGHFK 60

Query: 248 RDCQAKVVCDHCGKPGHIKPNCRVKMQESEANAVHENKSSPDPIWEYCLTTEVLDQPTNV 307
           RDCQAKVVCDHCGKPGHIKPNCRVKMQESEANAVHENKSSPDPIWEYCLTTEVLDQPTNV
Sbjct: 61  RDCQAKVVCDHCGKPGHIKPNCRVKMQESEANAVHENKSSPDPIWEYCLTTEVLDQPTNV 120

Query: 308 TSAVYQDDVSTGDQNSNSTTYASEFDSLQISHLDSLFFSDFNSIAPDDPSVYSTALDLRF 367
           TSAVYQDDVSTGDQNSNSTTYASEFDSLQISHLDSLFFSDFNSIAPDDPSVYSTALDLRF
Sbjct: 121 TSAVYQDDVSTGDQNSNSTTYASEFDSLQISHLDSLFFSDFNSIAPDDPSVYSTALDLRF 180

Query: 368 NENEVVELTFDDLDGLYLPSEADDFLILENLDQTTNLQDSATDAPPQSDPEIAALLDDDD 427
           +ENEVVELTFDDLDGLYLPSEADDFLILENLDQTTN QDSATDAPPQSDPEIAALLDDDD
Sbjct: 181 DENEVVELTFDDLDGLYLPSEADDFLILENLDQTTNSQDSATDAPPQSDPEIAALLDDDD 240

Query: 428 LSRFGSDVENLEEDFVVQVNLCEEGEDGSMDNKFSVAKDSEKATGSQNIINNKSFGDDIF 487
           LSRFGSDVE+LEEDFVVQVNLCEEGEDGSMDNKFSVAKDSEKATGSQNIINNKSFGDDIF
Sbjct: 241 LSRFGSDVEDLEEDFVVQVNLCEEGEDGSMDNKFSVAKDSEKATGSQNIINNKSFGDDIF 300

Query: 488 EDADVDHVEDDVDKPRTRQLLDDRFDTLLSQDYASSDNGDSACDEHDGWVAEEDESLAQK 547
           EDADVDHVEDDVDKPRTRQLLDDRFDTLLSQDYASSDNGDSACDEHDGWVAEEDESLAQK
Sbjct: 301 EDADVDHVEDDVDKPRTRQLLDDRFDTLLSQDYASSDNGDSACDEHDGWVAEEDESLAQK 360

Query: 548 LKHALDDHSKDDLDLDQGYKASADQELLQSTSDVIHRCMEYAESAQVEMNYYSFSLELNR 607
           LKHALDDHSKDDLDLDQGYKASADQELLQSTSDVIHRCMEYAESAQVEMNYYSFSLELNR
Sbjct: 361 LKHALDDHSKDDLDLDQGYKASADQELLQSTSDVIHRCMEYAESAQVEMNYYSFSLELNR 420

Query: 608 CDVELIKKEIIEHDVSVSYSGGGVPKKSIRKPNWKKFFVDYFVRENVLTQDVELQEIHTN 667
           CDVELIKKEIIEHDVSVSYSGGGVPKKSIRKPNWKKFFVDYFVRENVLTQDVELQEIHTN
Sbjct: 421 CDVELIKKEIIEHDVSVSYSGGGVPKKSIRKPNWKKFFVDYFVRENVLTQDVELQEIHTN 480

Query: 668 EQVADIFTKALAKVKFGVFRRSSRSY 694
           EQVADIFTK LAKVKF VFRRSSRSY
Sbjct: 481 EQVADIFTKVLAKVKFEVFRRSSRSY 506

BLAST of CmoCh13G002250 vs. NCBI nr
Match: XP_022963858.1 (uncharacterized protein LOC111464046 [Cucurbita moschata])

HSP 1 Score: 604.4 bits (1557), Expect = 1.3e-168
Identity = 320/369 (86.72%), Positives = 323/369 (87.53%), Query Frame = 0

Query: 325 STTYASEFDSLQISHLDSLFFSDFNSIAPDDPSVYSTALDLRFNENEVVELTFDDLDGLY 384
           STTYASEFDSLQISHLDSLFFSDFNSIAPDDPSVYSTALDLRF+ENEVVELTFDDLDGLY
Sbjct: 11  STTYASEFDSLQISHLDSLFFSDFNSIAPDDPSVYSTALDLRFDENEVVELTFDDLDGLY 70

Query: 385 LPSEADDFLILENLDQTTNLQDSATDAPPQSDPEIAALLDDDDLSRFGSDVENLEEDFVV 444
           LPSEADDFLILENLDQTTNLQDSATDAPPQSDPEIAALLDDDDLSRFGSDVE+LEEDFVV
Sbjct: 71  LPSEADDFLILENLDQTTNLQDSATDAPPQSDPEIAALLDDDDLSRFGSDVEDLEEDFVV 130

Query: 445 QVNLCEEGEDGSMDNKFSVAKDSEKATGSQNIINNKSFGDDIFEDADVDHVEDDVDKPRT 504
           Q                                           DADVDHVEDDVDKPRT
Sbjct: 131 Q-------------------------------------------DADVDHVEDDVDKPRT 190

Query: 505 RQLLDDRFDTLLSQDYASSDNGDSACDEHDGWVAEEDESLAQKLKHALDDHSKDDLDLDQ 564
           RQLLDDRFD LLSQDYASSDNGDSACDEHDGWVAEEDESLAQKLKHALDDHSKDDLDLDQ
Sbjct: 191 RQLLDDRFDILLSQDYASSDNGDSACDEHDGWVAEEDESLAQKLKHALDDHSKDDLDLDQ 250

Query: 565 GYKASADQELLQSTSDVIHRCMEYAESAQVEMNYYSFSLELNRCDVELIKKEIIEHDVSV 624
           GYKAS DQELLQSTSDVIHRCMEYAESAQVEMNYYSFSLELNRC+VELIKKEIIEHDVSV
Sbjct: 251 GYKASTDQELLQSTSDVIHRCMEYAESAQVEMNYYSFSLELNRCNVELIKKEIIEHDVSV 310

Query: 625 SYSGGGVPKKSIRKPNWKKFFVDYFVRENVLTQDVELQEIHTNEQVADIFTKALAKVKFG 684
           SYSGGGVPKKSIRKPNWKKFFVDYFVRENVLTQDVELQEIHTNEQVADIFTKALAKVKF 
Sbjct: 311 SYSGGGVPKKSIRKPNWKKFFVDYFVRENVLTQDVELQEIHTNEQVADIFTKALAKVKFE 336

Query: 685 VFRRSSRSY 694
           VFRRSSRSY
Sbjct: 371 VFRRSSRSY 336

BLAST of CmoCh13G002250 vs. NCBI nr
Match: KAA8540328.1 (hypothetical protein F0562_024753 [Nyssa sinensis])

HSP 1 Score: 436.0 bits (1120), Expect = 5.9e-118
Identity = 213/319 (66.77%), Positives = 253/319 (79.31%), Query Frame = 0

Query: 1   MEKLVGNNYSYWKLCMEAYLQGQDLWDLIEGDDTEIPADTPQNAKLRQQWKIKCGKTLFT 60
           ++KLVGNNYSYWKLCMEAYLQGQDLWDLI GDD  IP DTPQNA+LR++WKIKCGK LF 
Sbjct: 9   IDKLVGNNYSYWKLCMEAYLQGQDLWDLISGDDVVIPEDTPQNAELRRKWKIKCGKALFA 68

Query: 61  LRTLISKEYIDHVRDLKSPKQVWDTLQKLFTKKNTARLQFLENELAMVTQGNFSVEEYFL 120
           LRT IS+EYI HVRD KSPKQVW+TL++LFT+KNT RLQFLEN+LA +TQ N S+ EYFL
Sbjct: 69  LRTSISQEYIQHVRDGKSPKQVWETLERLFTQKNTMRLQFLENQLAGMTQDNLSISEYFL 128

Query: 121 KVKNLCSEISELDAEEPVSEARLRRYLIRGLRKEFMPFVSSIQGWANQPTVIELENLLSN 180
           K+K LCSEISELD EEPVS+ARLRRYLIRGLRKEFMPF+SSIQGWANQP++IELENLLSN
Sbjct: 129 KIKTLCSEISELDTEEPVSDARLRRYLIRGLRKEFMPFISSIQGWANQPSIIELENLLSN 188

Query: 181 QEALIKQMTSSNEFSP-KLEGVLYVKDQRRQNFHSRPSSSNENQFRSDESSKKPFKACYR 240
           QEAL+KQM SSN+ SP ++E  LY KD+ + N  S+ SS +  Q +++  S+   ++CYR
Sbjct: 189 QEALMKQMASSNKQSPSQVEDALYTKDKAKSNSFSKHSSGDNKQSKTEGQSRGNSRSCYR 248

Query: 241 CGKPGHFKRDCQAKVVCDHCGKPGHIKPNCRVKMQESEANAVHENKSSPDPIWEYCLTTE 300
           CGK GH KRDC+ KVVC+ CGK GHIK NCRV +  + AN  HE        WE CL+ E
Sbjct: 249 CGKLGHLKRDCRVKVVCNRCGKSGHIKQNCRVNL--TGANVAHETSEFEQLKWEQCLSIE 308

Query: 301 VLDQPTNVTSAVYQDDVST 319
            +DQP  + S V Q +V T
Sbjct: 309 AVDQPVILNSVVQQTNVET 325

BLAST of CmoCh13G002250 vs. NCBI nr
Match: XP_022949820.1 (uncharacterized protein LOC111453101 [Cucurbita moschata] >XP_022949832.1 uncharacterized protein LOC111453107 [Cucurbita moschata])

HSP 1 Score: 423.3 bits (1087), Expect = 4.0e-114
Identity = 209/212 (98.58%), Positives = 211/212 (99.53%), Query Frame = 0

Query: 133 DAEEPVSEARLRRYLIRGLRKEFMPFVSSIQGWANQPTVIELENLLSNQEALIKQMTSSN 192
           DAEEPVSEARLRRYLIRGLRKEFMPFVSSIQGWANQPTVIELENLLSNQEALI+QMT+SN
Sbjct: 5   DAEEPVSEARLRRYLIRGLRKEFMPFVSSIQGWANQPTVIELENLLSNQEALIEQMTTSN 64

Query: 193 EFSPKLEGVLYVKDQRRQNFHSRPSSSNENQFRSDESSKKPFKACYRCGKPGHFKRDCQA 252
           EFSPKLEGVLYVKDQRRQNFHSRPSSSNENQFRSDESSKKPFKACYRCGK GHFKRDCQA
Sbjct: 65  EFSPKLEGVLYVKDQRRQNFHSRPSSSNENQFRSDESSKKPFKACYRCGKSGHFKRDCQA 124

Query: 253 KVVCDHCGKPGHIKPNCRVKMQESEANAVHENKSSPDPIWEYCLTTEVLDQPTNVTSAVY 312
           KVVCDHCGKPGHIKPNCRVKMQESEANAVHENKSSPDPIWEYCLTTEVLDQPTNVTSAVY
Sbjct: 125 KVVCDHCGKPGHIKPNCRVKMQESEANAVHENKSSPDPIWEYCLTTEVLDQPTNVTSAVY 184

Query: 313 QDDVSTGDQNSNSTTYASEFDSLQISHLDSLF 345
           QDDVSTGDQNSNSTTYASEFDSLQISHLDSLF
Sbjct: 185 QDDVSTGDQNSNSTTYASEFDSLQISHLDSLF 216

BLAST of CmoCh13G002250 vs. TAIR 10
Match: AT3G49990.1 (unknown protein; Has 1524 Blast hits to 1298 proteins in 225 species: Archae - 9; Bacteria - 84; Metazoa - 474; Fungi - 184; Plants - 98; Viruses - 17; Other Eukaryotes - 658 (source: NCBI BLink). )

HSP 1 Score: 82.4 bits (202), Expect = 1.6e-15
Identity = 62/189 (32.80%), Positives = 98/189 (51.85%), Query Frame = 0

Query: 416 DPEIAALLDDDDLSRFGSDVENLEEDFVVQVNLCEEGEDGSMDN---KFSVAKDSEKATG 475
           DPE+AALL++ D S FGSDVE+LEEDFVVQ NL ++GE   + N   +FSV ++  +   
Sbjct: 177 DPEVAALLENSDGSEFGSDVEDLEEDFVVQANLTQKGESSGVSNGELEFSVRREVRERES 236

Query: 476 SQNIINNKSFGDDIFEDADVDHVEDDVDKPRTRQLLDDRFDTLLSQDYASSDNGDSACDE 535
            + +  N                      PR  + +D+ FD L   +Y S  +GD    E
Sbjct: 237 DEPVAEN----------------------PRVPRQIDELFDQLELNEYGSDSDGDGYIAE 296

Query: 536 HDGWVAEEDESLAQKLKHALDDHSKDDLDLDQGYKASA----------DQELLQSTSDVI 592
            DG   EE++ +AQ++++ +   +K D +L++ Y   A          D+E + + + VI
Sbjct: 297 -DGEEEEEEDFMAQEVQNLIHGKAK-DYELEEKYMNPADILKNSDSVRDKEEVDTAAHVI 341

BLAST of CmoCh13G002250 vs. TAIR 10
Match: AT5G48050.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G34070.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 65.5 bits (158), Expect = 2.0e-10
Identity = 52/228 (22.81%), Positives = 102/228 (44.74%), Query Frame = 0

Query: 8   NYSYWKLCMEAYLQGQDLWDLIEGDDTEIPADTPQNAKLRQQWKIKCGKTLFTLRTLISK 67
           NY  W+   E       +   I+G  T  P          ++WK + G     +   I+ 
Sbjct: 32  NYDVWRELFETLCLSFGVLGHIDGSSTPTP-------MTEKRWKERDGLVKMWIYGTITD 91

Query: 68  EYIDHVRDLK-SPKQVWDTLQKLFTKKNTARLQFLENELAMVTQGNFSVEEYFLKVKNLC 127
             +D +  +  + + +W +L+ LF     AR    ENEL   T  + SV EY  K+K+L 
Sbjct: 92  SLLDTIIKVGCTARDLWLSLENLFRDNKEARALQFENELRTTTIDDLSVHEYCQKLKSLS 151

Query: 128 SEISELDAEEPVSEARLRRYLIRGLRKEFMPFVSSIQGWANQPTVIELENLLSNQEALIK 187
             ++ +D+  P+S+  L  +L+ GL +++   ++ I+  +  P+  E  ++L  +E+ + 
Sbjct: 152 DLLTNVDS--PISDRVLVMHLLNGLTEKYDYILNVIKHKSPFPSFTEARSMLLMEESRLS 211

Query: 188 QMTS---SNEFSPKLEGVLYVKDQRRQNFHSRPSSSNENQFRSDESSK 232
             +    S+   P L  VL+   ++++ +     ++N N  R     K
Sbjct: 212 NKSKSSLSHTNHPSLSNVLFTVPRQQERYPQEYHNNNSNMGRGRSKKK 250

BLAST of CmoCh13G002250 vs. TAIR 10
Match: AT1G05150.1 (Calcium-binding tetratricopeptide family protein )

HSP 1 Score: 53.5 bits (127), Expect = 7.7e-07
Identity = 23/36 (63.89%), Positives = 29/36 (80.56%), Query Frame = 0

Query: 605 LNRCDVELIKKEIIEHDVSVSYSGGGVPKKSIRKPN 641
           L +CDVE ++KE+ ++DV VSYSG G P KSIRKPN
Sbjct: 545 LGKCDVEAVRKEMRDNDVPVSYSGSGGPTKSIRKPN 580

BLAST of CmoCh13G002250 vs. TAIR 10
Match: AT1G21280.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 52.4 bits (124), Expect = 1.7e-06
Identity = 32/125 (25.60%), Positives = 63/125 (50.40%), Query Frame = 0

Query: 7   NNYSYWKLCMEAYLQGQDLWDLIEGDDTEIPADTPQNAKLRQQWKIKCGKTLFTLRTLIS 66
           +NY  WK+   ++L+    +  I+G    +P   P  + L Q W+      ++ L   ++
Sbjct: 40  DNYVAWKIRFRSFLRVTKKFGFIDG---TLPKPDP-FSPLYQPWEQCNAMVMYWLMNSMT 99

Query: 67  KEYIDHVRDLKSPKQVWDTLQKLFTKKNTARLQFLENELAMVTQGNFSVEEYFLKVKNLC 126
            + ++ V   ++  ++W+ L+++F      ++  L   LA + QG  SVEEYF K+  + 
Sbjct: 100 DKLLESVMYAETAHKMWEDLRRVFVPCVDLKIYQLRRRLATLRQGGDSVEEYFGKLSKVW 159

Query: 127 SEISE 132
            E+SE
Sbjct: 160 MELSE 160

BLAST of CmoCh13G002250 vs. TAIR 10
Match: AT1G75560.1 (zinc knuckle (CCHC-type) family protein )

HSP 1 Score: 47.8 bits (112), Expect = 4.2e-05
Identity = 24/64 (37.50%), Positives = 32/64 (50.00%), Query Frame = 0

Query: 209 RQNFHSRPSSSNENQFRSDESSKKPF---KACYRCGKPGHFKRDCQAKVVCDHCGKPGHI 268
           R ++H  PS       R +   ++ F     C  C +PGHF RDC    VC++CG PGHI
Sbjct: 33  RVSYHDAPS-------RREREPRRAFSQGNLCNNCKRPGHFARDCSNVSVCNNCGLPGHI 89

Query: 269 KPNC 270
              C
Sbjct: 93  AAEC 89

The following BLAST results are available for this feature:

BLAST vs. ExPASy Swiss-Prot
Match Name        E-value     Identity (%)   Description
P03370            6.8e-08     51.22          Gag-Pol polyprotein OS=Maedi visna virus (strain 1514) OX=11742 GN=pol PE=1 SV=2
P23426            6.8e-08     51.22          Gag-Pol polyprotein OS=Maedi visna virus (strain 1514 / clone LV1-1KS1) OX=11743...
P23427            6.8e-08     51.22          Gag-Pol polyprotein OS=Maedi visna virus (strain 1514 / clone LV1-1KS2) OX=11744...
P35956            6.8e-08     51.22          Gag-Pol polyprotein OS=Maedi visna virus (strain KV1772) OX=36374 GN=pol PE=1 SV...
P03352            8.9e-08     53.85          Gag polyprotein OS=Maedi visna virus (strain 1514) OX=11742 GN=gag PE=3 SV=1

BLAST vs. ExPASy TrEMBL
Match Name        E-value     Identity (%)   Description
A0A6J1HJ50        6.1e-169    86.72          uncharacterized protein LOC111464046 OS=Cucurbita moschata OX=3662 GN=LOC1114640...
A0A5J5BCB3        2.9e-118    66.77          Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_024753 PE=4 SV=1
A0A6J1GDV3        1.9e-114    98.58          uncharacterized protein LOC111453101 OS=Cucurbita moschata OX=3662 GN=LOC1114531...
A0A443N8T5        3.0e-107    63.47          Integrase, catalytic core OS=Cinnamomum micranthum f. kanehirae OX=337451 GN=CKA...
A0A443N8T5        1.2e-04     70.00          Integrase, catalytic core OS=Cinnamomum micranthum f. kanehirae OX=337451 GN=CKA...

BLAST vs. NCBI nr
Match Name        E-value     Identity (%)   Description
XP_023542088.1    1.3e-303    99.07          uncharacterized protein LOC111802075 isoform X1 [Cucurbita pepo subsp. pepo]
XP_023542089.1    6.3e-285    99.01          uncharacterized protein LOC111802075 isoform X2 [Cucurbita pepo subsp. pepo]
XP_022963858.1    1.3e-168    86.72          uncharacterized protein LOC111464046 [Cucurbita moschata]
KAA8540328.1      5.9e-118    66.77          hypothetical protein F0562_024753 [Nyssa sinensis]
XP_022949820.1    4.0e-114    98.58          uncharacterized protein LOC111453101 [Cucurbita moschata] >XP_022949832.1 unchar...

BLAST vs. TAIR 10
Match Name        E-value     Identity (%)   Description
AT3G49990.1       1.6e-15     32.80          unknown protein; Has 1524 Blast hits to 1298 proteins in 225 species: Archae - 9...
AT5G48050.1       2.0e-10     22.81          CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE...
AT1G05150.1       7.7e-07     63.89          Calcium-binding tetratricopeptide family protein
AT1G21280.1       1.7e-06     25.60          CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Ha...
AT1G75560.1       4.2e-05     37.50          zinc knuckle (CCHC-type) family protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term     IPR Description                       Source        Source Term      Source Description                     Alignment
None         No IPR available                      COILS         Coil             Coil                                   coord: 171..191
None         No IPR available                      GENE3D        4.10.60.10       -                                      coord: 210..280; e-value: 1.3E-10; score: 42.9
None         No IPR available                      PFAM          PF14223          Retrotran_gag_2                        coord: 50..181; e-value: 1.8E-13; score: 50.5
None         No IPR available                      PANTHER       PTHR34222        FAMILY NOT NAMED                       coord: 58..252
None         No IPR available                      PANTHER       PTHR34222:SF8    SUBFAMILY NOT NAMED                    coord: 58..252
IPR001878    Zinc finger, CCHC-type                SMART         SM00343          c2hcfinal6                             coord: 255..271; e-value: 0.016; score: 22.3
IPR001878    Zinc finger, CCHC-type                SMART         SM00343          c2hcfinal6                             coord: 236..252; e-value: 6.8E-5; score: 32.3
IPR001878    Zinc finger, CCHC-type                PFAM          PF00098          zf-CCHC                                coord: 236..251; e-value: 1.9E-6; score: 27.7
IPR001878    Zinc finger, CCHC-type                PROSITE       PS50158          ZF_CCHC                                coord: 256..270; score: 9.026372
IPR001878    Zinc finger, CCHC-type                PROSITE       PS50158          ZF_CCHC                                coord: 237..250; score: 11.119688
IPR025314    Domain of unknown function DUF4219    PFAM          PF13961          DUF4219                                coord: 6..30; e-value: 2.3E-9; score: 36.7
IPR036875    Zinc finger, CCHC-type superfamily    SUPERFAMILY   57756            Retrovirus zinc finger-like domains    coord: 228..272
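
The two CCHC-type zinc finger hits can be cross-checked against the protein sequence with a simple pattern search. The sketch below uses the query segment from the Swiss-Prot alignments in the Homology section (which starts at residue 237) and a simplified zinc-knuckle consensus, C-X2-C-X4-H-X4-C. That regular expression is an assumption made for illustration; the PROSITE entry PS50158 is a profile, not this pattern, and the variable names are hypothetical.

```python
import re

# Query segment from the Swiss-Prot alignments; it starts at residue 237.
segment = "CYRCGKPGHFKRDCQAKVVCDHCGKPGHIKPNCRVKMQESE"
segment_start = 237  # 1-based position of the segment's first residue

# Simplified zinc-knuckle consensus C-X2-C-X4-H-X4-C (illustrative assumption).
CCHC = re.compile(r"C.{2}C.{4}H.{4}C")

# Convert 0-based match offsets to 1-based, inclusive protein coordinates.
hits = [
    (segment_start + m.start(), segment_start + m.end() - 1, m.group())
    for m in CCHC.finditer(segment)
]
for first, last, motif in hits:
    print(first, last, motif)
# 237 250 CYRCGKPGHFKRDC
# 256 269 CDHCGKPGHIKPNC
```

The first match coincides with the PROSITE hit at 237..250, and the second falls within the PROSITE hit at 256..270.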

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
CmoCh13G002250.1    CmoCh13G002250.1    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0044260        cellular macromolecule metabolic process
biological_process    GO:0019538        protein metabolic process
biological_process    GO:0042274        ribosomal small subunit biogenesis
molecular_function    GO:0140096        catalytic activity, acting on a protein
molecular_function    GO:0016787        hydrolase activity
molecular_function    GO:0003676        nucleic acid binding
molecular_function    GO:0008270        zinc ion binding