Cp4.1LG01g10530.1 (mRNA) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g10530.1
Type: mRNA
Organism: Cucurbita pepo (Zucchini)
Description: RNA binding protein, putative
Location: Cp4.1LG01 : 6538968 .. 6542897 (-)
Sequence length: 1732
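
The location and length fields can be cross-checked with a little arithmetic. The sketch below assumes the two coordinates are 1-based and inclusive (an assumption, the page does not state its convention) and derives the genomic span and the number of bases in that span that are absent from the spliced mRNA. The variable names are illustrative only.

```python
# Values copied from the Location and Sequence length fields of this record.
start, end = 6538968, 6542897
mrna_length = 1732

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
genomic_span = end - start + 1

# Bases inside the genomic span that are not part of the spliced mRNA
# (that is, intronic sequence under the assumption above).
non_mrna_bases = genomic_span - mrna_length

print(genomic_span)
print(non_mrna_bases)
```

The difference between the two lengths is consistent with the record offering a separate "Gene sequence (with intron)" alongside the shorter mRNA sequence.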
The following sequences are available for this feature:

Gene sequence (with intron)

TGTTGCTTCCCTCAACATGATGAATTGGCCAATGCATCACTTCTCTTTCTTTTTTACAACTTCCATTGCCTTCCTCTAAACTTCTATTCCATCTTCTTCCATTCATCATTGCATGCTTTGGAATCAAAACCAGAGGAAATACACACATACCCATCAATACCCATCATGTCTTTTTAACCCAAATACTTGAACCCCTCCTTTCCTTTTGAGTTATTGTTTCATAAATTTCGAATTTGAGCGTTTTTCTTTATGTGAATTCTATCGTATTCTTCGATTCTCTTCGTGGGTTCTTGTGGGTGTTTCTTCTTTTGCCTCATCTTTGCTTCTTTTTCGAGTTTCTAATGGGACCCATTAGAGGGTTCAAGAGGAAGAAGCAGAAGAAGGCACAGAAAAAGGTTGTCCAATATGTCTTCGCTGCTGCTTCACTGTCGTTTCAGCCACAGCCCTTGGATTGGTGGGATGAGTTCTCACAGAGGATTACTGGTAAACCACTTTTTGCTTCTCTTCTACTTGAATTTGTGCATTGTTATTGCTCTTAGTTGCTTTCTGTTTATTGGGTCTCTCCGTTGAATTCTTTTGCTGTTTGGTGAGCTTTTTGGACGGTGGTGGAATGTAGTGTTTAATAATCATATATCTTTGATGCCTGCAATTATAGAACCATGATATGTTCCTAATAACATTGAGAGGGTTAGAAGGGGCAGCCTGTTGAAGTGATCCTATGGGCATGGAGATGGATCAACCCTGTTTAGCAGCAGATATTAGCATAGATATGTTAGTTCATTTGATAATTGATGTGTACAAGTTGTGGCTACTCACTTAGCTGGTTTATAGAGTGTTAACTGTGTTGTGTTCTGATGTTCTTCTTCTTTTTCTCATTGAATTGTTGAAATATTAATTATTTCTTAGCTGCTCCTACTTTTTTGAGCTTTGGTCACTGTAACGGCCCCAATCCCACCGCTAGTAGATATTGTCCTCTTTCGGCTTTCCCTCAAAGTTTTTAAAACGTGTCTACTAGGGAGAGGTTTCCACACCCTTATAAAGAATGCTTCGTTCCCCTCTCTAACCGATGTGGGATCTCACAATCCACCCCCTCTGGGGCCCAGTGTCCTTGCGGGCACTCGTTCCCCTCTCCAATCGATGTGCGATCTCACGGCAATAAACATCTTTAGAGAATTGAGTCTAATGTGATCCAAAATGCTCGTTTTATTCTTTATTGATTCGTATTAAAGATATCACTCTGAATATATAGTACCTCGAAGTTAGAATCAGTCATCTGTTGACATTTGTAAGAAGGCAGTGTCTGAGAAATCGAAGATGAAGAAGAAGGGCGGGGGATGAACAATATTCAACTTCAAGTATGTGTTTTTTATGGTCTGCTGTTGTTTATGCATAGAAATGGAAGGATGACTATAAATATCTTTATCTGGTTGTTTTAGAGGCATCTTGGAAACGGGTATGTTTAAGTTTCGATTCGAGTCAATGTAGTAATGCTACAATGCATAGAAGTTCATAGAAAATGGATCTCTTCTAGGTATTATAAGATTTGTTTCAACCTACTGATATGGCTAAGTGCTCAATATTTGATTTTGGTACTCCAGAGGACCTAACAACATGATCAAAGTGGGTAGGTTTTGTTAGATGTTAGTAAAGCAAGAAGACTAGATGAGCTTTTAACGCATGCTGGAGTCTAGTTTCACTAGATATCAGTTCTTATATTTTGATATTCCATGGTTCTGAAATGTTGGGTACACATTTAGCAGTTACTTTGTTCTGGCCATGATTTATGATGCAACAAGTAAATCTTTACTTTTAGCTTAAAACCAACGGATTTGACTAAATATGTGGACCTATGTGTAATAGCCCAAACTCACTACTAGTAGATACTGTCTTCTTTGGGATACCCTCTAGGTTTTAAAACGCGTATGTTAGGGAGAGGTTTCCACACCCTTATAAAGAATGTTTCGTTTCCCTCTCCAATCAATGTGGGGTCTCACAATCCA
CTTCCCTTGGGGCCCAACATCTCGTTGGCACACCGTCCGGTGCATGACTCTAATACCATTTGTACCAGCTCAAGCCCACCACTAGTAGATATTATCTTCTTTGGGCTTTCCCTTTCGAGCTTCTCCTCAAGGTTTTAAAACGTATCTATTAGGGAGAGGTTTTTACACTCTTATAATGCTTCGTTCCCCTCTCTAACCAATGTGGAATCTCACAATTCACTCCTCTTGGGCGCCCAGTGTCCTCGTTCGCACACCGCTCGGTGCCTGACTCTGATACCATTTGTAATAGTCTAAGCTCACCGCTAGCAAATATTGTCTTCTTTGGACTTTTTCTTCCAGACTTCCTCTCAAAGTTTTAAAACGCGTATGTTCAGGAGAGATTTCACACCCTTGTAAAGAATGTTTTGTTCCCATCTCCAACCAATGTAGGATCTCATACAATGTGTTTGTCGAATTTTATCGTTGCATCGAGTATAAGTACAGCATTTCTTCTGGCGTTATGCTCACAAAATTTTACTTTATCTTTGTTCGGTGATCAATGAGTTGGATATATTGATCTGATATTGTCAGTTGTTCTTTCTGTCTGTCATATCTGAAAAAGTACAATGCCTTTTCTTAGTAGCAGAACATTTGCAGCAGGGACTGATTATGAGTTTTATCCTTGTTTTTGCCTCCAATCAGGACCATTATCTCAGTCCAAGAACACAAAATTTGAGTCAGTTTTCAAAATTTCAAGAAAAACATTCAGCTATATCTGTTCTCTTGTTAAGGAAGCTATGATGGCTAAAACCTCAAATTTTACCGATTTAAACGGCAAGCCTTTGTCTCTAAACGACCAAGTCGCCGTTGCTCTTAGGCGACTTTGCTCCGGTGAATCATTATCAAATATTGGTGACTCATTTGGAATGAATCAATCATCAGTTTCTCAAATAACATGGCGTTTTGTGGAGGCAATGGAAGAGAAAGGCATCCGCCATCTCTCGTGGCCTTCGACAGAAGAAGATATGGATCAGATAAAGTCCAAGTTCCAGAAAATCAGAGGTCTACCTAATTGTTGCGGTGTAATCGAAACGACGCACATTATGATGACTTTGCCAACGACAGAATCTGCAAATGGCGTCTGGCTTGATCGCGAGAAAAACTGCAGCATGATCTTGCAAGTGATTGTAGATCCAGAAATGAGATTCTGTGACATCATAACAGGTTGGCCAGGAAGCTTAAGCGATGCACTCGTGCTTGAAAGCTCAGGTTTTTTCAAACGTTCACAAGATGGCGAGCGGTTGAACGGAAAGAAGATGAAACTCTCAGAAAATTCAGAACTAGGAGAGTATATTATAGGAGACTCTGGCTTTCCACTGTTGCCATGGCTACTAACTCCTTACCAAGGGAAAGGCCTTGCAGATTACCAGACCGAGTTCAATAAGCGCCATTTCAGTACCAGGTTGGTGGCTCAAAGGGCATTGACAAGGTTGAAAGAGATGTGGAAGATCATTAAAGGGATAATGTGGAAGCCTGATAAACATAAGCTACCAAGGATCATTCTTGTTTGCTGCTTACTTCACAATATAATGATCGATATGGAGGACGAGATGCAAGACGAAATGCCGTTGTCTCATCATCACGATCCGAGTTACCGACAACAAAGTTGCAAATTCGTCGACAATACTGGTTCGATCACGAGGGAGAAACTTTCCATGTACTTATCTGAAAAGTTAACACCTTAAGAGAATCTCGTCGTGCACGAAAACATTCTTTTTTTTCTTTTCTTTTTGATAGATTGATCTGTGCTGCTGTTTATTCAAATTGCCCAAATTGTTGTTCACTCATTTAGATGCTATTGATAGTTCATTATAGATATCTCTGGTGATTATATGGTCTGCTAAATCTAAACAGATAGATACCCAGTTTCCTTCTCTACAAACATTATGCT

mRNA sequence

TGTTGCTTCCCTCAACATGATGAATTGGCCAATGCATCACTTCTCTTTCTTTTTTACAACTTCCATTGCCTTCCTCTAAACTTCTATTCCATCTTCTTCCATTCATCATTGCATGCTTTGGAATCAAAACCAGAGGAAATACACACATACCCATCAATACCCATCATGTCTTTTTAACCCAAATACTTGAACCCCTCCTTTCCTTTTGAGTTATTGTTTCATAAATTTCGAATTTGAGCGTTTTTCTTTATGTGAATTCTATCGTATTCTTCGATTCTCTTCGTGGGTTCTTGTGGGTGTTTCTTCTTTTGCCTCATCTTTGCTTCTTTTTCGAGTTTCTAATGGGACCCATTAGAGGGTTCAAGAGGAAGAAGCAGAAGAAGGCACAGAAAAAGGTTGTCCAATATGTCTTCGCTGCTGCTTCACTGTCGTTTCAGCCACAGCCCTTGGATTGGTGGGATGAGTTCTCACAGAGGATTACTGGACCATTATCTCAGTCCAAGAACACAAAATTTGAGTCAGTTTTCAAAATTTCAAGAAAAACATTCAGCTATATCTGTTCTCTTGTTAAGGAAGCTATGATGGCTAAAACCTCAAATTTTACCGATTTAAACGGCAAGCCTTTGTCTCTAAACGACCAAGTCGCCGTTGCTCTTAGGCGACTTTGCTCCGGTGAATCATTATCAAATATTGGTGACTCATTTGGAATGAATCAATCATCAGTTTCTCAAATAACATGGCGTTTTGTGGAGGCAATGGAAGAGAAAGGCATCCGCCATCTCTCGTGGCCTTCGACAGAAGAAGATATGGATCAGATAAAGTCCAAGTTCCAGAAAATCAGAGGTCTACCTAATTGTTGCGGTGTAATCGAAACGACGCACATTATGATGACTTTGCCAACGACAGAATCTGCAAATGGCGTCTGGCTTGATCGCGAGAAAAACTGCAGCATGATCTTGCAAGTGATTGTAGATCCAGAAATGAGATTCTGTGACATCATAACAGGTTGGCCAGGAAGCTTAAGCGATGCACTCGTGCTTGAAAGCTCAGGTTTTTTCAAACGTTCACAAGATGGCGAGCGGTTGAACGGAAAGAAGATGAAACTCTCAGAAAATTCAGAACTAGGAGAGTATATTATAGGAGACTCTGGCTTTCCACTGTTGCCATGGCTACTAACTCCTTACCAAGGGAAAGGCCTTGCAGATTACCAGACCGAGTTCAATAAGCGCCATTTCAGTACCAGGTTGGTGGCTCAAAGGGCATTGACAAGGTTGAAAGAGATGTGGAAGATCATTAAAGGGATAATGTGGAAGCCTGATAAACATAAGCTACCAAGGATCATTCTTGTTTGCTGCTTACTTCACAATATAATGATCGATATGGAGGACGAGATGCAAGACGAAATGCCGTTGTCTCATCATCACGATCCGAGTTACCGACAACAAAGTTGCAAATTCGTCGACAATACTGGTTCGATCACGAGGGAGAAACTTTCCATGTACTTATCTGAAAAGTTAACACCTTAAGAGAATCTCGTCGTGCACGAAAACATTCTTTTTTTTCTTTTCTTTTTGATAGATTGATCTGTGCTGCTGTTTATTCAAATTGCCCAAATTGTTGTTCACTCATTTAGATGCTATTGATAGTTCATTATAGATATCTCTGGTGATTATATGGTCTGCTAAATCTAAACAGATAGATACCCAGTTTCCTTCTCTACAAACATTATGCT

Coding sequence (CDS)

ATGGGACCCATTAGAGGGTTCAAGAGGAAGAAGCAGAAGAAGGCACAGAAAAAGGTTGTCCAATATGTCTTCGCTGCTGCTTCACTGTCGTTTCAGCCACAGCCCTTGGATTGGTGGGATGAGTTCTCACAGAGGATTACTGGACCATTATCTCAGTCCAAGAACACAAAATTTGAGTCAGTTTTCAAAATTTCAAGAAAAACATTCAGCTATATCTGTTCTCTTGTTAAGGAAGCTATGATGGCTAAAACCTCAAATTTTACCGATTTAAACGGCAAGCCTTTGTCTCTAAACGACCAAGTCGCCGTTGCTCTTAGGCGACTTTGCTCCGGTGAATCATTATCAAATATTGGTGACTCATTTGGAATGAATCAATCATCAGTTTCTCAAATAACATGGCGTTTTGTGGAGGCAATGGAAGAGAAAGGCATCCGCCATCTCTCGTGGCCTTCGACAGAAGAAGATATGGATCAGATAAAGTCCAAGTTCCAGAAAATCAGAGGTCTACCTAATTGTTGCGGTGTAATCGAAACGACGCACATTATGATGACTTTGCCAACGACAGAATCTGCAAATGGCGTCTGGCTTGATCGCGAGAAAAACTGCAGCATGATCTTGCAAGTGATTGTAGATCCAGAAATGAGATTCTGTGACATCATAACAGGTTGGCCAGGAAGCTTAAGCGATGCACTCGTGCTTGAAAGCTCAGGTTTTTTCAAACGTTCACAAGATGGCGAGCGGTTGAACGGAAAGAAGATGAAACTCTCAGAAAATTCAGAACTAGGAGAGTATATTATAGGAGACTCTGGCTTTCCACTGTTGCCATGGCTACTAACTCCTTACCAAGGGAAAGGCCTTGCAGATTACCAGACCGAGTTCAATAAGCGCCATTTCAGTACCAGGTTGGTGGCTCAAAGGGCATTGACAAGGTTGAAAGAGATGTGGAAGATCATTAAAGGGATAATGTGGAAGCCTGATAAACATAAGCTACCAAGGATCATTCTTGTTTGCTGCTTACTTCACAATATAATGATCGATATGGAGGACGAGATGCAAGACGAAATGCCGTTGTCTCATCATCACGATCCGAGTTACCGACAACAAAGTTGCAAATTCGTCGACAATACTGGTTCGATCACGAGGGAGAAACTTTCCATGTACTTATCTGAAAAGTTAACACCTTAA
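
The relationship between the coding sequence and the protein sequence listed next can be illustrated with a short translation routine. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); `translate`, `cds_start` and `cds_end` are hypothetical names, and only the first 30 and last 9 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 9 nucleotides of the CDS shown above.
cds_start = "ATGGGACCCATTAGAGGGTTCAAGAGGAAG"
cds_end = "ACACCTTAA"

print(translate(cds_start))
print(translate(cds_end))
```

Passing the full CDS string to `translate` in the same way should reproduce the protein sequence given in the next section, provided the standard code applies.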

Protein sequence

MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETTHIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFKRSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFSTRLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHHHDPSYRQQSCKFVDNTGSITREKLSMYLSEKLTP
BLAST of Cp4.1LG01g10530.1 vs. Swiss-Prot
Match: HARB1_DANRE (Putative nuclease HARBI1 OS=Danio rerio GN=harbi1 PE=2 SV=1)

HSP 1 Score: 125.2 bits (313), Expect = 1.6e-27
Identity = 78/291 (26.80%), Positives = 145/291 (49.83%), Query Frame = 1

Query: 60  SVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGD 119
           + F   R+   Y+  L+K++++ +T        + +S + Q+  AL    SG   S +GD
Sbjct: 37  NTFGFPREFIYYLVELLKDSLLRRTQR-----SRAISPDVQILAALGFYTSGSFQSKMGD 96

Query: 120 SFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETT 179
           + G++Q+S+S+      +A+ EK    + +   E    Q K +F +I G+PN  GV++  
Sbjct: 97  AIGISQASMSRCVSNVTKALIEKAPEFIGFTRDEATKQQFKDEFYRIAGIPNVTGVVDCA 156

Query: 180 HIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFF 239
           HI +  P  + ++  +++++   S+  Q++ D         T WPGSL+D  V + S   
Sbjct: 157 HIAIKAPNADDSS--YVNKKGFHSINCQLVCDARGLLLSAETHWPGSLTDRAVFKQSNVA 216

Query: 240 KRSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQG-KGLADYQTEFNKRHF 299
           K  ++            EN + G +++GD+ +PL  WL+TP Q  +  ADY+  +N  H 
Sbjct: 217 KLFEE-----------QENDDEG-WLLGDNRYPLKKWLMTPVQSPESPADYR--YNLAHT 276

Query: 300 STRLVAQRALTRLKEMWKIIKG----IMWKPDKHKLPRIILVCCLLHNIMI 346
           +T  +  R    ++  ++ + G    + + P+  K   II  CC+LHNI +
Sbjct: 277 TTHEIVDRTFRAIQTRFRCLDGAKGYLQYSPE--KCSHIIQACCVLHNISL 304
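
The percentages in each HSP summary are simply the reported counts divided by the alignment length. The sketch below recomputes them from a summary line as a sanity check; `STAT` and `check_hsp_stats` are hypothetical helpers written for this illustration and are not part of any BLAST tool.

```python
import re

# Matches fragments such as "Identity = 78/291 (26.80%)".
STAT = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")


def check_hsp_stats(line):
    """Return {label: (recomputed_percent, agrees_with_reported)}."""
    results = {}
    for label, num, den, reported in STAT.findall(line):
        recomputed = round(int(num) / int(den) * 100, 2)
        agrees = abs(recomputed - float(reported)) < 0.01
        results[label] = (recomputed, agrees)
    return results


# Summary values from the HARB1_DANRE hit above.
stats = check_hsp_stats("Identity = 78/291 (26.80%), Positives = 145/291 (49.83%)")
for label, (percent, agrees) in stats.items():
    print(label, percent, agrees)
```

The regular expression accepts any single-word label, so the same function can be applied to the summary line of every hit in this report.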

BLAST of Cp4.1LG01g10530.1 vs. Swiss-Prot
Match: HARB1_MOUSE (Putative nuclease HARBI1 OS=Mus musculus GN=Harbi1 PE=2 SV=1)

HSP 1 Score: 115.5 bits (288), Expect = 1.3e-24
Identity = 78/292 (26.71%), Positives = 142/292 (48.63%), Query Frame = 1

Query: 60  SVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGD 119
           S++   R+   ++  L+  ++   T        + +S   Q+  AL    SG   + +GD
Sbjct: 37  SMYGFPRQFIYFLVELLGASLSRPTQR-----SRAISPETQILAALGFYTSGSFQTRMGD 96

Query: 120 SFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETT 179
           + G++Q+S+S+      EA+ E+  + + +P  E  +  +K +F  + G+P   GV +  
Sbjct: 97  AIGISQASMSRCVANVTEALVERASQFIHFPVDEAAVQSLKDEFYGLAGMPGVIGVADCI 156

Query: 180 HIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFF 239
           H+ +  P  E  +  +++R+   S+   V+ D       + T WPGSL D  VL+ S   
Sbjct: 157 HVAIKAPNAEDLS--YVNRKGLHSLNCLVVCDIRGALMTVETSWPGSLQDCAVLQRSSLT 216

Query: 240 KRSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQ-GKGLADYQTEFNKRHF 299
            + + G         + ++S    +++GDS F L  WLLTP    +  A+Y+  +N+ H 
Sbjct: 217 SQFETG---------MPKDS----WLLGDSSFFLRSWLLTPLPIPETAAEYR--YNRAHS 276

Query: 300 STRLVAQRALTRLKEMWKIIKG----IMWKPDKHKLPRIILVCCLLHNIMID 347
           +T  V +R L  L   ++ + G    + + P+K     IIL CC+LHNI +D
Sbjct: 277 ATHSVIERTLQTLCCRFRCLDGSKGALQYSPEK--CSHIILACCVLHNISLD 304

BLAST of Cp4.1LG01g10530.1 vs. Swiss-Prot
Match: HARB1_RAT (Putative nuclease HARBI1 OS=Rattus norvegicus GN=Harbi1 PE=2 SV=1)

HSP 1 Score: 115.5 bits (288), Expect = 1.3e-24
Identity = 76/292 (26.03%), Positives = 144/292 (49.32%), Query Frame = 1

Query: 60  SVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGD 119
           S++   R+   Y+  L+  ++   T        + +S   Q+  AL    SG   + +GD
Sbjct: 37  SMYGFPRQFIYYLVELLGASLSRPTQR-----SRAISPETQILAALGFYTSGSFQTRMGD 96

Query: 120 SFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETT 179
           + G++Q+S+S+      EA+ E+  + + +P+ E  +  +K +F  + G+P   G ++  
Sbjct: 97  AIGISQASMSRCVANVTEALVERASQFIHFPADEAAIQSLKDEFYGLAGMPGVIGAVDCI 156

Query: 180 HIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFF 239
           H+ +  P  E  +  +++R+   S+   V+ D       + T WPGSL D  VL+ S   
Sbjct: 157 HVAIKAPNAEDLS--YVNRKGLHSLNCLVVCDIRGALMTVETSWPGSLQDCAVLQQSSLS 216

Query: 240 KRSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTP-YQGKGLADYQTEFNKRHF 299
            + + G         + ++S    +++GDS F L  WLLTP +  +  A+Y+  +N+ H 
Sbjct: 217 SQFETG---------MPKDS----WLLGDSSFFLHTWLLTPLHIPETPAEYR--YNRAHS 276

Query: 300 STRLVAQRALTRLKEMWKIIKG----IMWKPDKHKLPRIILVCCLLHNIMID 347
           +T  V ++ L  L   ++ + G    + + P+K     IIL CC+LHNI ++
Sbjct: 277 ATHSVIEKTLRTLCCRFRCLDGSKGALQYSPEKSS--HIILACCVLHNISLE 304

BLAST of Cp4.1LG01g10530.1 vs. TrEMBL
Match: A0A0A0KS64_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G180900 PE=4 SV=1)

HSP 1 Score: 741.9 bits (1914), Expect = 4.1e-211
Identity = 360/394 (91.37%), Positives = 379/394 (96.19%), Query Frame = 1

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGPIRGFKRKK  K +KKV Q VFA+ASLS Q QPLDWWDEFSQRITGPLSQSKNTKFES
Sbjct: 1   MGPIRGFKRKK--KVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFES 60

Query: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKE MMAKTS+FTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS
Sbjct: 61  VFKISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120

Query: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETTH 180
           FG+NQSSVSQITWRFVEAMEEKG+ HLSWPSTEEDMD+IKSKF+KIRGLPNCCGV+ETTH
Sbjct: 121 FGLNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKSKFKKIRGLPNCCGVVETTH 180

Query: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240
           IMMTLPT+ESANG+WLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVL+SSGFFK
Sbjct: 181 IMMTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFK 240

Query: 241 RSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFST 300
            SQDGERLNGKKMKLSE+SELGEYIIGDSGFPLLPWLLTPYQGKGL DYQ EFNKRHF+T
Sbjct: 241 LSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFAT 300

Query: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360
           RLVAQRALTRLKEMWKIIKG+MWKPDKH+LPRIILVCCLLHNI+IDMEDE+QDEMPLSHH
Sbjct: 301 RLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHH 360

Query: 361 HDPSYRQQSCKFVDNTGSITREKLSMYLSEKLTP 395
           HDPSYRQQSC+FVDNT SI+REKLSMYLS KL P
Sbjct: 361 HDPSYRQQSCEFVDNTASISREKLSMYLSGKLPP 392

BLAST of Cp4.1LG01g10530.1 vs. TrEMBL
Match: A0A061FMZ6_THECC (RNA binding protein, putative OS=Theobroma cacao GN=TCM_042838 PE=4 SV=1)

HSP 1 Score: 600.5 bits (1547), Expect = 1.5e-168
Identity = 288/401 (71.82%), Positives = 342/401 (85.29%), Query Frame = 1

Query: 1   MGPIRGFKRKKQKKAQKKVVQY------VFAAASLSFQPQPLDWWDEFSQRITGPLSQSK 60
           MGPIRGFKR+K K A KKVV           A+SL  QPQPLDWWDEFS+RI+G LSQSK
Sbjct: 1   MGPIRGFKRRK-KAADKKVVDQNVLPSSAAVASSLGSQPQPLDWWDEFSKRISGTLSQSK 60

Query: 61  NTK-FESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGES 120
           ++K FESVF+ISRKTF YICSLVKE MMA+ S+FTDLNGKPLSLNDQVAVALRRL SGES
Sbjct: 61  DSKSFESVFRISRKTFDYICSLVKEDMMARQSSFTDLNGKPLSLNDQVAVALRRLSSGES 120

Query: 121 LSNIGDSFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCC 180
           LS IGD+FGMNQS+VSQITWRFVEAMEE+G+ HLSWPSTE +M+QIKSKF+KIRGLPNCC
Sbjct: 121 LSIIGDTFGMNQSTVSQITWRFVEAMEERGLHHLSWPSTEAEMEQIKSKFEKIRGLPNCC 180

Query: 181 GVIETTHIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVL 240
           G I+ TH++MTLPT + +N VW DREKN SMILQ +VDPEMRF D+I GWPGSLSDA+VL
Sbjct: 181 GAIDITHVVMTLPTMDPSNNVWFDREKNYSMILQAVVDPEMRFRDVIAGWPGSLSDAIVL 240

Query: 241 ESSGFFKRSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEF 300
            SSGFF+ S++G+RLNGKK+ +SE +++ EYIIGD+GFPLLPWL TPYQGKGL+D Q EF
Sbjct: 241 RSSGFFRLSEEGKRLNGKKLNISEGTDIREYIIGDAGFPLLPWLFTPYQGKGLSDLQVEF 300

Query: 301 NKRHFSTRLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQD 360
           NKRH +TR+VAQ AL RLKEMW+II G+MW PDK++LPRI+LVCCLLHNI+ID+EDE+ D
Sbjct: 301 NKRHAATRMVAQMALARLKEMWRIIHGVMWMPDKNRLPRIVLVCCLLHNILIDLEDEVLD 360

Query: 361 EMPLSHHHDPSYRQQSCKFVDNTGSITREKLSMYLSEKLTP 395
           +M LSHHHD  YR+Q+C+ +D +  I R+KLS+YL+ KL P
Sbjct: 361 DMSLSHHHDTGYRRQNCESLDKSALIMRDKLSLYLTGKLPP 400

BLAST of Cp4.1LG01g10530.1 vs. TrEMBL
Match: A0A0B0NEP8_GOSAR (Putative nuclease HARBI1 OS=Gossypium arboreum GN=F383_15215 PE=4 SV=1)

HSP 1 Score: 590.9 bits (1522), Expect = 1.2e-165
Identity = 279/395 (70.63%), Positives = 340/395 (86.08%), Query Frame = 1

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTK-FE 60
           MGPI+GFKR+K K A KKVV +    +SL  QPQPLDWWDEFS RI+GPLSQSK ++ FE
Sbjct: 1   MGPIKGFKRRK-KTADKKVVDHNVLPSSLGSQPQPLDWWDEFSNRISGPLSQSKGSQSFE 60

Query: 61  SVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGD 120
           SVF+ISRKTF+YICSLVK+ +MA+ S++TD+ GKPLSLNDQVAVALRRL SGESLS IGD
Sbjct: 61  SVFRISRKTFNYICSLVKDDLMARQSSYTDIYGKPLSLNDQVAVALRRLSSGESLSIIGD 120

Query: 121 SFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETT 180
           +FGMNQS+VSQITWRFVE+MEE+G+ HLSWPSTEE+M+QIKSKF+KIRGLPNCCG I+ T
Sbjct: 121 TFGMNQSTVSQITWRFVESMEERGLHHLSWPSTEEEMEQIKSKFEKIRGLPNCCGAIDIT 180

Query: 181 HIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFF 240
           HI+MTLPT + +N VW DREKN SM+LQ +VDPEMRF D+I GWPGSLSDA+VL+SSG F
Sbjct: 181 HIVMTLPTMDPSNHVWFDREKNYSMVLQAVVDPEMRFRDVIVGWPGSLSDAVVLQSSGLF 240

Query: 241 KRSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFS 300
           + S++G+RLNGKK+ +SE +E+ EYIIGD+GFPLLPWL TPYQGK L+D Q EFNKRH +
Sbjct: 241 RLSEEGKRLNGKKLNISEGTEIREYIIGDAGFPLLPWLFTPYQGKSLSDLQIEFNKRHAA 300

Query: 301 TRLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSH 360
           TR+VA+ AL RLKEMW+II G+MW PD+++LPRIILVCCLLHNI+ID+EDE+ D+M LSH
Sbjct: 301 TRMVAEMALARLKEMWRIIHGVMWMPDRNRLPRIILVCCLLHNILIDLEDEVLDDMSLSH 360

Query: 361 HHDPSYRQQSCKFVDNTGSITREKLSMYLSEKLTP 395
            HD  Y +Q+C+  D + SITR+KLS+YL+ KL P
Sbjct: 361 QHDIDYHRQNCESFDQSASITRDKLSLYLTGKLPP 394

BLAST of Cp4.1LG01g10530.1 vs. TrEMBL
Match: A0A067FX22_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g015432mg PE=4 SV=1)

HSP 1 Score: 562.4 bits (1448), Expect = 4.4e-157
Identity = 275/409 (67.24%), Positives = 327/409 (79.95%), Query Frame = 1

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAA--------------SLSFQPQPLDWWDEFSQRI 60
           MGPIRG KR+K  KA+KKV Q V AAA              SL  QPQPLDWWD FS+RI
Sbjct: 1   MGPIRGLKRRK--KAEKKVDQNVLAAAAASDGDGDGDADADSLVAQPQPLDWWDNFSRRI 60

Query: 61  TGPLSQSKNTK-FESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVAL 120
           +GPL  SK +K FESVFKISRKTF YICSLVKE + A+ SNF+  NGKPLS ND VA+AL
Sbjct: 61  SGPLFGSKTSKNFESVFKISRKTFDYICSLVKEDLAARQSNFSFSNGKPLSPNDMVAIAL 120

Query: 121 RRLCSGESLSNIGDSFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQK 180
           RRL SGESL  IGD FG+NQS+VSQ+TWRFVE+MEE+G+ HL WPS E +M+ IKSKF+K
Sbjct: 121 RRLSSGESLQIIGDLFGLNQSTVSQVTWRFVESMEERGLHHLQWPSKETEMEDIKSKFEK 180

Query: 181 IRGLPNCCGVIETTHIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPG 240
           IRG  NCCG I+ THI+M +P  + AN VW DREKN SMILQ IVDPEMRF DII GWPG
Sbjct: 181 IRGFRNCCGAIDITHIVMNIPAVDPANNVWYDREKNYSMILQGIVDPEMRFRDIIAGWPG 240

Query: 241 SLSDALVLESSGFFKRSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKG 300
           SL+DALVL +SGFFK +++G+RL+GK ++LSE  EL EYIIGD+GFPLLPWLLTPYQGKG
Sbjct: 241 SLTDALVLRNSGFFKLTEEGKRLDGKSLQLSEGIELREYIIGDTGFPLLPWLLTPYQGKG 300

Query: 301 LADYQTEFNKRHFSTRLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMI 360
           L+D + E+NKRH +TR+VAQ AL RLK++W+II G+MW PDK++LPRI+LVCCLLHNI+I
Sbjct: 301 LSDIEAEYNKRHSATRMVAQMALARLKDVWRIIHGVMWMPDKNRLPRIVLVCCLLHNIVI 360

Query: 361 DMEDEMQDEMPLSHHHDPSYRQQSCKFVDNTGSITREKLSMYLSEKLTP 395
           DMEDEM DE+PLS+HHD  Y QQ+C+ VD T S+ R+ LS+YLS KL P
Sbjct: 361 DMEDEMLDELPLSYHHDSGYHQQTCESVDKTASVMRDNLSLYLSGKLPP 407

BLAST of Cp4.1LG01g10530.1 vs. TrEMBL
Match: A0A067K0W6_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18441 PE=4 SV=1)

HSP 1 Score: 561.6 bits (1446), Expect = 7.5e-157
Identity = 270/398 (67.84%), Positives = 330/398 (82.91%), Query Frame = 1

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQ---PLDWWDEFSQRITGPLSQSKNT- 60
           MGPIRGFKR+K  KA+KKV Q V AAA  S  PQ   PLDWWD+FS+RITGPLS+S+N+ 
Sbjct: 1   MGPIRGFKRRK--KAEKKVDQNVLAAALSSLHPQSQQPLDWWDDFSKRITGPLSESRNSM 60

Query: 61  KFESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSN 120
           KFESVFKISRKTF+YICSLV + + A+ SNF+  NGKPLSLNDQVA+ALRRL SGESLSN
Sbjct: 61  KFESVFKISRKTFNYICSLVNDVLTARQSNFSSTNGKPLSLNDQVAIALRRLSSGESLSN 120

Query: 121 IGDSFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVI 180
           IGD+FG+NQS+VS +TWRFVEAMEE+G+ HL WPS++ +M+++KSKF+K+ GLPNCCGVI
Sbjct: 121 IGDAFGINQSTVSHLTWRFVEAMEERGLDHLRWPSSQTEMEEVKSKFEKLHGLPNCCGVI 180

Query: 181 ETTHIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESS 240
           +TTHI+MTL   + +N VW+DREKN SM+LQ IVDP+MR  D+I G+PGSLSDALVL++S
Sbjct: 181 DTTHIVMTLSAVDHSNDVWIDREKNHSMVLQAIVDPDMRIRDVIVGYPGSLSDALVLQNS 240

Query: 241 GFFKRSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKR 300
            F+K S++G+RLNGKK+KL E +ELGEYIIGD+GFPLLPWLLTP+Q   L  +Q EFNK 
Sbjct: 241 SFYKLSEEGKRLNGKKIKLMEGAELGEYIIGDAGFPLLPWLLTPFQ-HALPGHQAEFNKL 300

Query: 301 HFSTRLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMP 360
           H + R+VAQ AL RLKEMW+I+ G+MW PDK+KLPRII VCCLLHNI+IDMED+  +EMP
Sbjct: 301 HSAARVVAQIALARLKEMWRIMHGVMWLPDKNKLPRIIFVCCLLHNIVIDMEDKALEEMP 360

Query: 361 LSHHHDPSYRQQSCKFVDNTGSITREKLSMYLSEKLTP 395
           +SHHHD  YRQQ C+    TG+  REK S Y+S KL P
Sbjct: 361 MSHHHDKDYRQQICESASKTGTDMREKFSYYISNKLPP 395

BLAST of Cp4.1LG01g10530.1 vs. TAIR10
Match: AT3G55350.1 (AT3G55350.1 PIF / Ping-Pong family of plant transposases)

HSP 1 Score: 490.3 bits (1261), Expect = 1.1e-138
Identity = 247/409 (60.39%), Positives = 301/409 (73.59%), Query Frame = 1

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSF------------------QPQPLDWWDEF 60
           MGPI+  K+KK  +A+KKV + V  AA+ +                     Q LDWWD F
Sbjct: 1   MGPIKTIKKKK--RAEKKVDRNVLLAATAAATSASAAAALNNNDDDDDSSSQSLDWWDGF 60

Query: 61  SQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVA 120
           S+RI G  +  K   FESVFKISRKTF YICSLVK    AK +NF+D NG PLSLND+VA
Sbjct: 61  SRRIYGGSTDPKT--FESVFKISRKTFDYICSLVKADFTAKPANFSDSNGNPLSLNDRVA 120

Query: 121 VALRRLCSGESLSNIGDSFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSK 180
           VALRRL SGESLS IG++FGMNQS+VSQITWRFVE+MEE+ I HLSWPS    +D+IKSK
Sbjct: 121 VALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAIHHLSWPSK---LDEIKSK 180

Query: 181 FQKIRGLPNCCGVIETTHIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITG 240
           F+KI GLPNCCG I+ THI+M LP  E +N VWLD EKN SM LQ +VDP+MRF D+I G
Sbjct: 181 FEKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMTLQAVVDPDMRFLDVIAG 240

Query: 241 WPGSLSDALVLESSGFFKRSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQ 300
           WPGSL+D +VL++SGF+K  + G+RLNG+K+ LSE +EL EYI+GDSGFPLLPWLLTPYQ
Sbjct: 241 WPGSLNDDVVLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDSGFPLLPWLLTPYQ 300

Query: 301 GKGLADYQTEFNKRHFSTRLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHN 360
           GK  +  QTEFNKRH      AQ AL++LK+ W+II G+MW PD+++LPRII VCCLLHN
Sbjct: 301 GKPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIFVCCLLHN 360

Query: 361 IMIDMEDEMQDEMPLSHHHDPSYRQQSCKFVDNTGSITREKLSMYLSEK 392
           I+IDMED+  D+ PLS  HD +YRQ+SCK  D   S+ R++LS  L  K
Sbjct: 361 IIIDMEDQTLDDQPLSQQHDMNYRQRSCKLADEASSVLRDELSDQLCGK 402

BLAST of Cp4.1LG01g10530.1 vs. TAIR10
Match: AT3G63270.1 (AT3G63270.1 Putative harbinger transposase-derived nuclease (InterPro:IPR006912))

HSP 1 Score: 354.0 bits (907), Expect = 1.2e-97
Identity = 174/396 (43.94%), Positives = 261/396 (65.91%), Query Frame = 1

Query: 1   MGPIRGFKRKKQK------KAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGP-LSQS 60
           M P++  K+ K+K      K  K   +    A  L  +    DWWD F  R + P +   
Sbjct: 1   MAPVKQKKKNKKKPLDKAKKLAKNKEKKRVNAVPLDPEAIDCDWWDTFWLRNSSPSVPSD 60

Query: 61  KNTKFESVFKISRKTFSYICSLVKEAMMAKT-SNFTDLNGKPLSLNDQVAVALRRLCSGE 120
           ++  F+  F+ S+ TFSYICSLV+E ++++  S   ++ G+ LS+  QVA+ALRRL SG+
Sbjct: 61  EDYAFKHFFRASKTTFSYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALRRLASGD 120

Query: 121 SLSNIGDSFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNC 180
           S  ++G +FG+ QS+VSQ+TWRF+EA+EE+   HL WP ++  +++IKSKF+++ GLPNC
Sbjct: 121 SQVSVGAAFGVGQSTVSQVTWRFIEALEERAKHHLRWPDSDR-IEEIKSKFEEMYGLPNC 180

Query: 181 CGVIETTHIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALV 240
           CG I+TTHI+MTLP  ++++  W D+EKN SM LQ + D EMRF +++TGWPG ++ + +
Sbjct: 181 CGAIDTTHIIMTLPAVQASDD-WCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGMTVSKL 240

Query: 241 LESSGFFKRSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTE 300
           L+ SGFFK  ++ + L+G    LS+ +++ EY++G   +PLLPWL+TP+     +D    
Sbjct: 241 LKFSGFFKLCENAQILDGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHPSDSMVA 300

Query: 301 FNKRHFSTRLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQ 360
           FN+RH   R VA  A  +LK  W+I+  +MW+PD+ KLP IILVCCLLHNI+ID  D +Q
Sbjct: 301 FNERHEKVRSVAATAFQQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHNIIIDCGDYLQ 360

Query: 361 DEMPLSHHHDPSYRQQSCKFVDNTGSITREKLSMYL 389
           +++PLS HHD  Y  + CK  +  GS  R  L+ +L
Sbjct: 361 EDVPLSGHHDSGYADRYCKQTEPLGSELRGCLTEHL 394

BLAST of Cp4.1LG01g10530.1 vs. TAIR10
Match: AT5G12010.1 (AT5G12010.1 unknown protein)

HSP 1 Score: 149.4 bits (376), Expect = 4.5e-36
Identity = 93/324 (28.70%), Positives = 164/324 (50.62%), Query Frame = 1

Query: 38  WWDEFSQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSL 97
           WW+E S R+  P        F+  F++S+ TF  IC  +  A+  + +   +     + +
Sbjct: 161 WWEECS-RLDYP-----EEDFKKAFRMSKSTFELICDELNSAVAKEDTALRNA----IPV 220

Query: 98  NDQVAVALRRLCSGESLSNIGDSFGMNQSSVSQITWRFVEAMEEKGI-RHLSWPSTEEDM 157
             +VAV + RL +GE L  +   FG+  S+  ++     +A+++  + ++L WP  +E +
Sbjct: 221 RQRVAVCIWRLATGEPLRLVSKKFGLGISTCHKLVLEVCKAIKDVLMPKYLQWPD-DESL 280

Query: 158 DQIKSKFQKIRGLPNCCGVIETTHIMMTLPTTESAN-----GVWLDREKNCSMILQVIVD 217
             I+ +F+ + G+PN  G + TTHI +  P    A+         +++ + S+ +Q +V+
Sbjct: 281 RNIRERFESVSGIPNVVGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITIQAVVN 340

Query: 218 PEMRFCDIITGWPGSLSDALVLESSGFFKRSQDGERLNGKKMKLSENSELGEYIIGDSGF 277
           P+  F D+  GWPGS+ D  VLE S  ++R+ +G  L G             ++ G  G 
Sbjct: 341 PKGVFTDLCIGWPGSMPDDKVLEKSLLYQRANNGGLLKGM------------WVAGGPGH 400

Query: 278 PLLPWLLTPYQGKGLADYQTEFNKRHFSTRLVAQRALTRLKEMWKIIKGIMWKPDKHKLP 337
           PLL W+L PY  + L   Q  FN++    + VA+ A  RLK  W  ++    +     LP
Sbjct: 401 PLLDWVLVPYTQQNLTWTQHAFNEKMSEVQGVAKEAFGRLKGRWACLQK-RTEVKLQDLP 460

Query: 338 RIILVCCLLHNIMIDMEDEMQDEM 356
            ++  CC+LHNI    E++M+ E+
Sbjct: 461 TVLGACCVLHNICEMREEKMEPEL 460

BLAST of Cp4.1LG01g10530.1 vs. TAIR10
Match: AT4G29780.1 (AT4G29780.1 unknown protein)

HSP 1 Score: 129.4 bits (324), Expect = 4.8e-30
Identity = 89/325 (27.38%), Positives = 152/325 (46.77%), Query Frame = 1

Query: 37  DWWDEFSQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLS 96
           DWWD  S+            +F   F++S+ TF+ IC  +   +  K +   D    P  
Sbjct: 198 DWWDRVSR------PDFPEDEFRREFRMSKSTFNLICEELDTTVTKKNTMLRDAIPAP-- 257

Query: 97  LNDQVAVALRRLCSGESLSNIGDSFGMNQSSVSQITWRFVEAMEEKGI-RHLSWPSTEED 156
              +V V + RL +G  L ++ + FG+  S+  ++      A+ +  + ++L WPS  E 
Sbjct: 258 --KRVGVCVWRLATGAPLRHVSERFGLGISTCHKLVIEVCRAIYDVLMPKYLLWPSDSE- 317

Query: 157 MDQIKSKFQKIRGLPNCCGVIETTHIMMTLPTTESA-----NGVWLDREKNCSMILQVIV 216
           ++  K+KF+ +  +PN  G I TTHI +  P    A          +++ + S+ +Q +V
Sbjct: 318 INSTKAKFESVHKIPNVVGSIYTTHIPIIAPKVHVAAYFNKRHTERNQKTSYSITVQGVV 377

Query: 217 DPEMRFCDIITGWPGSLSDALVLESSGFFKRSQDGERLNGKKMKLSENSELGEYIIGDSG 276
           + +  F D+  G PGSL+D  +LE S               + + +       +I+G+SG
Sbjct: 378 NADGIFTDVCIGNPGSLTDDQILEKSSL------------SRQRAARGMLRDSWIVGNSG 437

Query: 277 FPLLPWLLTPYQGKGLADYQTEFNKRHFSTRLVAQRALTRLKEMWKIIKGIMWKPDKHKL 336
           FPL  +LL PY  + L   Q  FN+     + +A  A  RLK  W  ++    +     L
Sbjct: 438 FPLTDYLLVPYTRQNLTWTQHAFNESIGEIQGIATAAFERLKGRWACLQK-RTEVKLQDL 497

Query: 337 PRIILVCCLLHNIMIDMEDEMQDEM 356
           P ++  CC+LHNI    ++EM  E+
Sbjct: 498 PYVLGACCVLHNICEMRKEEMLPEL 498

BLAST of Cp4.1LG01g10530.1 vs. TAIR10
Match: AT1G72270.1 (AT1G72270.1 Ribosome 60S biogenesis N-terminal (InterPro:IPR021714))

HSP 1 Score: 95.1 bits (235), Expect = 1.0e-19
Identity = 77/312 (24.68%), Positives = 134/312 (42.95%), Query Frame = 1

Query: 42  FSQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQV 101
           F++ +T       + ++   F++S+ TF  + S++  + +                    
Sbjct: 81  FNRFLTSATEDEDDPRWCLYFRMSKSTFFSLYSILSHSSLPS-----------------F 140

Query: 102 AVALRRLCSGESLSNIGDSFGMNQSS-VSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIK 161
           A  + RL  G S   +   FG + +S  S+  +   + + EK           + +D  K
Sbjct: 141 AATIFRLAHGASYECLVHRFGFDSTSQASRSFFTVCKLINEK---------LSQQLDDPK 200

Query: 162 SKFQKIRGLPNCCGVIETTHIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDII 221
             F     LPNC GV+                G  L  +   S+++Q +VD   RF DI 
Sbjct: 201 PDFSP-NLLPNCYGVVGFGRF--------EVKGKLLGAKG--SILVQALVDSNGRFVDIS 260

Query: 222 TGWPGSLSDALVLESSGFFKRSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTP 281
            GWP ++    +   +  F  ++  E L+G   KL     +  YI+GDS  PLLPWL+TP
Sbjct: 261 AGWPSTMKPEAIFRQTKLFSIAE--EVLSGAPTKLGNGVLVPRYILGDSCLPLLPWLVTP 320

Query: 282 YQ-GKGLADYQTEFNKRHFSTRLVAQRALTRLKEMWKIIKGIMWKPDKHK-LPRIILVCC 341
           Y        ++ EFN    +     + A  +++  W+I+    WKP+  + +P +I   C
Sbjct: 321 YDLTSDEESFREEFNNVVHTGLHSVEIAFAKVRARWRILDK-KWKPETIEFMPFVITTGC 352

Query: 342 LLHNIMIDMEDE 351
           LLHN +++  D+
Sbjct: 381 LLHNFLVNSGDD 352

BLAST of Cp4.1LG01g10530.1 vs. NCBI nr
Match: gi|449459932|ref|XP_004147700.1| (PREDICTED: putative nuclease HARBI1 [Cucumis sativus])

HSP 1 Score: 741.9 bits (1914), Expect = 5.8e-211
Identity = 360/394 (91.37%), Positives = 379/394 (96.19%), Query Frame = 1

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGPIRGFKRKK  K +KKV Q VFA+ASLS Q QPLDWWDEFSQRITGPLSQSKNTKFES
Sbjct: 1   MGPIRGFKRKK--KVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFES 60

Query: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKE MMAKTS+FTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS
Sbjct: 61  VFKISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120

Query: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETTH 180
           FG+NQSSVSQITWRFVEAMEEKG+ HLSWPSTEEDMD+IKSKF+KIRGLPNCCGV+ETTH
Sbjct: 121 FGLNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKSKFKKIRGLPNCCGVVETTH 180

Query: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240
           IMMTLPT+ESANG+WLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVL+SSGFFK
Sbjct: 181 IMMTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFK 240

Query: 241 RSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFST 300
            SQDGERLNGKKMKLSE+SELGEYIIGDSGFPLLPWLLTPYQGKGL DYQ EFNKRHF+T
Sbjct: 241 LSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFAT 300

Query: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360
           RLVAQRALTRLKEMWKIIKG+MWKPDKH+LPRIILVCCLLHNI+IDMEDE+QDEMPLSHH
Sbjct: 301 RLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHH 360

Query: 361 HDPSYRQQSCKFVDNTGSITREKLSMYLSEKLTP 395
           HDPSYRQQSC+FVDNT SI+REKLSMYLS KL P
Sbjct: 361 HDPSYRQQSCEFVDNTASISREKLSMYLSGKLPP 392

BLAST of Cp4.1LG01g10530.1 vs. NCBI nr
Match: gi|659123396|ref|XP_008461643.1| (PREDICTED: putative nuclease HARBI1 [Cucumis melo])

HSP 1 Score: 739.6 bits (1908), Expect = 2.9e-210
Identity = 359/394 (91.12%), Positives = 378/394 (95.94%), Query Frame = 1

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGPIRGFKRKK  K +KKV Q VFA+ASLS Q QPLDWWDEFSQRITGPLSQSKNTKFES
Sbjct: 1   MGPIRGFKRKK--KVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFES 60

Query: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKE MMAKTS+FTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS
Sbjct: 61  VFKISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120

Query: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETTH 180
           FG+NQSSVSQITWRFVEAMEEKG+ HLSWPSTEEDMD+IKSKF+KIRGLPNCCGVIETTH
Sbjct: 121 FGLNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKSKFKKIRGLPNCCGVIETTH 180

Query: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240
           IMMTLPT+ESANG+WLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSD+LVL+SSGFFK
Sbjct: 181 IMMTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDSLVLQSSGFFK 240

Query: 241 RSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFST 300
            SQDGERLNGKKM+LSE+SELGEYIIGDSGFPLLPWLLTPYQGKGL DYQ EFNKRHF+T
Sbjct: 241 LSQDGERLNGKKMRLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFAT 300

Query: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360
           RLVAQRALTRLKEMWKIIKG+MWKPDKH+LPRIILVCCLLHNI+IDMEDE+QDEMPLSHH
Sbjct: 301 RLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHH 360

Query: 361 HDPSYRQQSCKFVDNTGSITREKLSMYLSEKLTP 395
           HDPSYRQQSC+FVDNT SI REKLSMYLS KL P
Sbjct: 361 HDPSYRQQSCEFVDNTASIAREKLSMYLSGKLPP 392

BLAST of Cp4.1LG01g10530.1 vs. NCBI nr
Match: gi|1009120469|ref|XP_015876938.1| (PREDICTED: putative nuclease HARBI1 [Ziziphus jujuba])

HSP 1 Score: 605.1 bits (1559), Expect = 8.5e-170
Identity = 293/392 (74.74%), Positives = 336/392 (85.71%), Query Frame = 1

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGP+RG K  K+KK +KKV Q V AA SL  +P+PLDWWD FSQRITGPL QSK  KFES
Sbjct: 1   MGPVRGLK--KRKKVEKKVDQNVLAA-SLGPEPEPLDWWDGFSQRITGPLLQSKKMKFES 60

Query: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKE MMAK SNF DLNGKPLSLNDQVAVALRRL +GESLS+IGDS
Sbjct: 61  VFKISRKTFSYICSLVKEDMMAKASNFVDLNGKPLSLNDQVAVALRRLSAGESLSSIGDS 120

Query: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETTH 180
           F MNQS+VSQ+TWRFVE+MEE+G+ HL WPSTE +M++IKSKF+KIRGLPNCCG I+TTH
Sbjct: 121 FKMNQSTVSQLTWRFVESMEERGLHHLHWPSTETEMEEIKSKFEKIRGLPNCCGAIDTTH 180

Query: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240
           IMMTLPT + ++ VWLD EKNCSMILQ IVDPEMRF ++ITGWPGSL+D +VL SSGFFK
Sbjct: 181 IMMTLPTMDPSSDVWLDHEKNCSMILQAIVDPEMRFRNVITGWPGSLNDDIVLRSSGFFK 240

Query: 241 RSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFST 300
              +G+ LNGKKM L E +ELGEYI+GD+GFPLLPWLLTPY+GK L D+Q E+NKR F+T
Sbjct: 241 LCGEGKMLNGKKMVLPEGTELGEYIVGDAGFPLLPWLLTPYRGKHLPDFQAEYNKRLFAT 300

Query: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360
           ++VAQRAL RLKEMWKII G+MWKPDKHKLPRIILVCC+LHNI+IDMEDEMQDE+PLSHH
Sbjct: 301 KMVAQRALARLKEMWKIIHGVMWKPDKHKLPRIILVCCILHNIVIDMEDEMQDELPLSHH 360

Query: 361 HDPSYRQQSCKFVDNTGSITREKLSMYLSEKL 393
           HD  Y Q + + VD +  I REKLS+ LS KL
Sbjct: 361 HDTGYHQLNSESVDKSALILREKLSLQLSGKL 389

BLAST of Cp4.1LG01g10530.1 vs. NCBI nr
Match: gi|590563694|ref|XP_007009443.1| (RNA binding protein, putative [Theobroma cacao])

HSP 1 Score: 600.5 bits (1547), Expect = 2.1e-168
Identity = 288/401 (71.82%), Positives = 342/401 (85.29%), Query Frame = 1

Query: 1   MGPIRGFKRKKQKKAQKKVVQY------VFAAASLSFQPQPLDWWDEFSQRITGPLSQSK 60
           MGPIRGFKR+K K A KKVV           A+SL  QPQPLDWWDEFS+RI+G LSQSK
Sbjct: 1   MGPIRGFKRRK-KAADKKVVDQNVLPSSAAVASSLGSQPQPLDWWDEFSKRISGTLSQSK 60

Query: 61  NTK-FESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGES 120
           ++K FESVF+ISRKTF YICSLVKE MMA+ S+FTDLNGKPLSLNDQVAVALRRL SGES
Sbjct: 61  DSKSFESVFRISRKTFDYICSLVKEDMMARQSSFTDLNGKPLSLNDQVAVALRRLSSGES 120

Query: 121 LSNIGDSFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCC 180
           LS IGD+FGMNQS+VSQITWRFVEAMEE+G+ HLSWPSTE +M+QIKSKF+KIRGLPNCC
Sbjct: 121 LSIIGDTFGMNQSTVSQITWRFVEAMEERGLHHLSWPSTEAEMEQIKSKFEKIRGLPNCC 180

Query: 181 GVIETTHIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVL 240
           G I+ TH++MTLPT + +N VW DREKN SMILQ +VDPEMRF D+I GWPGSLSDA+VL
Sbjct: 181 GAIDITHVVMTLPTMDPSNNVWFDREKNYSMILQAVVDPEMRFRDVIAGWPGSLSDAIVL 240

Query: 241 ESSGFFKRSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEF 300
            SSGFF+ S++G+RLNGKK+ +SE +++ EYIIGD+GFPLLPWL TPYQGKGL+D Q EF
Sbjct: 241 RSSGFFRLSEEGKRLNGKKLNISEGTDIREYIIGDAGFPLLPWLFTPYQGKGLSDLQVEF 300

Query: 301 NKRHFSTRLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQD 360
           NKRH +TR+VAQ AL RLKEMW+II G+MW PDK++LPRI+LVCCLLHNI+ID+EDE+ D
Sbjct: 301 NKRHAATRMVAQMALARLKEMWRIIHGVMWMPDKNRLPRIVLVCCLLHNILIDLEDEVLD 360

Query: 361 EMPLSHHHDPSYRQQSCKFVDNTGSITREKLSMYLSEKLTP 395
           +M LSHHHD  YR+Q+C+ +D +  I R+KLS+YL+ KL P
Sbjct: 361 DMSLSHHHDTGYRRQNCESLDKSALIMRDKLSLYLTGKLPP 400

BLAST of Cp4.1LG01g10530.1 vs. NCBI nr
Match: gi|823255971|ref|XP_012460644.1| (PREDICTED: putative nuclease HARBI1 isoform X1 [Gossypium raimondii])

HSP 1 Score: 595.5 bits (1534), Expect = 6.7e-167
Identity = 280/393 (71.25%), Positives = 340/393 (86.51%), Query Frame = 1

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTK-FE 60
           MGPIRGFKR+K K A KKVV +   ++SL  Q QPLDWWD+FS+RI+GPLSQSK ++ FE
Sbjct: 1   MGPIRGFKRRK-KTADKKVVDHNVFSSSLESQLQPLDWWDDFSKRISGPLSQSKGSRSFE 60

Query: 61  SVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGD 120
           S+F+IS+KTF+YICSLVKE MMA+ S++TD+NGKPLSLNDQVAVALRRL SGESLS IGD
Sbjct: 61  SIFRISKKTFNYICSLVKEDMMARQSSYTDINGKPLSLNDQVAVALRRLSSGESLSVIGD 120

Query: 121 SFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETT 180
           +FGMNQS+VSQITWRFVEAMEEKG+ HL+WP TE +M+QIKSKF+KIRGLPNCCG I+ T
Sbjct: 121 TFGMNQSTVSQITWRFVEAMEEKGLHHLTWPLTEAEMEQIKSKFEKIRGLPNCCGAIDIT 180

Query: 181 HIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFF 240
           H++MTLPT + +N VW DREKN SMILQ +VDPEMR  D+I GWPGSLSDA+VL SSGFF
Sbjct: 181 HVVMTLPTMDPSNNVWFDREKNYSMILQAVVDPEMRLRDVIAGWPGSLSDAVVLRSSGFF 240

Query: 241 KRSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFS 300
           + S++G+RL GKK+ +SE  E+GEYIIGD+GFPLLPWLLTPYQGKGL+D Q EFNKRH +
Sbjct: 241 RLSEEGKRLTGKKLNISEGMEIGEYIIGDAGFPLLPWLLTPYQGKGLSDLQIEFNKRHAA 300

Query: 301 TRLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSH 360
           TR+VAQ AL RLKEMW+II G+MW PDK++LPRI+LVCCLLHNI+IDMEDE+ D+M LSH
Sbjct: 301 TRMVAQMALARLKEMWRIIHGVMWMPDKNRLPRIVLVCCLLHNILIDMEDEVFDDMSLSH 360

Query: 361 HHDPSYRQQSCKFVDNTGSITREKLSMYLSEKL 393
           HHD  YR+Q+C++ D +  I R+KLS+Y++ KL
Sbjct: 361 HHDTGYRRQNCEYFDQSAMIMRDKLSLYITGKL 392
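
The identity and positives percentages in each HSP header are the match counts divided by the alignment length. A minimal Python sketch, assuming a statistics line in the form shown in the HSP headers above, recomputes them from the raw counts. The helper name `parse_hsp_stats` and the returned dictionary keys are hypothetical and not part of any BLAST toolkit.

```python
import re

# Hypothetical helper: parses one HSP statistics line and recomputes the
# percentages from the raw counts. The pattern accepts both "Positives"
# and the "Postives" spelling that some report pages emit.
_STAT_LINE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return recomputed and reported percentages for one HSP header line."""
    match = _STAT_LINE.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_rep, pos, pos_len, pos_rep = match.groups()
    return {
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as printed in the report, for cross-checking.
        "reported_identity_pct": float(ident_rep),
        "reported_positives_pct": float(pos_rep),
    }


if __name__ == "__main__":
    header = "Identity = 280/393 (71.25%), Positives = 340/393 (86.51%), Query Frame = 1"
    print(parse_hsp_stats(header))
```

Comparing the recomputed values with the reported ones is a cheap sanity check when alignment headers have passed through scraping or reformatting.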

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
HARB1_DANRE  1.6e-27  26.80     Putative nuclease HARBI1 OS=Danio rerio GN=harbi1 PE=2 SV=1
HARB1_MOUSE  1.3e-24  26.71     Putative nuclease HARBI1 OS=Mus musculus GN=Harbi1 PE=2 SV=1
HARB1_RAT    1.3e-24  26.03     Putative nuclease HARBI1 OS=Rattus norvegicus GN=Harbi1 PE=2 SV=1
Match Name        E-value   Identity  Description
A0A0A0KS64_CUCSA  4.1e-211  91.37     Uncharacterized protein OS=Cucumis sativus GN=Csa_5G180900 PE=4 SV=1
A0A061FMZ6_THECC  1.5e-168  71.82     RNA binding protein, putative OS=Theobroma cacao GN=TCM_042838 PE=4 SV=1
A0A0B0NEP8_GOSAR  1.2e-165  70.63     Putative nuclease HARBI1 OS=Gossypium arboreum GN=F383_15215 PE=4 SV=1
A0A067FX22_CITSI  4.4e-157  67.24     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g015432mg PE=4 SV=1
A0A067K0W6_JATCU  7.5e-157  67.84     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18441 PE=4 SV=1
Match Name   E-value   Identity  Description
AT3G55350.1  1.1e-138  60.39     PIF / Ping-Pong family of plant transposases
AT3G63270.1  1.2e-97   43.94     Putative harbinger transposase-derived nuclease (InterPro:IPR006912)
AT5G12010.1  4.5e-36   28.70     unknown protein
AT4G29780.1  4.8e-30   27.38     unknown protein
AT1G72270.1  1.0e-19   24.68     Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)
Match Name                         E-value   Identity  Description
gi|449459932|ref|XP_004147700.1|   5.8e-211  91.37     PREDICTED: putative nuclease HARBI1 [Cucumis sativus]
gi|659123396|ref|XP_008461643.1|   2.9e-210  91.12     PREDICTED: putative nuclease HARBI1 [Cucumis melo]
gi|1009120469|ref|XP_015876938.1|  8.5e-170  74.74     PREDICTED: putative nuclease HARBI1 [Ziziphus jujuba]
gi|590563694|ref|XP_007009443.1|   2.1e-168  71.82     RNA binding protein, putative [Theobroma cacao]
gi|823255971|ref|XP_012460644.1|   6.7e-167  71.25     PREDICTED: putative nuclease HARBI1 isoform X1 [Gossypium raimondii]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term       Definition
IPR027806  HARBI1_dom
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0043565      sequence-specific DNA binding

This mRNA is a part of the following gene feature(s):

Feature Name     Unique Name      Type
Cp4.1LG01g10530  Cp4.1LG01g10530  gene


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name                          Unique Name                           Type
Cp4.1LG01g10530.1:five_prime_utr:001  Cp4.1LG01g10530.1:five_prime_utr:001  five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name               Unique Name                Type
Cp4.1LG01g10530.1:cds:001  Cp4.1LG01g10530.1:cds:001  CDS
Cp4.1LG01g10530.1:cds:002  Cp4.1LG01g10530.1:cds:002  CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name                           Unique Name                            Type
Cp4.1LG01g10530.1:three_prime_utr:001  Cp4.1LG01g10530.1:three_prime_utr:001  three_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature Name       Unique Name                Type
Cp4.1LG01g10530.1  Cp4.1LG01g10530.1-protein  polypeptide


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                                Source   Source Term     Source Description   Alignment
IPR027806  Harbinger transposase-derived nuclease domain  PFAM     PF13359         DDE_Tnp_4            coord: 176..342; score: 1.5
None       No IPR available                               PANTHER  PTHR22930       UNCHARACTERIZED      coord: 13..390; score: 1.2E
None       No IPR available                               PANTHER  PTHR22930:SF45  SUBFAMILY NOT NAMED  coord: 13..390; score: 1.2E