Cp4.1LG01g08150 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g08150
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: RNA-directed DNA polymerase (reverse transcriptase)
Location: Cp4.1LG01 : 5117718 .. 5120024 (-)
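The location field above can be split into its parts programmatically. The following is a minimal sketch, not the database's own parser: the names `Location`, `parse_location`, and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for one 'seqid : start .. end (strand)' field."""
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for strings such as "Cp4.1LG01 : 5117718 .. 5120024 (-)".
_LOCATION_RE = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text: str) -> Location:
    """Parse a location string into sequence ID, start, end, and strand."""
    match = _LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("Cp4.1LG01 : 5117718 .. 5120024 (-)")
print(loc.seqid, loc.start, loc.end, loc.strand, span_length(loc))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` needs to change.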
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTCATCCACCTAAGAAAGATAGCGACACCATGGAAACCCAACCTCTCATCTTCCTTCAACAAACTTTACTCTGATCGCCCATTGAAGCATTCGCCTCTTCTCAAGCTTCCGTTTCAACATTCTGTACAAACCCTCACAAGGCCCCAACTCGAGGCTCTAGTTCTATCCCGATTCTCCCAGGGCAAGTTCTTCGACCTTCTTCGAAATGTTGTTGCCTCTCCCTCTGTTCTTTTCACCGCCTCCCAAAATCTCATCACTCCACTCCCAAGCAATCGCCTCAATGCTCCCGACTCGCTACTCAGTTTCGATATGGTCTCTAGTTGCTTTTCGGTCGAGGACATGGCTCGGGAGCTCTACGAAAATCGTTTCGATGTTGGTGCTTGCTGTGTTCGGTTGGAATCATCGGAAGAGAAAGGTGAGTTTCTGGTCTTACCGAATTTGAAATTGAAGGTCTTACTCGAGGCTATTAGGATTGTGTTGGAAATTGTTTATGACGAACGATTTGTAACGTTCTCTTATGGGGGGCGTGTCGGTATGGGGCGGCACACTGCCATTAGGTACCTGAAGAACTCGGTGCAAAATCCTAGTTGGTGGTTTACCGTTGCATTTCGTCGCAGAAAGTTCGATTCTGTACATGTAAATAAGTTGTGCTTATTGGTGCAAGAGAAAATTAAGGATGATATTTTGATTCTTATGCTAAAGAAACTGTTTGAATTGGAGGCAGTTCAAATTGAATTGGGTGGTTGTTATTTAGGAAGGGGGTTCCCTCAGGAAAGTGGTTTGTGTTCAATCTTGTCTAATATATACTTCAATGGCTTTGATAAAGAACTTCAACAAATACGTCTTGAAAAGAATGAAGAAAATCCCAAGTTCAGTCTGGACGGTACTGTTTCTTTCCATAATCCGGTGAAAATATATGCCGTTAGGTATCTGGATGAGATATTAGTTATAACATCGGGGTCGAAGATGCTAATAATGGAGTTGAAAAGCCAGGTGCTAAGGTATTTAGAAGGGAATTTAGAATTGGAAGTGGATCGAATGAATACTGCAATTCATAGTGCTGTCTCCGAGAAAATTAGTTTCTTAGGAATGGAACTACAGGCAGTACCACCTTCAGTTCTGCATCCACCAATGTCCGAAAAGGCAATCAGGGCTCGGAAGAAGTACCTTAGACAGAAGGAAGTTAGAGCAATAGAATTGAGAAATGCCCGTGAGAGAAACAGGAAAAAATTGGGATTGAAGATATTGGGTCACGTGTTCAAGAAATTGAAGCGAACCGATGGCTTGAAATATGAATTCCAAATCGAGAAGGAAGTCACAGAAATCTTCAGAAATTGGGCCGATGAAGTAGTGCGAGATTTCTTGGAGTCTTTGGAAGATAATACAGAGTGGCACCGTCCGCTGTCAGCAGGTGATTTCCTCTCCTTAAAACACATAAGAAATCAGTTGCCAGTAGATCTTGTGAATGCTTATGACAGGTTTCAAGATCAGGTAAACCAGCACTTGAATCCTCTAAAGGCCAAAAAGGAGAAGGCTAGGGAGGATGAAAAGAAAAGATTGGGTGAAGAAGAACGATATGCTAAAAGAACAGTTGAGGACTTAACAAGGCTATGCATCAAAGTTGAAGCTCCTATAGAGCTTGTTAGGAAGGCAGTGAAGATGATTGGATTTACAAATAAAATGGGCCGTCCTCGGCCTATCAGCTCTCTCATTGCTCTTGAAGATACAGATATTATCAAGTGGTATGCCGGTGTAGGAAGAAGGTGGTTAGACTTCTTTTGCTGTTGTCATAACTATAAGACGGTCAAAACTATTGTAACTTACCATTTGAGGTTTTCTTGTATTTTGACATTGGCGGAAAAGCATGAATCGACCAAACGGGAAGCCATGAAACATTACAGTAAAGATTTGAAAGTCTTTGATTTAAATGGCAAGGAAGAGATGCACTTCCCAACAGAAAGAGAAGTTAAGATGTTGGGAGAAAGAAATCTTGCTGA
CCCATACCCTGTGGATGGGGCTTTTTCTTTGTTTCTGATTAGATTAGTCACTGATGAAGATTCATATCCTTGTATTGCTCATTTTTGCAATAGAACAGACTCTATTCTATATAGGGTCCGATTACTGCAAAAGACTCTGAATGTCAATCCATCTAATGGAGTGGAATGGGTGAGAGGGATGGGAGTGATTCATGAAAGTTTAAATCAGAGATGCCTCCCTCTATGTGCTGATCACATCAGTGATTTATACATGGGGAAAATCAACCTTCAAGACTTGGACTGCACCTTATCATTGGATATGGACTGA
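The feature is annotated on the minus strand, yet the sequence above begins with ATG, so it is presumably shown in the transcribed orientation rather than as it appears on the reference strand. As a hedged sketch, a minus-strand sequence can be obtained from a plus-strand slice by reverse-complementing it. The function name `reverse_complement` is an illustrative choice, and the input below is a toy string, not a slice of the actual Cp4.1LG01 assembly.

```python
# Complement map for unambiguous DNA bases plus N, in both cases.
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


# Toy example: a plus-strand fragment whose minus-strand reading starts with ATG.
plus_strand_fragment = "GAGCAT"
print(reverse_complement(plus_strand_fragment))  # ATGCTC
```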

mRNA sequence

ATGCTCATCCACCTAAGAAAGATAGCGACACCATGGAAACCCAACCTCTCATCTTCCTTCAACAAACTTTACTCTGATCGCCCATTGAAGCATTCGCCTCTTCTCAAGCTTCCGTTTCAACATTCTGTACAAACCCTCACAAGGCCCCAACTCGAGGCTCTAGTTCTATCCCGATTCTCCCAGGGCAAGTTCTTCGACCTTCTTCGAAATGTTGTTGCCTCTCCCTCTGTTCTTTTCACCGCCTCCCAAAATCTCATCACTCCACTCCCAAGCAATCGCCTCAATGCTCCCGACTCGCTACTCAGTTTCGATATGGTCTCTAGTTGCTTTTCGGTCGAGGACATGGCTCGGGAGCTCTACGAAAATCGTTTCGATGTTGGTGCTTGCTGTGTTCGGTTGGAATCATCGGAAGAGAAAGGTGAGTTTCTGGTCTTACCGAATTTGAAATTGAAGGTCTTACTCGAGGCTATTAGGATTGTGTTGGAAATTGTTTATGACGAACGATTTGTAACGTTCTCTTATGGGGGGCGTGTCGGTATGGGGCGGCACACTGCCATTAGGTACCTGAAGAACTCGGTGCAAAATCCTAGTTGGTGGTTTACCGTTGCATTTCGTCGCAGAAAGTTCGATTCTGTACATGTAAATAAGTTGTGCTTATTGGTGCAAGAGAAAATTAAGGATGATATTTTGATTCTTATGCTAAAGAAACTGTTTGAATTGGAGGCAGTTCAAATTGAATTGGGTGGTTGTTATTTAGGAAGGGGGTTCCCTCAGGAAAGTGGTTTGTGTTCAATCTTGTCTAATATATACTTCAATGGCTTTGATAAAGAACTTCAACAAATACGTCTTGAAAAGAATGAAGAAAATCCCAAGTTCAGTCTGGACGGTACTGTTTCTTTCCATAATCCGGTGAAAATATATGCCGTTAGGTATCTGGATGAGATATTAGTTATAACATCGGGGTCGAAGATGCTAATAATGGAGTTGAAAAGCCAGGTGCTAAGGTATTTAGAAGGGAATTTAGAATTGGAAGTGGATCGAATGAATACTGCAATTCATAGTGCTGTCTCCGAGAAAATTAGTTTCTTAGGAATGGAACTACAGGCAGTACCACCTTCAGTTCTGCATCCACCAATGTCCGAAAAGGCAATCAGGGCTCGGAAGAAGTACCTTAGACAGAAGGAAGTTAGAGCAATAGAATTGAGAAATGCCCGTGAGAGAAACAGGAAAAAATTGGGATTGAAGATATTGGGTCACGTGTTCAAGAAATTGAAGCGAACCGATGGCTTGAAATATGAATTCCAAATCGAGAAGGAAGTCACAGAAATCTTCAGAAATTGGGCCGATGAAGTAGTGCGAGATTTCTTGGAGTCTTTGGAAGATAATACAGAGTGGCACCGTCCGCTGTCAGCAGGTGATTTCCTCTCCTTAAAACACATAAGAAATCAGTTGCCAGTAGATCTTGTGAATGCTTATGACAGGTTTCAAGATCAGGTAAACCAGCACTTGAATCCTCTAAAGGCCAAAAAGGAGAAGGCTAGGGAGGATGAAAAGAAAAGATTGGGTGAAGAAGAACGATATGCTAAAAGAACAGTTGAGGACTTAACAAGGCTATGCATCAAAGTTGAAGCTCCTATAGAGCTTGTTAGGAAGGCAGTGAAGATGATTGGATTTACAAATAAAATGGGCCGTCCTCGGCCTATCAGCTCTCTCATTGCTCTTGAAGATACAGATATTATCAAGTGGTATGCCGGTGTAGGAAGAAGGTGGTTAGACTTCTTTTGCTGTTGTCATAACTATAAGACGGTCAAAACTATTGTAACTTACCATTTGAGGTTTTCTTGTATTTTGACATTGGCGGAAAAGCATGAATCGACCAAACGGGAAGCCATGAAACATTACAGTAAAGATTTGAAAGTCTTTGATTTAAATGGCAAGGAAGAGATGCACTTCCCAACAGAAAGAGAAGTTAAGATGTTGGGAGAAAGAAATCTTGCTGA
CCCATACCCTGTGGATGGGGCTTTTTCTTTGTTTCTGATTAGATTAGTCACTGATGAAGATTCATATCCTTGTATTGCTCATTTTTGCAATAGAACAGACTCTATTCTATATAGGGTCCGATTACTGCAAAAGACTCTGAATGTCAATCCATCTAATGGAGTGGAATGGGTGAGAGGGATGGGAGTGATTCATGAAAGTTTAAATCAGAGATGCCTCCCTCTATGTGCTGATCACATCAGTGATTTATACATGGGGAAAATCAACCTTCAAGACTTGGACTGCACCTTATCATTGGATATGGACTGA

Coding sequence (CDS)

ATGCTCATCCACCTAAGAAAGATAGCGACACCATGGAAACCCAACCTCTCATCTTCCTTCAACAAACTTTACTCTGATCGCCCATTGAAGCATTCGCCTCTTCTCAAGCTTCCGTTTCAACATTCTGTACAAACCCTCACAAGGCCCCAACTCGAGGCTCTAGTTCTATCCCGATTCTCCCAGGGCAAGTTCTTCGACCTTCTTCGAAATGTTGTTGCCTCTCCCTCTGTTCTTTTCACCGCCTCCCAAAATCTCATCACTCCACTCCCAAGCAATCGCCTCAATGCTCCCGACTCGCTACTCAGTTTCGATATGGTCTCTAGTTGCTTTTCGGTCGAGGACATGGCTCGGGAGCTCTACGAAAATCGTTTCGATGTTGGTGCTTGCTGTGTTCGGTTGGAATCATCGGAAGAGAAAGGTGAGTTTCTGGTCTTACCGAATTTGAAATTGAAGGTCTTACTCGAGGCTATTAGGATTGTGTTGGAAATTGTTTATGACGAACGATTTGTAACGTTCTCTTATGGGGGGCGTGTCGGTATGGGGCGGCACACTGCCATTAGGTACCTGAAGAACTCGGTGCAAAATCCTAGTTGGTGGTTTACCGTTGCATTTCGTCGCAGAAAGTTCGATTCTGTACATGTAAATAAGTTGTGCTTATTGGTGCAAGAGAAAATTAAGGATGATATTTTGATTCTTATGCTAAAGAAACTGTTTGAATTGGAGGCAGTTCAAATTGAATTGGGTGGTTGTTATTTAGGAAGGGGGTTCCCTCAGGAAAGTGGTTTGTGTTCAATCTTGTCTAATATATACTTCAATGGCTTTGATAAAGAACTTCAACAAATACGTCTTGAAAAGAATGAAGAAAATCCCAAGTTCAGTCTGGACGGTACTGTTTCTTTCCATAATCCGGTGAAAATATATGCCGTTAGGTATCTGGATGAGATATTAGTTATAACATCGGGGTCGAAGATGCTAATAATGGAGTTGAAAAGCCAGGTGCTAAGGTATTTAGAAGGGAATTTAGAATTGGAAGTGGATCGAATGAATACTGCAATTCATAGTGCTGTCTCCGAGAAAATTAGTTTCTTAGGAATGGAACTACAGGCAGTACCACCTTCAGTTCTGCATCCACCAATGTCCGAAAAGGCAATCAGGGCTCGGAAGAAGTACCTTAGACAGAAGGAAGTTAGAGCAATAGAATTGAGAAATGCCCGTGAGAGAAACAGGAAAAAATTGGGATTGAAGATATTGGGTCACGTGTTCAAGAAATTGAAGCGAACCGATGGCTTGAAATATGAATTCCAAATCGAGAAGGAAGTCACAGAAATCTTCAGAAATTGGGCCGATGAAGTAGTGCGAGATTTCTTGGAGTCTTTGGAAGATAATACAGAGTGGCACCGTCCGCTGTCAGCAGGTGATTTCCTCTCCTTAAAACACATAAGAAATCAGTTGCCAGTAGATCTTGTGAATGCTTATGACAGGTTTCAAGATCAGGTAAACCAGCACTTGAATCCTCTAAAGGCCAAAAAGGAGAAGGCTAGGGAGGATGAAAAGAAAAGATTGGGTGAAGAAGAACGATATGCTAAAAGAACAGTTGAGGACTTAACAAGGCTATGCATCAAAGTTGAAGCTCCTATAGAGCTTGTTAGGAAGGCAGTGAAGATGATTGGATTTACAAATAAAATGGGCCGTCCTCGGCCTATCAGCTCTCTCATTGCTCTTGAAGATACAGATATTATCAAGTGGTATGCCGGTGTAGGAAGAAGGTGGTTAGACTTCTTTTGCTGTTGTCATAACTATAAGACGGTCAAAACTATTGTAACTTACCATTTGAGGTTTTCTTGTATTTTGACATTGGCGGAAAAGCATGAATCGACCAAACGGGAAGCCATGAAACATTACAGTAAAGATTTGAAAGTCTTTGATTTAAATGGCAAGGAAGAGATGCACTTCCCAACAGAAAGAGAAGTTAAGATGTTGGGAGAAAGAAATCTTGCTGA
CCCATACCCTGTGGATGGGGCTTTTTCTTTGTTTCTGATTAGATTAGTCACTGATGAAGATTCATATCCTTGTATTGCTCATTTTTGCAATAGAACAGACTCTATTCTATATAGGGTCCGATTACTGCAAAAGACTCTGAATGTCAATCCATCTAATGGAGTGGAATGGGTGAGAGGGATGGGAGTGATTCATGAAAGTTTAAATCAGAGATGCCTCCCTCTATGTGCTGATCACATCAGTGATTTATACATGGGGAAAATCAACCTTCAAGACTTGGACTGCACCTTATCATTGGATATGGACTGA

Protein sequence

MLIHLRKIATPWKPNLSSSFNKLYSDRPLKHSPLLKLPFQHSVQTLTRPQLEALVLSRFSQGKFFDLLRNVVASPSVLFTASQNLITPLPSNRLNAPDSLLSFDMVSSCFSVEDMARELYENRFDVGACCVRLESSEEKGEFLVLPNLKLKVLLEAIRIVLEIVYDERFVTFSYGGRVGMGRHTAIRYLKNSVQNPSWWFTVAFRRRKFDSVHVNKLCLLVQEKIKDDILILMLKKLFELEAVQIELGGCYLGRGFPQESGLCSILSNIYFNGFDKELQQIRLEKNEENPKFSLDGTVSFHNPVKIYAVRYLDEILVITSGSKMLIMELKSQVLRYLEGNLELEVDRMNTAIHSAVSEKISFLGMELQAVPPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILGHVFKKLKRTDGLKYEFQIEKEVTEIFRNWADEVVRDFLESLEDNTEWHRPLSAGDFLSLKHIRNQLPVDLVNAYDRFQDQVNQHLNPLKAKKEKAREDEKKRLGEEERYAKRTVEDLTRLCIKVEAPIELVRKAVKMIGFTNKMGRPRPISSLIALEDTDIIKWYAGVGRRWLDFFCCCHNYKTVKTIVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVFDLNGKEEMHFPTEREVKMLGERNLADPYPVDGAFSLFLIRLVTDEDSYPCIAHFCNRTDSILYRVRLLQKTLNVNPSNGVEWVRGMGVIHESLNQRCLPLCADHISDLYMGKINLQDLDCTLSLDMD
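The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which this page does not state explicitly. `CODON_TABLE` and `translate` are hypothetical names, and only short fragments taken from the start and end of the CDS are used as input.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence in frame 1.

    Unknown codons become 'X'. With to_stop=True, translation ends at the
    first stop codon and the stop itself is not included in the result.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS shown on this page.
print(translate("ATGCTCATCCACCTAAGAAAG"))  # MLIHLRK
# Last three codons plus the stop codon.
print(translate("GATATGGACTGA"))  # DMD
```

Both fragments agree with the first and last residues of the protein sequence listed above.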
BLAST of Cp4.1LG01g08150 vs. Swiss-Prot
Match: AI2M_YEAST (Putative COX1/OXI3 intron 2 protein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=AI2 PE=3 SV=2)

HSP 1 Score: 68.2 bits (165), Expect = 4.6e-10
Identity = 60/191 (31.41%), Positives = 96/191 (50.26%), Query Frame = 1

Query: 132 RLESSEEKGEF--LVLPNLKLKVLLEAIRIVLEIVYDERFVTFSYGGRVGMGRHTAIRYL 191
           R+E  +  G F  L + N + K++ E++R++LEI+Y+  F  +S+G R  +   TAI   
Sbjct: 345 RVEIPKTSGGFRPLSVGNPREKIVQESMRMMLEIIYNNSFSYYSHGFRPNLSCLTAIIQC 404

Query: 192 KNSVQNPSWWFTVAFRRRKFDSVHVNKLCLLVQEKIKDDILILMLKKLFELEAVQIELGG 251
           KN +Q  +W+  V    + FD++  N L  ++ E+IKD   + +L KL  L A  ++   
Sbjct: 405 KNYMQYCNWFIKVDL-NKCFDTIPHNMLINVLNERIKDKGFMDLLYKL--LRAGYVDKNN 464

Query: 252 CYLGR--GFPQESGLCSILSNIYFNGFDKELQQIRLEKNEENPKFSLDGTVSFHNPV--K 311
            Y     G PQ S +  IL NI+ +  DK L+  + E        S  G    +N +  K
Sbjct: 465 NYHNTTLGIPQGSVVSPILCNIFLDKLDKYLEN-KFENEFNTGNMSNRGRNPIYNSLSSK 524

Query: 312 IYAVRYLDEIL 317
           IY  + L E L
Sbjct: 525 IYRCKLLSEKL 531
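The percentages in an HSP summary line are the identity and positive counts divided by the alignment length. The sketch below recomputes them from the raw fractions. The regular expression and the name `hsp_percentages` are illustrative and only assume the line layout used on this page. The pattern accepts any word beginning with "Pos" as the label of the second fraction.

```python
import re

# Captures the two "count/length" fractions from an HSP summary line.
_HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Pos\w*\s*=\s*(\d+)/(\d+)"
)


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (percent identity, percent positives), rounded to 2 decimals."""
    match = _HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )


line = "Identity = 60/191 (31.41%), Positives = 96/191 (50.26%), Query Frame = 1"
print(hsp_percentages(line))  # (31.41, 50.26)
```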

BLAST of Cp4.1LG01g08150 vs. Swiss-Prot
Match: YMC6_SCHPO (Uncharacterized 91 kDa protein in cob intron OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=SPMIT.06 PE=3 SV=4)

HSP 1 Score: 66.6 bits (161), Expect = 1.3e-09
Identity = 45/140 (32.14%), Positives = 74/140 (52.86%), Query Frame = 1

Query: 151 KVLLEAIRIVLEIVYDERFVTFSYGGRVGMGRHTAIRYLKNSVQNPSWWFTVAFRRRKFD 210
           K++ E +RIVLE +Y+  F T S+G R G   H+A+R +  + +  +WW      +  FD
Sbjct: 318 KLVQEILRIVLEAIYEPLFNTASHGFRPGRSCHSALRSIFTNFKGCTWWIEGDI-KACFD 377

Query: 211 SVHVNKLCLLVQEKIKDDILILMLKKLFELEAVQIELGGCYLGRGFPQESGLCSILSNIY 270
           S+  +KL  L+  KIKD   I +++K         E    Y   G PQ S +  IL+NIY
Sbjct: 378 SIPHDKLIALLSSKIKDQRFIQLIRKALN-AGYLTENRYKYDIVGTPQGSIVSPILANIY 437

Query: 271 FNGFDKELQQIRLEKNEENP 291
            +  D+ ++ ++ E + + P
Sbjct: 438 LHQLDEFIENLKSEFDYKGP 455

BLAST of Cp4.1LG01g08150 vs. Swiss-Prot
Match: LTRA_LACLM (Group II intron-encoded protein LtrA OS=Lactococcus lactis subsp. cremoris (strain MG1363) GN=ltrA PE=1 SV=1)

HSP 1 Score: 61.6 bits (148), Expect = 4.3e-08
Identity = 49/173 (28.32%), Positives = 86/173 (49.71%), Query Frame = 1

Query: 134 ESSEEKGEFLVLPNLKLKVLLEAIRIVLEIVYDERFVTFSYGGRVGMGRHTAIRYLKNSV 193
           + + +K   L +P    K++ EA+RI+LE +Y+  F   S+G R     HTA++ +K   
Sbjct: 91  KKNSKKMRPLGIPTFTDKLIQEAVRIILESIYEPVFEDVSHGFRPQRSCHTALKTIKREF 150

Query: 194 QNPSWWFTVAFRRRKFDSVHVNKLCLLVQEKIKDDILILMLKKLFELEAVQIELGGCY-L 253
              + WF     +  FD++    L  L+  KIKD  +  ++ K   L+A  +E    +  
Sbjct: 151 -GGARWFVEGDIKGCFDNIDHVTLIGLINLKIKDMKMSQLIYKF--LKAGYLENWQYHKT 210

Query: 254 GRGFPQESGLCSILSNIYFNGFDKELQQIRLEKNEENPKFSLDGTVSFHNPVK 306
             G PQ   L  +L+NIY +  DK + Q++++ + E+P+         HN +K
Sbjct: 211 YSGTPQGGILSPLLANIYLHELDKFVLQLKMKFDRESPERITPEYRELHNEIK 260

BLAST of Cp4.1LG01g08150 vs. Swiss-Prot
Match: LTRA_LACLC (Group II intron-encoded protein LtrA OS=Lactococcus lactis subsp. cremoris GN=ltrA PE=1 SV=1)

HSP 1 Score: 61.6 bits (148), Expect = 4.3e-08
Identity = 49/173 (28.32%), Positives = 86/173 (49.71%), Query Frame = 1

Query: 134 ESSEEKGEFLVLPNLKLKVLLEAIRIVLEIVYDERFVTFSYGGRVGMGRHTAIRYLKNSV 193
           + + +K   L +P    K++ EA+RI+LE +Y+  F   S+G R     HTA++ +K   
Sbjct: 91  KKNSKKMRPLGIPTFTDKLIQEAVRIILESIYEPVFEDVSHGFRPQRSCHTALKTIKREF 150

Query: 194 QNPSWWFTVAFRRRKFDSVHVNKLCLLVQEKIKDDILILMLKKLFELEAVQIELGGCY-L 253
              + WF     +  FD++    L  L+  KIKD  +  ++ K   L+A  +E    +  
Sbjct: 151 -GGARWFVEGDIKGCFDNIDHVTLIGLINLKIKDMKMSQLIYKF--LKAGYLENWQYHKT 210

Query: 254 GRGFPQESGLCSILSNIYFNGFDKELQQIRLEKNEENPKFSLDGTVSFHNPVK 306
             G PQ   L  +L+NIY +  DK + Q++++ + E+P+         HN +K
Sbjct: 211 YSGTPQGGILSPLLANIYLHELDKFVLQLKMKFDRESPERITPEYRELHNEIK 260

BLAST of Cp4.1LG01g08150 vs. Swiss-Prot
Match: NICA_PSEPU (Putative nicotine oxidoreductase OS=Pseudomonas putida GN=nicA PE=3 SV=1)

HSP 1 Score: 59.7 bits (143), Expect = 1.6e-07
Identity = 40/132 (30.30%), Positives = 72/132 (54.55%), Query Frame = 1

Query: 151 KVLLEAIRIVLEIVYDERFVTFSYGGRVGMGRHTAIRYLKNSVQNPSWWFTVAFRRRKFD 210
           KV+ E IR +LE +Y+  F   S+G R G   HTA++ ++ S    +W       +  FD
Sbjct: 109 KVVQEVIRSILEAIYEPTFSKNSHGFRAGKSCHTALKQVRESWSGVTWVIEGDI-KGCFD 168

Query: 211 SVHVNKLCLLVQEKIKDDILILMLKKLFELEAVQIELGGCYLGR-GFPQESGLCSILSNI 270
           ++  +KL   ++ +IKD+  I +++K   L A   E G  +    G PQ S +  IL+N+
Sbjct: 169 NISHSKLIDQLRLRIKDERFINLIRK--ALNAGYFENGAFFSATLGTPQGSIISPILANV 228

Query: 271 YFNGFDKELQQI 282
           + +  D++++Q+
Sbjct: 229 FLDQLDRKVEQL 237

BLAST of Cp4.1LG01g08150 vs. TrEMBL
Match: F6HLP1_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_08s0007g05780 PE=4 SV=1)

HSP 1 Score: 1020.4 bits (2637), Expect = 1.2e-294
Identity = 522/774 (67.44%), Positives = 634/774 (81.91%), Query Frame = 1

Query: 1   MLIHLRKIATPWKPNLSSSFNKLYSDRPLKHSPLLKLPFQHSVQTLTRPQLEALVLSRFS 60
           ML++ ++IAT     L S  + L   R  +HS L   P  +    LT+PQL+ALV++ +S
Sbjct: 24  MLLNPKRIAT-----LHSRVSILSLLR--RHSTLPPNP--NPTTPLTKPQLKALVINHYS 83

Query: 61  QGKFFDLLRNVVASPSVLFTASQNLITPLPSNRLNAPDSLLSFDMVSSCFSVEDMARELY 120
           +GKF +L++NVVASP VL  A QNL TP  SN +N+    L+   V+  FSVE++ REL 
Sbjct: 84  RGKFSNLIQNVVASPPVLLLACQNL-TPR-SNDVNS----LASPAVALRFSVEELGRELG 143

Query: 121 ENRFDVGACCVRLESSEEKGEFLVLPNLKLKVLLEAIRIVLEIVYDERFVTFSYGGRVGM 180
           ENRFDV +CCVR+  S +KGE LVLPNLKLKV++EAIR+VLEIVYDER VTF+YGGRVGM
Sbjct: 144 ENRFDVESCCVRMVPSRKKGESLVLPNLKLKVVIEAIRMVLEIVYDERLVTFAYGGRVGM 203

Query: 181 GRHTAIRYLKNSVQNPSWWFTVAFRRRKFDSVHVNKLCLLVQEKIKDDILILMLKKLFEL 240
           GRHTAIRYLKNSVQNP+WWF V F R KF+  +VNKLCL+++EKIKD +LI +++KLFE 
Sbjct: 204 GRHTAIRYLKNSVQNPNWWFKVTFDREKFEHKNVNKLCLIIEEKIKDTVLIGIVRKLFEC 263

Query: 241 EAVQIELGGCYLGRGFPQESGLCSILSNIYFNGFDKELQQIRLEKNEENPKFS----LDG 300
           E +QIELGGCYLGRGFPQE GL SIL N+YFNGFDKE+Q +R+  N+ENP+F     L G
Sbjct: 264 EVLQIELGGCYLGRGFPQECGLSSILINVYFNGFDKEIQDLRIRTNQENPRFDSNEVLSG 323

Query: 301 TVSFHNPVKIYAVRYLDEILVITSGSKMLIMELKSQVLRYLEGNLELEVDRMNTAIHSAV 360
           +  F+ PVKIYAVRYLDEILVITSGSKML M+LK+QV+++LEG LEL+VDR+  AIHSA 
Sbjct: 324 SSVFYKPVKIYAVRYLDEILVITSGSKMLTMDLKNQVMKFLEGKLELKVDRLKMAIHSAT 383

Query: 361 SEKISFLGMELQAVPPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKI 420
            EKI FLGMELQAV PSVL PPMSEKAIRA+KKYLRQKEV+AIELRNARE NRKKLGLKI
Sbjct: 384 MEKIDFLGMELQAVQPSVLRPPMSEKAIRAQKKYLRQKEVKAIELRNARETNRKKLGLKI 443

Query: 421 LGHVFKKLKRTDGLKYEFQIEKEVTEIFRNWADEVVRDFLESLEDNTEWHRPLSAGDFLS 480
           L HVFKKLK++D  K++F IE EV EIFR WADEVV++FL SLE+   W+R LS GDFLS
Sbjct: 444 LAHVFKKLKQSDEFKFDFHIENEVREIFRTWADEVVKEFLGSLEEQANWYRMLSVGDFLS 503

Query: 481 LKHIRNQLPVDLVNAYDRFQDQVNQHLNPLKAKKEKAREDEKKRLGEEE--RYAKRTVED 540
           L+HIR+QLP +LV+AYD FQ+QV++H+ P+KA+  KA E+ ++R+ EEE  +YA+RTV++
Sbjct: 504 LRHIRHQLPQELVDAYDHFQEQVDKHIKPVKAR--KALEEAERRVVEEEEQKYAERTVQE 563

Query: 541 LTRLCIKVEAPIELVRKAVKMIGFTNKMGRPRPISSLIALEDTDIIKWYAGVGRRWLDFF 600
           LTRLC+KV+APIELVRKAVKM GFTN MGRPRPI  LIALEDTDIIKWYAGVGRRWLDFF
Sbjct: 564 LTRLCMKVDAPIELVRKAVKMAGFTNNMGRPRPIKLLIALEDTDIIKWYAGVGRRWLDFF 623

Query: 601 CCCHNYKTVKTIVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVFDLNGKEEMHFPTE 660
           CCCHN+K VKT+VTYHLRFSC+LTLAEKHESTK E ++HY+KDLKV D NG EE+HFP E
Sbjct: 624 CCCHNFKMVKTVVTYHLRFSCLLTLAEKHESTKLETIRHYTKDLKVSDFNGIEEVHFPAE 683

Query: 661 REVKMLGERNLADPYPVDGAFSLFLIRLVTDEDSYPCIAHFCNRTDSILYRVRLLQKTLN 720
           RE+KM+G++NL+DP PVDGA SL LIRL +DE +Y C+AHFC+R D+I+YRVRLLQ  LN
Sbjct: 684 REIKMMGDKNLSDPKPVDGALSLALIRLASDEPAYSCVAHFCDRKDTIVYRVRLLQNRLN 743

Query: 721 VNPSNGVEWVRGMGVIHESLNQRCLPLCADHISDLYMGKINLQDLDCTLSLDMD 769
           VNP +  +WV GMG IHE LN++CLPLC+DHI DLYMG I+LQD+DCT  +D+D
Sbjct: 744 VNPLDEKKWVPGMGAIHEGLNRKCLPLCSDHIHDLYMGTISLQDIDCTSFVDVD 780

BLAST of Cp4.1LG01g08150 vs. TrEMBL
Match: A0A061FIF3_THECC (RNA-directed DNA polymerase OS=Theobroma cacao GN=TCM_033550 PE=4 SV=1)

HSP 1 Score: 1007.7 bits (2604), Expect = 7.7e-291
Identity = 503/735 (68.44%), Positives = 612/735 (83.27%), Query Frame = 1

Query: 42  SVQTLTRPQLEALVLSRFSQGKFFDLLRNVVASPSVLFTASQNLITPLPSNRLNAPDSLL 101
           S Q LT+  L  LVL+ +S G F +LL NV+A PSVL TA QNL    PS       SLL
Sbjct: 23  STQPLTKANLRNLVLNHYSHGTFSNLLHNVIALPSVLLTACQNLSNSPPST---TKTSLL 82

Query: 102 SFDMVSSCFSVEDMARELYENRFDVGACCVRLESSEEKGEFLVLPNLKLKVLLEAIRIVL 161
           +   VS+ FS++ M  E+++N+FD+ + CV++      GE L LPNLKLKVL+EAIR+VL
Sbjct: 83  T--SVSNHFSIDQMGHEIFQNKFDISSSCVKVAPPSPSGEPLFLPNLKLKVLIEAIRMVL 142

Query: 162 EIVYDERFVTFSYGGRVGMGRHTAIRYLKNSVQNPSWWFTVAFRRRKFDSVHVNKLCLLV 221
           EIVYDE+FVTFSYGGRVGMGRHTA+RYLKN+V NPSWWF V+F   KFD  +V+KLCL +
Sbjct: 143 EIVYDEKFVTFSYGGRVGMGRHTAVRYLKNNVTNPSWWFNVSFCPNKFDEFNVDKLCLFI 202

Query: 222 QEKIKDDILILMLKKLFELEAVQIELGGCYLGRGFPQESGLCSILSNIYFNGFDKELQQI 281
            +K+KD +LI ++KKLFE + V+IELGGCYLGRGFPQE GLCSIL N+YF+GFD+E+Q++
Sbjct: 203 GKKVKDAMLINVIKKLFECQVVRIELGGCYLGRGFPQECGLCSILINVYFDGFDREVQEM 262

Query: 282 RLEKNEENPKFSLD-----GTVSFHNPVKIYAVRYLDEILVITSGSKMLIMELKSQVLRY 341
           RL+ N +NPKF L+      +  F+ P K+YAVRYLDEILVITSGSKM I ELK +VL +
Sbjct: 263 RLQMNRKNPKFDLNELGFKNSNVFYKPEKMYAVRYLDEILVITSGSKMFIKELKDRVLDF 322

Query: 342 LEGNLELEVDRMNTAIHSAVSEKISFLGMELQAVPPSVLHPPMSEKAIRARKKYLRQKEV 401
           LE NL L+VDR+ TAIHSAVSEKI+FLGMELQAVPPSVLHPPMSEKAIRARKKYLRQKEV
Sbjct: 323 LEVNLGLKVDRVKTAIHSAVSEKINFLGMELQAVPPSVLHPPMSEKAIRARKKYLRQKEV 382

Query: 402 RAIELRNARERNRKKLGLKILGHVFKKLKRTD-GLKYEFQIEKEVTEIFRNWADEVVRDF 461
           RA+ELRNARERNRKKLGLKIL HVFKKLK+++ G  +EF+IE EVTEIFR WADEVV++F
Sbjct: 383 RALELRNARERNRKKLGLKILSHVFKKLKQSNNGFNFEFRIENEVTEIFRTWADEVVQEF 442

Query: 462 LESLEDNTEWHRPLSAGDFLSLKHIRNQLPVDLVNAYDRFQDQVNQHLNPLKAKKEKARE 521
           L+SLE    WHR LS GDFLSL+HIR+QLP DLV+AYD+FQ+QV++HL P+KA+   A E
Sbjct: 443 LQSLEGRWNWHRLLSRGDFLSLRHIRHQLPQDLVDAYDKFQEQVDKHLTPIKAR--NALE 502

Query: 522 DEKKRLGEEE--RYAKRTVEDLTRLCIKVEAPIELVRKAVKMIGFTNKMGRPRPISSLIA 581
           +E++R+ EEE  +YA+ TV+DLT+LC+KV APIELVRKAV+M GFTN MGRPRP+S L A
Sbjct: 503 EEERRVIEEEEQKYAEHTVDDLTKLCMKVSAPIELVRKAVRMAGFTNNMGRPRPVSLLFA 562

Query: 582 LEDTDIIKWYAGVGRRWLDFFCCCHNYKTVKTIVTYHLRFSCILTLAEKHESTKREAMKH 641
           LEDTDIIKWYAGVGRRWLDFFCCCHN+K VKT+V+YHLRFSCILTLA+KHESTK EA+KH
Sbjct: 563 LEDTDIIKWYAGVGRRWLDFFCCCHNFKMVKTVVSYHLRFSCILTLAQKHESTKHEAIKH 622

Query: 642 YSKDLKVFDLNGKEEMHFPTEREVKMLGERNLADPYPVDGAFSLFLIRLVTDEDSYPCIA 701
           YSKDLKV D+NG EE+HFPTER+VKM+G++NL+DP PVDGA SL LIRL ++E S+ C+A
Sbjct: 623 YSKDLKVSDMNGNEEVHFPTERDVKMMGDKNLSDPKPVDGAISLTLIRLASEEPSHSCVA 682

Query: 702 HFCNRTDSILYRVRLLQKTLNVNPSNGVEWVRGMGVIHESLNQRCLPLCADHISDLYMGK 761
           HFC+RTD+I+YRVRLLQ  LN+NPS+  +WV+GMG IHESLN++CL LCADHI+DLYMGK
Sbjct: 683 HFCDRTDTIMYRVRLLQNHLNLNPSDEAQWVKGMGAIHESLNRKCLSLCADHINDLYMGK 742

Query: 762 INLQDLDCTLSLDMD 769
           I LQD+DCT  +++D
Sbjct: 743 ITLQDIDCTSFVEVD 750

BLAST of Cp4.1LG01g08150 vs. TrEMBL
Match: A0A067K380_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_22465 PE=4 SV=1)

HSP 1 Score: 989.9 bits (2558), Expect = 1.7e-285
Identity = 496/764 (64.92%), Positives = 615/764 (80.50%), Query Frame = 1

Query: 19  SFNKLYSDRPLKHSPLLKLPFQHSVQT-----LTRPQLEALVLSRFSQGKFFDLLRNVVA 78
           S N ++    L +S L   P   + +T     +TR QL+ LVLS++S GKF +L++NVVA
Sbjct: 15  SSNPVFFSSKLLYSTLSLNPNHQNPKTPTPNPITRSQLKDLVLSQYSHGKFSNLIQNVVA 74

Query: 79  SPSVLFTASQNLITPLPSNRLNAPDSL-----LSFDMVSSCFSVEDMARELYENRFDVGA 138
            PSVL +AS+NL+ P   N   +P+S+       +  VS   S+E+M  +++ NRFD+ +
Sbjct: 75  LPSVLLSASENLV-PGSINAATSPESVGFTTHSLYYSVSKHLSIEEMGHDIFYNRFDIES 134

Query: 139 CCVRLESSEEKGEFLVLPNLKLKVLLEAIRIVLEIVYDERFVTFSYGGRVGMGRHTAIRY 198
            CV++E    KGEFLVLPNLKLKVL+EAIR+VLEI+YD+RF+TFSYGGRV MGRHTAIRY
Sbjct: 135 NCVKMEG---KGEFLVLPNLKLKVLIEAIRVVLEIIYDDRFITFSYGGRVNMGRHTAIRY 194

Query: 199 LKNSVQNPSWWFTVAFRRRKFDSVHVNKLCLLVQEKIKDDILILMLKKLFELEAVQIELG 258
           LKNSV+NPSWWF V F   KFD  +++KLCL ++EKIKD ILI ++K+LF    ++IE G
Sbjct: 195 LKNSVKNPSWWFNVCFNHFKFDQRNLDKLCLFIEEKIKDRILIDVIKRLFHCGVLRIEFG 254

Query: 259 GCYLGRGFPQESGLCSILSNIYFNGFDKELQQIRLEKNEENPKFS----LDGTVSFHNPV 318
           G YLGRGFPQE GLCSIL NIYFNGFD+E+Q++RL  +E+NPKF      + ++SF+ PV
Sbjct: 255 GFYLGRGFPQECGLCSILINIYFNGFDREIQEMRLRISEQNPKFEPKEVSERSISFYKPV 314

Query: 319 KIYAVRYLDEILVITSGSKMLIMELKSQVLRYLEGNLELEVDRMNTAIHSAVSEKISFLG 378
            +YAVRYLDEIL+ITSGSKM+ M+LK++VL +LE  LEL VD+ NTAIHSAVSEKI FLG
Sbjct: 315 NVYAVRYLDEILIITSGSKMMTMDLKNKVLSFLEEKLELNVDKTNTAIHSAVSEKIDFLG 374

Query: 379 MELQAVPPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILGHVFKKL 438
           MELQAVPPSVLHPPMSEKAIRARKKYL+QKEVR++ELRNARERNRKKLGLKIL +VFKKL
Sbjct: 375 MELQAVPPSVLHPPMSEKAIRARKKYLKQKEVRSLELRNARERNRKKLGLKILSNVFKKL 434

Query: 439 KRTDGLKYEFQIEKEVTEIFRNWADEVVRDFLESLEDNTEWHRPLSAGDFLSLKHIRNQL 498
           K+++G K++FQIE EV EIF  WADEVV++FLESLE+   WHR L+AG+FLSL+HIR+QL
Sbjct: 435 KQSNGFKFDFQIENEVREIFATWADEVVQEFLESLEERWNWHRMLTAGEFLSLRHIRDQL 494

Query: 499 PVDLVNAYDRFQDQVNQHLNPLKAKKEKAREDEKKRLGEEERYAKRTVEDLTRLCIKVEA 558
           P DLVNAYD+FQ+QV++HL+P+K +K    E+ +    EE +YA+RTVEDLT+LC+KV A
Sbjct: 495 PQDLVNAYDKFQEQVDKHLSPVKVRKALEEEERRVEEDEERKYAERTVEDLTKLCMKVSA 554

Query: 559 PIELVRKAVKMIGFTNKMGRPRPISSLIALEDTDIIKWYAGVGRRWLDFFCCCHNYKTVK 618
           PIELVRKAVKM GFTN MGRPRPI  L  LED DIIKWY+GVGRRWLDFFCCCHN+K VK
Sbjct: 555 PIELVRKAVKMNGFTNNMGRPRPIHFLTVLEDADIIKWYSGVGRRWLDFFCCCHNFKMVK 614

Query: 619 TIVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVFDLNGKEEMHFPTEREVKMLGERN 678
           T+V YHLRFSCILTLAEKHE+TK EA+KHY+K+LKV D++G EE+HFPTE+EVKM+G++N
Sbjct: 615 TVVNYHLRFSCILTLAEKHEATKLEAIKHYTKNLKVTDVDGNEEVHFPTEKEVKMMGDKN 674

Query: 679 LADPYPVDGAFSLFLIRLVTDEDSYPCIAHFCNRTDSILYRVRLLQKTLNVNPSNGVEWV 738
           L+DP PVDGA SL LIRL  DE S  CIAHFC+RTD+I+YRVRL+Q  LN++P  G  WV
Sbjct: 675 LSDPKPVDGALSLALIRLAHDEPSGSCIAHFCDRTDTIMYRVRLMQNLLNMSPMKGERWV 734

Query: 739 RGMGVIHESLNQRCLPLCADHISDLYMGKINLQDLDCTLSLDMD 769
            GM  IHE +++ CLPLC+DHISDLY GKI LQD+DCT  +D+D
Sbjct: 735 PGMSAIHECIDRVCLPLCSDHISDLYTGKITLQDIDCTSFVDVD 774

BLAST of Cp4.1LG01g08150 vs. TrEMBL
Match: B9T609_RICCO (RNA binding protein, putative OS=Ricinus communis GN=RCOM_0160820 PE=4 SV=1)

HSP 1 Score: 973.4 bits (2515), Expect = 1.6e-280
Identity = 498/774 (64.34%), Positives = 619/774 (79.97%), Query Frame = 1

Query: 1   MLIHLRKIATPWKPNLSSSFNKLYSDRPLKHSPLLKLPFQHSVQTLTRPQLEALVLSRFS 60
           ML  L+ IA P+   + SS    YS  P+      K P    V  LT  QL+ALVLS++S
Sbjct: 1   MLFKLKAIA-PFNLRVLSSCRLFYSTVPINP----KTP----VNPLTGQQLKALVLSQYS 60

Query: 61  QGKFFDLLRNVVASPSVLFTASQNLITPLPSNRLNAPDSLLSFDMVSSCFSVEDMARELY 120
            GKF +L++NVVA PSVL +A++NL+T        +P+  L    VS  FS+E+M RE++
Sbjct: 61  HGKFVNLIQNVVALPSVLISAAENLVT--------SPNESLYLS-VSKHFSIEEMGREVF 120

Query: 121 ENRFDVGACCVRLESSEEKGEFLVLPNLKLKVLLEAIRIVLEIVYDERFVTFSYGGRVGM 180
           + RFD+ + C R  S   KGE LVLPNLKLKV +EAIR+VLEIVYD+RFVTF YGGRV M
Sbjct: 121 DKRFDLESHCARFAS---KGESLVLPNLKLKVFIEAIRVVLEIVYDDRFVTFCYGGRVNM 180

Query: 181 GRHTAIRYLKNSVQNPSWWFTVAFRRRKFDSVHVNKLCLLVQEKIKDDILILMLKKLFEL 240
           GRHTAIRYLKN+V++PSWWF+V F R KFDS +++KLCL ++EKI D +LI ++K+LFE 
Sbjct: 181 GRHTAIRYLKNTVKDPSWWFSVCFSRLKFDSRNLDKLCLFIEEKINDGVLIDVIKRLFEC 240

Query: 241 EAVQIELGGCYLGRGFPQESGLCSILSNIYFNGFDKELQQIRLEKNEENPKFS----LDG 300
             + IELGG +LG+G PQE GLC IL NIYFNGFDKE+QQIRL  +E+NPKF      + 
Sbjct: 241 GVLNIELGGFHLGKGLPQECGLCPILINIYFNGFDKEIQQIRLRISEQNPKFEPNEVSER 300

Query: 301 TVSFHNPVKIYAVRYLDEILVITSGSKMLIMELKSQVLRYLEGNLELEVDRMNTAIHSAV 360
           + S   P+K+YA+RYLDEILVITSGSKML M+LKS+VL++LE  LEL+VDR+ TAIHSAV
Sbjct: 301 SNSSFKPLKVYAIRYLDEILVITSGSKMLTMDLKSKVLKFLEEKLELKVDRIETAIHSAV 360

Query: 361 SEKISFLGMELQAVPPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKI 420
           SEKI FLGME QAVPPSVLHPPMSEKAIRARKK+LRQK+V+A+ELRNARE NRKKLGLKI
Sbjct: 361 SEKIDFLGMEFQAVPPSVLHPPMSEKAIRARKKFLRQKKVKALELRNARESNRKKLGLKI 420

Query: 421 LGHVFKKLKRTDGLKYEFQIEKEVTEIFRNWADEVVRDFLESLEDNTEWHRPLSAGDFLS 480
           L HVFKKLK+++G K+E QIE EV +IF  WADEVV++FL SL++   WHR L+AGDFLS
Sbjct: 421 LSHVFKKLKQSNGFKFEVQIENEVRKIFATWADEVVQEFLGSLDERWNWHRMLTAGDFLS 480

Query: 481 LKHIRNQLPVDLVNAYDRFQDQVNQHLNPLKAKKEKAREDEKKRLGEEE--RYAKRTVED 540
           L+HIR+QLP DL++AYD+FQ QV+++L+P+KA+  K  E+E++R+ EEE  +YA+RT+ED
Sbjct: 481 LRHIRDQLPEDLIDAYDKFQGQVDKYLSPVKAR--KVLEEEQRRVEEEEQRKYAERTMED 540

Query: 541 LTRLCIKVEAPIELVRKAVKMIGFTNKMGRPRPISSLIALEDTDIIKWYAGVGRRWLDFF 600
           LTRLC+KV APIELVRKAVKM GFTN MGRPRPI+ L  LED DIIKWYAGVGRRWLDFF
Sbjct: 541 LTRLCMKVSAPIELVRKAVKMAGFTNNMGRPRPINFLTVLEDADIIKWYAGVGRRWLDFF 600

Query: 601 CCCHNYKTVKTIVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVFDLNGKEEMHFPTE 660
           CCCHN+K VKT+V+YHLRFSCILTLAEKHE+TK EA++HY+KDLK+ D++G EE++FPTE
Sbjct: 601 CCCHNFKMVKTVVSYHLRFSCILTLAEKHEATKCEAIRHYTKDLKIHDIDGNEEVYFPTE 660

Query: 661 REVKMLGERNLADPYPVDGAFSLFLIRLVTDEDSYPCIAHFCNRTDSILYRVRLLQKTLN 720
           REVKM+G++NL+DP PVDGA SL  IRL  +E S+ C+AHFCNRTD+I+YRVRLLQ  LN
Sbjct: 661 REVKMMGDKNLSDPKPVDGALSLVFIRLAVNEPSHSCVAHFCNRTDTIMYRVRLLQNLLN 720

Query: 721 VNPSNGVEWVRGMGVIHESLNQRCLPLCADHISDLYMGKINLQDLDCTLSLDMD 769
           ++P+ GV W+ GM  IHE +++ CLPLC+ HISDLY GKI LQD+DCT  L +D
Sbjct: 721 LSPTRGVNWIPGMITIHECIDRTCLPLCSVHISDLYTGKITLQDIDCTSFLHVD 751

BLAST of Cp4.1LG01g08150 vs. TrEMBL
Match: V4LD34_EUTSA (Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10012522mg PE=4 SV=1)

HSP 1 Score: 922.5 bits (2383), Expect = 3.3e-265
Identity = 473/758 (62.40%), Positives = 588/758 (77.57%), Query Frame = 1

Query: 19  SFNKLYSDRPLKHSPLLKLPFQHSVQTLTRPQLEALVLSRFSQGKFFDLLRNVVASPSVL 78
           SF    S R L  + LL    Q + + L + +LEALVL ++S GKF+ L++N VA PSVL
Sbjct: 16  SFFVSSSLRNLSTASLLLNSDQTTGEPLVKSELEALVLKQYSHGKFYSLVKNAVALPSVL 75

Query: 79  FTASQNLITPLPSNRLNAPDSLLSFDMVSSCFSVEDMARELYENRFDVGACCVRLESSEE 138
             A QNL   L +N  +   S    D VS  FS+E+M RE+ E +FD+ +CCV   SS E
Sbjct: 76  LAACQNL--SLAAN--SGVSSTELADCVSRRFSIEEMGREIREGKFDIRSCCVEFVSSRE 135

Query: 139 KG--EFLVLPNLKLKVLLEAIRIVLEIVYDERFVTFSYGGRVGMGRHTAIRYLKNSVQNP 198
            G  E LVLPNLKLKVL+EAIR+VLEIVYD+RF TFSYGGRVGMGRHTAIRYLKNSV+NP
Sbjct: 136 NGRCESLVLPNLKLKVLIEAIRMVLEIVYDDRFATFSYGGRVGMGRHTAIRYLKNSVENP 195

Query: 199 SWWFTVAFRRRKFDSVHVNKLCLLVQEKIKDDILILMLKKLFELEAVQIELGGCYLGRGF 258
            WWF V+F R  FD  +V+KLC  V EKI D +LI M+KKLFE   ++IELGGC  GRGF
Sbjct: 196 RWWFRVSFAREMFDDRNVDKLCGFVGEKINDGLLIEMIKKLFEFGILRIELGGCNSGRGF 255

Query: 259 PQESGLCSILSNIYFNGFDKELQQIRLEKNEENPKFSLDGTVS-----FHNPVKIYAVRY 318
           PQE GL SIL N+YF+G DKE+Q +RL+   +NP+ S  G        F  PV +YAVRY
Sbjct: 256 PQECGLSSILINVYFDGLDKEIQDMRLKTKLKNPRVSDTGEEESTCNVFFKPVNLYAVRY 315

Query: 319 LDEILVITSGSKMLIMELKSQVLRYLEGNLELEVDRMNTAIHSAVSEKISFLGMELQAVP 378
           LDEIL+ITSGSKML M+LK +++  LE  L+L+VDR+NTAIHSAVSEKISFLGM LQAVP
Sbjct: 316 LDEILLITSGSKMLTMDLKKRIVDILEQRLDLKVDRVNTAIHSAVSEKISFLGMYLQAVP 375

Query: 379 PSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILGHVFKKLKRTDGLK 438
           PSVL PPMSEKA+RA KKY RQKEVR +ELRNARERNRKKLGLKI  HV KKLK+++G +
Sbjct: 376 PSVLRPPMSEKAVRAMKKYQRQKEVRRLELRNARERNRKKLGLKIFRHVLKKLKQSNGFR 435

Query: 439 YEFQIEKEVTEIFRNWADEVVRDFLESLEDNTEWHRPLSAGDFLSLKHIRNQLPVDLVNA 498
            E++IE EV ++F+ W +EV+++FL SLE+  +WH  L+ GDFLSL+HIR +LP DL++A
Sbjct: 436 CEYEIENEVRDVFQRWEEEVMQEFLGSLEERWKWHWLLTRGDFLSLRHIREKLPQDLIDA 495

Query: 499 YDRFQDQVNQHLNPLKAKKEKAREDEKKRLGEEE--RYAKRTVEDLTRLCIKVEAPIELV 558
           YD FQ+QV++HL P +AK  +  EDE++R+ EEE  RYA+RTVEDLT+LC+KV AP EL+
Sbjct: 496 YDEFQEQVDKHLAPTQAK--RVLEDEERRVEEEEEQRYAERTVEDLTKLCMKVSAPEELI 555

Query: 559 RKAVKMIGFTNKMGRPRPISSLIALEDTDIIKWYAGVGRRWLDFFCCCHNYKTVKTIVTY 618
           RKAVK++GFTN +GRPRPIS L+ALED+DIIKWYAGVGR+WLDFFCCCHNY+ VK IV+Y
Sbjct: 556 RKAVKLVGFTNSLGRPRPISHLLALEDSDIIKWYAGVGRKWLDFFCCCHNYRMVKIIVSY 615

Query: 619 HLRFSCILTLAEKHESTKREAMKHYSKDLKVFDLNGKEEMHFPTEREVKMLGERNLADPY 678
           H+RFSCILTLAEKH STKREA++HY+KDLKV DLNG EE HFP EREVKM+G++NL+DP 
Sbjct: 616 HMRFSCILTLAEKHRSTKREAIRHYTKDLKVCDLNGSEEAHFPLEREVKMMGDKNLSDPR 675

Query: 679 PVDGAFSLFLIRLVTDEDSYPCIAHFCNRTDSILYRVRLLQKTLNVNPSNGVEWVRGMGV 738
           PVDG  SL LIRL +DE  + C A FC R+D+I+YRV LLQ  L++NP +  +WV GMG 
Sbjct: 676 PVDGTLSLLLIRLASDEPLHSCAASFCERSDTIMYRVHLLQNRLHINPLDEEKWVHGMGT 735

Query: 739 IHESLNQRCLPLCADHISDLYMGKINLQDLDCTLSLDM 768
           IH +LN++CLPLC+ HIS +Y+GK+ LQD+D +  +D+
Sbjct: 736 IHSALNRKCLPLCSTHISHVYLGKMTLQDVDGSSFVDL 767

BLAST of Cp4.1LG01g08150 vs. TAIR10
Match: AT5G04050.2 (AT5G04050.2 RNA-directed DNA polymerase (reverse transcriptase))

HSP 1 Score: 592.4 bits (1526), Expect = 3.9e-169
Identity = 322/527 (61.10%), Positives = 398/527 (75.52%), Query Frame = 1

Query: 39  FQHSVQTLTRP----QLEALVLSRFSQGKFFDLLRNVVASPSVLFTASQNLITPLPSNRL 98
           F +S QT+T P    +LEALVL ++S GKF+ L++N V+ P VL  A QNL        L
Sbjct: 33  FLNSDQTITEPLVKSELEALVLKQYSHGKFYSLVKNAVSLPCVLLAACQNL-------SL 92

Query: 99  NAPDSLLSFDMVSSCFSVEDMARELYENRFDVGACCVRLESSEEKGEFLVLPNLKLKVLL 158
           +A  S    D VS  FS+E+M RE+ E RFD+ +CCV   SS      LVLPNLKLKVL+
Sbjct: 93  SANSSGDLADRVSRRFSIEEMGREIREGRFDIRSCCVEFISSS-----LVLPNLKLKVLI 152

Query: 159 EAIRIVLEIVYDERFVTFSYGGRVGMGRHTAIRYLKNSVQNPSWWFTVAFRRRKFDSVHV 218
           EAIR+VLEIVYD+RF TFSYGGRVGMGRHTAIRYLKNSV+NP WWF V+F R  F+  +V
Sbjct: 153 EAIRMVLEIVYDDRFATFSYGGRVGMGRHTAIRYLKNSVENPRWWFRVSFAREMFEERNV 212

Query: 219 NKLCLLVQEKIKDDILILMLKKLFELEAVQIELGGCYLGRGFPQESGLCSILSNIYFNGF 278
           + LC  V EKI D +LI M+KKLFE   ++IELGGC  GRGFPQE GLCSIL N+YF+G 
Sbjct: 213 DILCGFVGEKINDVMLIEMIKKLFEFGILKIELGGCNSGRGFPQECGLCSILINVYFDGL 272

Query: 279 DKELQQIRLEKNEENPKFSLDGTVS----FHNPVKIYAVRYLDEILVITSGSKMLIMELK 338
           DKE+Q +RL+   +NP+       S    F  PV IYAVRYLDEILVITSGSKML M+LK
Sbjct: 273 DKEIQDLRLKMKVKNPRVGTGDEESTGNVFFKPVNIYAVRYLDEILVITSGSKMLTMDLK 332

Query: 339 SQVLRYLEGNLELEVDRMNTAIHSAVSEKISFLGMELQAVPPSVLHPPMSEKAIRARKKY 398
            +++  LE  LEL VDR+NT+IHSAVSEKI+FLGM LQAVPPSVL PP SEKA+RA KKY
Sbjct: 333 KRIVDILEQRLELRVDRLNTSIHSAVSEKINFLGMYLQAVPPSVLRPPKSEKAVRAMKKY 392

Query: 399 LRQKEVRAIELRNARERNRKKLGLKILGHVFKKLKRTDGLKYEFQIEKEVTEIFRNWADE 458
            RQK+VR +ELRNARERNRK LGLKI  HV KK+K+++G K+E +IE EV +IF++W +E
Sbjct: 393 QRQKDVRKLELRNARERNRKTLGLKIFRHVLKKIKQSNGFKFEGEIENEVRDIFQSWGEE 452

Query: 459 VVRDFLESLEDNTEWHRPLSAGDFLSLKHIRNQLPVDLVNAYDRFQDQVNQHLNPLKAKK 518
           V++DF+ SLE+  +WH  L+ GDFLSL+HIR +LP DL++AYD FQ+QV++HL P +AK 
Sbjct: 453 VMQDFMGSLEERWKWHWLLTRGDFLSLRHIREKLPQDLIDAYDEFQEQVDKHLAPTQAK- 512

Query: 519 EKAREDEKKRLGEEE--RYAKRTVEDLTRLCIKVEAPIELVRKAVKM 556
            K  EDE++R+ EEE  RYA+RTVEDLT+LC+KV AP ELVRKA+K+
Sbjct: 513 -KVLEDEERRVEEEEEQRYAERTVEDLTKLCMKVSAPEELVRKAIKV 545

BLAST of Cp4.1LG01g08150 vs. TAIR10
Match: AT1G74350.1 (AT1G74350.1 Intron maturase, type II family protein)

HSP 1 Score: 197.2 bits (500), Expect = 3.7e-50
Identity = 192/753 (25.50%), Positives = 320/753 (42.50%), Query Frame = 1

Query: 50  QLEALVLSRFSQGKFFDLLRNVVASPSVLFTASQNLITPLPSNRLNAPDSLLSFDMVSSC 109
           +L+  V  +   GKF DLL+ V+A P  L  A   +       RLN   S +S    +  
Sbjct: 45  RLKKRVKEQCINGKFSDLLKKVIARPETLRDAYDCI-------RLN---SNVSITERNGS 104

Query: 110 FSVEDMARELYENRFDVGACCVRLESSEEKGEFLVLPNLKLKVLLEAIRIVLEIVYDERF 169
            + + +A EL    FDV +    + + ++  E LVLP++ LKV+ EAIRIVLE+V+   F
Sbjct: 105 VAFDSIAEELSSGVFDVASNTFSIVARDKTKEVLVLPSVALKVVQEAIRIVLEVVFSPHF 164

Query: 170 VTFSYGGRVGMGRHTAIRYLKNSVQNPSWWFTVAFRRRKFDSVHVNKLCLLVQEKIKDDI 229
              S+  R G GR +A++Y+ N++    W FT++  ++   SV  N L ++ +EK++D  
Sbjct: 165 SKISHSCRSGRGRASALKYINNNISRSDWCFTLSLNKKLDVSVFENLLSVM-EEKVEDSS 224

Query: 230 LILMLKKLFELEAVQIELGGCYLGRGFPQESGLCSILSNIYFNGFDKELQQIRLEKNEEN 289
           L ++L+ +FE   + +E GG   G G PQE  L  +L NIY + FD E  +I +     +
Sbjct: 225 LSILLRSMFEARVLNLEFGGFPKGHGLPQEGVLSRVLMNIYLDRFDHEFYRISM----RH 284

Query: 290 PKFSLDGTVSFHNP---VKIYAVRYLDEILVITSGSKMLIMELKSQVLRYLEGNLELEVD 349
               LD      +P   ++ +  R   E  + ++  +   + L+    R+++  +   V 
Sbjct: 285 EALGLDSKTDEDSPGSKLRSWFRRQAGEQGLKSTTEQ--DVALRVYCCRFMD-EIYFSVS 344

Query: 350 RMNTAIHSAVSEKISFLGMELQAVPPSVLHPPMSE-------------------KAIRAR 409
                     SE I FL   L         P   E                     ++A 
Sbjct: 345 GPKKVASDIRSEAIGFLRNSLHLDITDETDPSPCEATSGLRVLGTLVRKNVRESPTVKAV 404

Query: 410 KKYLRQKEVRAIELRNARERNRKKLGLKILGHVFKKLKRTD--GL--------------K 469
            K   +  + A++   A      ++G K LGH  KK+K ++  GL              K
Sbjct: 405 HKLKEKVRLFALQKEEAWTLGTVRIGKKWLGHGLKKVKESEIKGLADSNSTLSQISCHRK 464

Query: 470 YEFQIEKEVTEIFRNWADEVVRDFLESLEDNTEWHRPLSAGDFLSLKH-IRNQLPVDLVN 529
              + +     + R W ++V+R   +  E            +F+  KH +   +P +L +
Sbjct: 465 AGMETDHWYKILLRIWMEDVLRTSADRSE------------EFVLSKHVVEPTVPQELRD 524

Query: 530 AYDRFQDQVNQHLNPLKAKKEKAREDEKKRLGEEERYAKRTVEDLTRLCIKVEAPIELVR 589
           A+ +FQ+    +++   A  E      +               D       V AP   + 
Sbjct: 525 AFYKFQNAAAAYVSSETANLEALLPCPQS-------------HDRPVFFGDVVAPTNAIG 584

Query: 590 KAVKMIGFTNKMGRPRPISSLIALEDTDIIKWYAGVGRRWLDFFCCCHNYKTVKTIVTYH 649
           + +   G     G  R  S LI L+   II WY+G+ RRW+ ++  C N+  +K ++   
Sbjct: 585 RRLYRYGLITAKGYARSNSMLILLDTAQIIDWYSGLVRRWVIWYEGCSNFDEIKALIDNQ 644

Query: 650 LRFSCILTLAEKHESTKREAMKHYSKDLKVFDLNGKEEMHFPTEREVKMLGERNLADPYP 709
           +R SCI TLA K+   + E  K    +L         E     E+      +R+    Y 
Sbjct: 645 IRMSCIRTLAAKYRIHENEIEKRLDLELSTIPSAEDIEQEIQHEKLDSPAFDRDEHLTYG 704

Query: 710 V--DGAFSLFLIRLVTDEDSYPCIAHFCNRTDSILYRVRLLQKTLNVNPSNGVEWVRGMG 761
           +   G   L L RLV++     C    C+     +Y +  +++           W  G  
Sbjct: 705 LSNSGLCLLSLARLVSESRPCNCFVIGCSMAAPAVYTLHAMER------QKFPGWKTGFS 748

BLAST of Cp4.1LG01g08150 vs. TAIR10
Match: ATMG00520.1 (ATMG00520.1 Intron maturase, type II family protein)

HSP 1 Score: 73.2 bits (178), Expect = 8.0e-13
Identity = 45/125 (36.00%), Positives = 69/125 (55.20%), Query Frame = 1

Query: 540 IKVEAPIELVRKAVKMIGFTNKMGRPRPI--SSLIALEDTDIIKWYAGVGRRWLDFFCCC 599
           IK+EAPI+ + + ++  G  ++  RP PI  + L  + D DI+ W AG+    L ++ C 
Sbjct: 533 IKIEAPIKKILRRLRDRGIISRR-RPWPIHVACLTNVSDEDIVNWSAGIAISPLSYYRCR 592

Query: 600 HNYKTVKTIVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVFDL-NGKEEMHFPTERE 659
            N   V+TIV + +R+S I TLA KH+S+    +  YSKD  + +   GK    FP   E
Sbjct: 593 DNLYQVRTIVDHQIRWSAIFTLAHKHKSSAPNIILKYSKDSNIVNQEGGKILAEFPNSIE 652

Query: 660 VKMLG 662
           +  LG
Sbjct: 653 LGKLG 656

BLAST of Cp4.1LG01g08150 vs. NCBI nr
Match: gi|778676093|ref|XP_011650528.1| (PREDICTED: uncharacterized protein LOC101217546 [Cucumis sativus])

HSP 1 Score: 1310.0 bits (3389), Expect = 0.0e+00
Identity = 664/770 (86.23%), Positives = 702/770 (91.17%), Query Frame = 1

Query: 1   MLIHLRKIATPWKPNLSSSFNKLYSDRPLK--HSPLLKLPFQHSVQTLTRPQLEALVLSR 60
           M IH RKIAT  KP  S+S NKLYS  PLK  HSP L LP QHS +TLT  QL+ALVLSR
Sbjct: 1   MFIHFRKIATSLKPKFSNSLNKLYSHLPLKLKHSPELNLPSQHSPETLTSSQLKALVLSR 60

Query: 61  FSQGKFFDLLRNVVASPSVLFTASQNLITPLPSNRLNAPDSLLSFDMVSSCFSVEDMARE 120
           FS GKF DL +NVVASPSVL TASQNLITP  SN   APDSL  FD+VS CFSVE MARE
Sbjct: 61  FSHGKFVDLFQNVVASPSVLLTASQNLITPPFSN---APDSLPLFDLVSKCFSVEVMARE 120

Query: 121 LYENRFDVGACCVRLESSEEKGEFLVLPNLKLKVLLEAIRIVLEIVYDERFVTFSYGGRV 180
           L ENRFDVGACCV +   EEKGE L+LPNLKLKVL+EAIR+V+EIVYDERFVTFSYGGRV
Sbjct: 121 LSENRFDVGACCVPMAPLEEKGESLLLPNLKLKVLIEAIRMVMEIVYDERFVTFSYGGRV 180

Query: 181 GMGRHTAIRYLKNSVQNPSWWFTVAFRRRKFDSVHVNKLCLLVQEKIKDDILILMLKKLF 240
           GMGRHTAIRYLKNSVQNPSWWFTVAFRR+KF+SVHVN LCLL+QEKIKDDILI ML+KLF
Sbjct: 181 GMGRHTAIRYLKNSVQNPSWWFTVAFRRKKFESVHVNTLCLLMQEKIKDDILIYMLRKLF 240

Query: 241 ELEAVQIELGGCYLGRGFPQESGLCSILSNIYFNGFDKELQQIRLEKNEENPKFSLDGTV 300
           E+EA+QIELGGCYLGRGFPQESGLCSIL NIYFNGFDKE+Q+IRL+K EENPKF+LD  V
Sbjct: 241 EVEAIQIELGGCYLGRGFPQESGLCSILLNIYFNGFDKEIQRIRLQKIEENPKFNLDEIV 300

Query: 301 SFHNPVKIYAVRYLDEILVITSGSKMLIMELKSQVLRYLEGNLELEVDRMNTAIHSAVSE 360
           SFHNPVKIYAVRYLDEILVITSGSKML MELKSQVL+YLEGNLELEVDRMNTAIHSAVSE
Sbjct: 301 SFHNPVKIYAVRYLDEILVITSGSKMLTMELKSQVLKYLEGNLELEVDRMNTAIHSAVSE 360

Query: 361 KISFLGMELQAVPPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILG 420
           KI FLGMEL+AVPPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKIL 
Sbjct: 361 KIGFLGMELRAVPPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILS 420

Query: 421 HVFKKLKRTDGLKYEFQIEKEVTEIFRNWADEVVRDFLESLEDNTEWHRPLSAGDFLSLK 480
           HVFKK KRT G K EFQIEKEV  IFRNWADEVV+DF ES ED+ EWHR LSAGDFLSLK
Sbjct: 421 HVFKKFKRTGGFKSEFQIEKEVRSIFRNWADEVVQDFFESSEDHAEWHRVLSAGDFLSLK 480

Query: 481 HIRNQLPVDLVNAYDRFQDQVNQHLNPLKAKKEKAREDEKKRLGEEERYAKRTVEDLTRL 540
           HIRNQLP DLVNAYDRFQDQVN+HLNP+K KKEKAREDE+KRL EEE YAKRTV+DLTRL
Sbjct: 481 HIRNQLPEDLVNAYDRFQDQVNKHLNPVKFKKEKAREDEEKRLEEEELYAKRTVDDLTRL 540

Query: 541 CIKVEAPIELVRKAVKMIGFTNKMGRPRPISSLIALEDTDIIKWYAGVGRRWLDFFCCCH 600
           CIKV+APIELVRKAV+M+GFTN MGRPRPISSLI LED DIIKWY+GVGRRWLDFFCCCH
Sbjct: 541 CIKVDAPIELVRKAVRMVGFTNNMGRPRPISSLIVLEDADIIKWYSGVGRRWLDFFCCCH 600

Query: 601 NYKTVKTIVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVFDLNGKEEMHFPTEREVK 660
           NYK VKT+VTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVFDLNG EEMHFPTE+EVK
Sbjct: 601 NYKMVKTVVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVFDLNGNEEMHFPTEKEVK 660

Query: 661 MLGERNLADPYPVDGAFSLFLIRLVTDEDSYPCIAHFCNRTDSILYRVRLLQKTLNVNPS 720
           MLGERNLADPYPVDGA SL LIRL TDE SYPCIA+FCNRT+SILYRVRLLQ+TLNVNPS
Sbjct: 661 MLGERNLADPYPVDGALSLLLIRLATDEASYPCIANFCNRTNSILYRVRLLQRTLNVNPS 720

Query: 721 NGVEWVRGMGVIHESLNQRCLPLCADHISDLYMGKINLQDLDCTLSLDMD 769
           +GVEWVRGMGVIHESLNQRCLPLCADHISDLYMGKINLQDLDCTLSLDMD
Sbjct: 721 DGVEWVRGMGVIHESLNQRCLPLCADHISDLYMGKINLQDLDCTLSLDMD 767

BLAST of Cp4.1LG01g08150 vs. NCBI nr
Match: gi|659126896|ref|XP_008463418.1| (PREDICTED: putative COX1/OXI3 intron 1 protein [Cucumis melo])

HSP 1 Score: 1294.3 bits (3348), Expect = 0.0e+00
Identity = 657/768 (85.55%), Positives = 697/768 (90.76%), Query Frame = 1

Query: 1   MLIHLRKIATPWKPNLSSSFNKLYSDRPLKHSPLLKLPFQHSVQTLTRPQLEALVLSRFS 60
           M IH RKIAT  KP  S+S NKLYS  PL+   +L LP QHS ++LT  QL+ALVLSRFS
Sbjct: 1   MFIHFRKIATSLKPKFSNSLNKLYSHLPLRKK-VLNLPSQHSPESLTSSQLKALVLSRFS 60

Query: 61  QGKFFDLLRNVVASPSVLFTASQNLITPLPSNRLNAPDSLLSFDMVSSCFSVEDMARELY 120
            GKF DL +NVVASPSVL TAS+NLITP  SN   APDSL   D+VS CFSVE MAREL 
Sbjct: 61  HGKFVDLFQNVVASPSVLLTASRNLITPPFSN---APDSLPLSDLVSKCFSVEVMARELS 120

Query: 121 ENRFDVGACCVRLESSEEKGEFLVLPNLKLKVLLEAIRIVLEIVYDERFVTFSYGGRVGM 180
           ENRFDVGACCV +   EEKGE LVLPNLKLKVL+EAIR+VLEIVYDERFVTFSYGGRVGM
Sbjct: 121 ENRFDVGACCVPMAPLEEKGESLVLPNLKLKVLIEAIRMVLEIVYDERFVTFSYGGRVGM 180

Query: 181 GRHTAIRYLKNSVQNPSWWFTVAFRRRKFDSVHVNKLCLLVQEKIKDDILILMLKKLFEL 240
           GRHTAIRYLKNSVQNPSWWFTVAFRR+KF+SVHVN LCLL+QEKIKDDILI ML+KLFE+
Sbjct: 181 GRHTAIRYLKNSVQNPSWWFTVAFRRKKFESVHVNTLCLLMQEKIKDDILIYMLRKLFEV 240

Query: 241 EAVQIELGGCYLGRGFPQESGLCSILSNIYFNGFDKELQQIRLEKNEENPKFSLDGTVSF 300
           EA+QIELGGCYLGRGFPQESGLCSIL NIYFNGFDKE+Q+IRL+KNEENPKF+LD  VSF
Sbjct: 241 EAIQIELGGCYLGRGFPQESGLCSILLNIYFNGFDKEIQRIRLQKNEENPKFNLDEIVSF 300

Query: 301 HNPVKIYAVRYLDEILVITSGSKMLIMELKSQVLRYLEGNLELEVDRMNTAIHSAVSEKI 360
           HNPVKIYAVRYLDEILVITSGSKML MELKSQVL+YLEGNLELEVDRMNTAIHSAVSEKI
Sbjct: 301 HNPVKIYAVRYLDEILVITSGSKMLTMELKSQVLKYLEGNLELEVDRMNTAIHSAVSEKI 360

Query: 361 SFLGMELQAVPPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILGHV 420
            FLGMELQAV PSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKIL H+
Sbjct: 361 GFLGMELQAVTPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILSHM 420

Query: 421 FKKLKRTDGLKYEFQIEKEVTEIFRNWADEVVRDFLESLEDNTEWHRPLSAGDFLSLKHI 480
           FKK KRT G K EFQIE EV  IFRNWADEVV+DF ES ED+ EWHR LSAGDFLSLKHI
Sbjct: 421 FKKFKRTGGFKSEFQIETEVRSIFRNWADEVVQDFFESSEDHAEWHRVLSAGDFLSLKHI 480

Query: 481 RNQLPVDLVNAYDRFQDQVNQHLNPLKAKKEKAREDEKKRLGEEERYAKRTVEDLTRLCI 540
           RNQLP DLVNAYDRFQ QVN+HLNP+K KKEKAREDE+KRL EEE YAKRTVEDLTRLCI
Sbjct: 481 RNQLPEDLVNAYDRFQYQVNKHLNPVKVKKEKAREDEEKRLEEEELYAKRTVEDLTRLCI 540

Query: 541 KVEAPIELVRKAVKMIGFTNKMGRPRPISSLIALEDTDIIKWYAGVGRRWLDFFCCCHNY 600
           KV+APIELVRKAV+M+GFTNKMGRPRPISSLIALED DIIKWY+GVGRRWLDFFCCCHNY
Sbjct: 541 KVDAPIELVRKAVRMVGFTNKMGRPRPISSLIALEDADIIKWYSGVGRRWLDFFCCCHNY 600

Query: 601 KTVKTIVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVFDLNGKEEMHFPTEREVKML 660
           K VKT+VTYHLRFSCILTLAEKHESTKREAMKHY KDLKVFDLNG EEMHFPTE+ VKML
Sbjct: 601 KMVKTVVTYHLRFSCILTLAEKHESTKREAMKHYGKDLKVFDLNGNEEMHFPTEKAVKML 660

Query: 661 GERNLADPYPVDGAFSLFLIRLVTDEDSYPCIAHFCNRTDSILYRVRLLQKTLNVNPSNG 720
           GERNLADPYPVDGA SL LIRL TDE SYPCIA+FCNRT+SILYRVRLLQ+TLNVNPS+G
Sbjct: 661 GERNLADPYPVDGALSLLLIRLATDEASYPCIANFCNRTNSILYRVRLLQRTLNVNPSDG 720

Query: 721 VEWVRGMGVIHESLNQRCLPLCADHISDLYMGKINLQDLDCTLSLDMD 769
           VEWV+GMGVIHESLNQRCLPLCADHISDLYMGKINLQDLDCTLSLDMD
Sbjct: 721 VEWVKGMGVIHESLNQRCLPLCADHISDLYMGKINLQDLDCTLSLDMD 764

BLAST of Cp4.1LG01g08150 vs. NCBI nr
Match: gi|359481896|ref|XP_002274379.2| (PREDICTED: uncharacterized protein LOC100264128 [Vitis vinifera])

HSP 1 Score: 1020.4 bits (2637), Expect = 1.7e-294
Identity = 522/774 (67.44%), Positives = 634/774 (81.91%), Query Frame = 1

Query: 1   MLIHLRKIATPWKPNLSSSFNKLYSDRPLKHSPLLKLPFQHSVQTLTRPQLEALVLSRFS 60
           ML++ ++IAT     L S  + L   R  +HS L   P  +    LT+PQL+ALV++ +S
Sbjct: 1   MLLNPKRIAT-----LHSRVSILSLLR--RHSTLPPNP--NPTTPLTKPQLKALVINHYS 60

Query: 61  QGKFFDLLRNVVASPSVLFTASQNLITPLPSNRLNAPDSLLSFDMVSSCFSVEDMARELY 120
           +GKF +L++NVVASP VL  A QNL TP  SN +N+    L+   V+  FSVE++ REL 
Sbjct: 61  RGKFSNLIQNVVASPPVLLLACQNL-TPR-SNDVNS----LASPAVALRFSVEELGRELG 120

Query: 121 ENRFDVGACCVRLESSEEKGEFLVLPNLKLKVLLEAIRIVLEIVYDERFVTFSYGGRVGM 180
           ENRFDV +CCVR+  S +KGE LVLPNLKLKV++EAIR+VLEIVYDER VTF+YGGRVGM
Sbjct: 121 ENRFDVESCCVRMVPSRKKGESLVLPNLKLKVVIEAIRMVLEIVYDERLVTFAYGGRVGM 180

Query: 181 GRHTAIRYLKNSVQNPSWWFTVAFRRRKFDSVHVNKLCLLVQEKIKDDILILMLKKLFEL 240
           GRHTAIRYLKNSVQNP+WWF V F R KF+  +VNKLCL+++EKIKD +LI +++KLFE 
Sbjct: 181 GRHTAIRYLKNSVQNPNWWFKVTFDREKFEHKNVNKLCLIIEEKIKDTVLIGIVRKLFEC 240

Query: 241 EAVQIELGGCYLGRGFPQESGLCSILSNIYFNGFDKELQQIRLEKNEENPKFS----LDG 300
           E +QIELGGCYLGRGFPQE GL SIL N+YFNGFDKE+Q +R+  N+ENP+F     L G
Sbjct: 241 EVLQIELGGCYLGRGFPQECGLSSILINVYFNGFDKEIQDLRIRTNQENPRFDSNEVLSG 300

Query: 301 TVSFHNPVKIYAVRYLDEILVITSGSKMLIMELKSQVLRYLEGNLELEVDRMNTAIHSAV 360
           +  F+ PVKIYAVRYLDEILVITSGSKML M+LK+QV+++LEG LEL+VDR+  AIHSA 
Sbjct: 301 SSVFYKPVKIYAVRYLDEILVITSGSKMLTMDLKNQVMKFLEGKLELKVDRLKMAIHSAT 360

Query: 361 SEKISFLGMELQAVPPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKI 420
            EKI FLGMELQAV PSVL PPMSEKAIRA+KKYLRQKEV+AIELRNARE NRKKLGLKI
Sbjct: 361 MEKIDFLGMELQAVQPSVLRPPMSEKAIRAQKKYLRQKEVKAIELRNARETNRKKLGLKI 420

Query: 421 LGHVFKKLKRTDGLKYEFQIEKEVTEIFRNWADEVVRDFLESLEDNTEWHRPLSAGDFLS 480
           L HVFKKLK++D  K++F IE EV EIFR WADEVV++FL SLE+   W+R LS GDFLS
Sbjct: 421 LAHVFKKLKQSDEFKFDFHIENEVREIFRTWADEVVKEFLGSLEEQANWYRMLSVGDFLS 480

Query: 481 LKHIRNQLPVDLVNAYDRFQDQVNQHLNPLKAKKEKAREDEKKRLGEEE--RYAKRTVED 540
           L+HIR+QLP +LV+AYD FQ+QV++H+ P+KA+  KA E+ ++R+ EEE  +YA+RTV++
Sbjct: 481 LRHIRHQLPQELVDAYDHFQEQVDKHIKPVKAR--KALEEAERRVVEEEEQKYAERTVQE 540

Query: 541 LTRLCIKVEAPIELVRKAVKMIGFTNKMGRPRPISSLIALEDTDIIKWYAGVGRRWLDFF 600
           LTRLC+KV+APIELVRKAVKM GFTN MGRPRPI  LIALEDTDIIKWYAGVGRRWLDFF
Sbjct: 541 LTRLCMKVDAPIELVRKAVKMAGFTNNMGRPRPIKLLIALEDTDIIKWYAGVGRRWLDFF 600

Query: 601 CCCHNYKTVKTIVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVFDLNGKEEMHFPTE 660
           CCCHN+K VKT+VTYHLRFSC+LTLAEKHESTK E ++HY+KDLKV D NG EE+HFP E
Sbjct: 601 CCCHNFKMVKTVVTYHLRFSCLLTLAEKHESTKLETIRHYTKDLKVSDFNGIEEVHFPAE 660

Query: 661 REVKMLGERNLADPYPVDGAFSLFLIRLVTDEDSYPCIAHFCNRTDSILYRVRLLQKTLN 720
           RE+KM+G++NL+DP PVDGA SL LIRL +DE +Y C+AHFC+R D+I+YRVRLLQ  LN
Sbjct: 661 REIKMMGDKNLSDPKPVDGALSLALIRLASDEPAYSCVAHFCDRKDTIVYRVRLLQNRLN 720

Query: 721 VNPSNGVEWVRGMGVIHESLNQRCLPLCADHISDLYMGKINLQDLDCTLSLDMD 769
           VNP +  +WV GMG IHE LN++CLPLC+DHI DLYMG I+LQD+DCT  +D+D
Sbjct: 721 VNPLDEKKWVPGMGAIHEGLNRKCLPLCSDHIHDLYMGTISLQDIDCTSFVDVD 757

BLAST of Cp4.1LG01g08150 vs. NCBI nr
Match: gi|1009159848|ref|XP_015898038.1| (PREDICTED: uncharacterized protein LOC107431593 [Ziziphus jujuba])

HSP 1 Score: 1015.8 bits (2625), Expect = 4.1e-293
Identity = 506/775 (65.29%), Positives = 626/775 (80.77%), Query Frame = 1

Query: 1   MLIHLRKIATPWKPNLSSSFNKLYSDRPLKHSPLLKLPFQHSVQTLTRPQLEALVLSRFS 60
           ML +LR++A  +    +  ++ L  + P            +S + L+  QL+ALVL+++S
Sbjct: 1   MLTNLRRLAVLFCVPRTKPYSVLLPNNP------------NSTEPLSAHQLKALVLAQYS 60

Query: 61  QGKFFDLLRNVVASPSVLFTASQNLIT-PLPSNRLNAPDSLLSFDMVSSCFSVEDMAREL 120
            G F +L++NVVA P+VL TA QN+ T P   +     DS     +VS  FS+ +M R+L
Sbjct: 61  HGNFSNLVQNVVALPAVLLTACQNITTSPTRDDADYQADSPSILHLVSKRFSIHEMGRQL 120

Query: 121 YENRFDVGACCVRLESSEEKG-EFLVLPNLKLKVLLEAIRIVLEIVYDERFVTFSYGGRV 180
           Y+N+FD+ ACCV +E S ++G E LVLP+LKLKVL+EA+R+VLE+VYDERFVTFSYGGRV
Sbjct: 121 YQNQFDIEACCVTIEPSTKRGGESLVLPSLKLKVLIEAVRMVLEVVYDERFVTFSYGGRV 180

Query: 181 GMGRHTAIRYLKNSVQNPSWWFTVAFRRRKFDSVHVNKLCLLVQEKIKDDILILMLKKLF 240
           GMGRHTAIRYLKNSVQNPSWWF V+F R KFDS HV KLC+ + EKIKD IL+ ++++LF
Sbjct: 181 GMGRHTAIRYLKNSVQNPSWWFNVSFGREKFDSTHVEKLCMFMGEKIKDRILVDIIRRLF 240

Query: 241 ELEAVQIELGGCYLGRGFPQESGLCSILSNIYFNGFDKELQQIRLEKNEENPKFSLDGTV 300
           E  AVQIELGGCY GRGFPQESGL SIL NIYF+GFDKE+Q +RL+KN+ENPKF  +  V
Sbjct: 241 ECNAVQIELGGCYFGRGFPQESGLSSILLNIYFDGFDKEIQDMRLQKNQENPKFDPNEVV 300

Query: 301 S----FHNPVKIYAVRYLDEILVITSGSKMLIMELKSQVLRYLEGNLELEVDRMNTAIHS 360
           S    FH PVK+YAVRYLD+ILVITSGSKML M+LKS VL+YLEG LEL+V+++ TA+HS
Sbjct: 301 SKDHVFHKPVKMYAVRYLDDILVITSGSKMLTMDLKSWVLKYLEGRLELKVNKVETALHS 360

Query: 361 AVSEKISFLGMELQAVPPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGL 420
           AVSEKI F+GMEL+A  PSVLHPPMSEKAIRARKKYLRQKEVR++EL+NARERNRKKLG+
Sbjct: 361 AVSEKIDFVGMELRAAEPSVLHPPMSEKAIRARKKYLRQKEVRSLELKNARERNRKKLGM 420

Query: 421 KILGHVFKKLKRTDGLKYEFQIEKEVTEIFRNWADEVVRDFLESLEDNTEWHRPLSAGDF 480
           KI  HVFKKLKR+DG K+++QIE EV EIF  WA+EV ++F  SLE+   WHR LSAGDF
Sbjct: 421 KIFSHVFKKLKRSDGFKFDYQIENEVREIFNTWANEVAQEFFGSLEERWNWHRMLSAGDF 480

Query: 481 LSLKHIRNQLPVDLVNAYDRFQDQVNQHLNPLKAKKEKAREDEKKRLGEEERYAKRTVED 540
           LSL+HIR+QLP +LV+AYD FQ+QV++HLNP KA+K    E+ ++   E ++YAK TVED
Sbjct: 481 LSLRHIRDQLPKELVDAYDNFQEQVDKHLNPTKARKLLEEEERRREEEENQKYAKTTVED 540

Query: 541 LTRLCIKVEAPIELVRKAVKMIGFTNKMGRPRPISSLIALEDTDIIKWYAGVGRRWLDFF 600
           LT+LC+KV+APIEL+RK VK+ GFTN MGRPRPIS L ALED DI+KWY GVGRRWLDFF
Sbjct: 541 LTKLCMKVDAPIELIRKTVKLAGFTNHMGRPRPISFLTALEDADIVKWYGGVGRRWLDFF 600

Query: 601 CCCHNYKTVKTIVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVFDLNGKEEMHFPTE 660
            CCHN+KTVKTIVTYHLRFSCILTLAEKHESTKREA+KHY+KDLK+FD++G EE+HFPTE
Sbjct: 601 SCCHNFKTVKTIVTYHLRFSCILTLAEKHESTKREAIKHYTKDLKIFDMSGNEEVHFPTE 660

Query: 661 REVKMLGERNLA-DPYPVDGAFSLFLIRLVTDEDSYPCIAHFCNRTDSILYRVRLLQKTL 720
           +EVKM+G++NL+ DP  VDGA  L LIRL +DE  Y C+AHFC RTD+++YRVRLLQ+ L
Sbjct: 661 KEVKMMGDKNLSVDPKLVDGALCLALIRLASDEPPYSCVAHFCERTDTVVYRVRLLQRQL 720

Query: 721 NVNPSNGVEWVRGMGVIHESLNQRCLPLCADHISDLYMGKINLQDLDCTLSLDMD 769
           NVNP +  +W++GMGVIHESL+ +CLPLC  H+ DLYMGKI LQD+DCT  +D+D
Sbjct: 721 NVNPLDVEKWIQGMGVIHESLHLKCLPLCPHHVHDLYMGKITLQDIDCTSFVDVD 763

BLAST of Cp4.1LG01g08150 vs. NCBI nr
Match: gi|657987378|ref|XP_008385840.1| (PREDICTED: uncharacterized protein LOC103448365 isoform X1 [Malus domestica])

HSP 1 Score: 1011.5 bits (2614), Expect = 7.7e-292
Identity = 514/776 (66.24%), Positives = 622/776 (80.15%), Query Frame = 1

Query: 1   MLIHLRKIAT--PW--KPNLSSSFNKLYSDRPLKHSPLLKLPFQHSVQTLTRPQLEALVL 60
           MLI+LR+  T  P+   P+ S S N +        S   ++P   S   L+  QL++LVL
Sbjct: 1   MLINLRRRITILPFHTNPSPSISLNLI--------STSTQIPRSDSTNXLSESQLKSLVL 60

Query: 61  SRFSQGKFFDLLRNVVASPSVLFTASQNLITPLPSNRLNAPDSLLSFDMVSSCFSVEDMA 120
           S+FS GKF +LL+NVVA P++L TA QNL +P   N      SLL  D VS  FS+ +M 
Sbjct: 61  SQFSHGKFTNLLQNVVALPALLLTACQNLTSPKTQNGNGLSPSLL--DSVSKRFSIHEMG 120

Query: 121 RELYENRFDVGACCVRLESSEEKGEFLVLPNLKLKVLLEAIRIVLEIVYDERFVTFSYGG 180
           R+L ENRFDVGA  V + +   +GE LVLPNLKLKVL+EAIR+VL IVYDERFVTFSYGG
Sbjct: 121 RQLCENRFDVGAXSVAMAAQRNRGESLVLPNLKLKVLIEAIRMVLGIVYDERFVTFSYGG 180

Query: 181 RVGMGRHTAIRYLKNSVQNPSWWFTVAFRRRKFDSVHVNKLCLLVQEKIKDDILILMLKK 240
           RV MGRHTAIRYLKNSV+NPSWWF+V F R KFD  +VNKLCL +QEKI D+ILI ++KK
Sbjct: 181 RVNMGRHTAIRYLKNSVENPSWWFSVGFXREKFDQRNVNKLCLFMQEKIDDEILIDVIKK 240

Query: 241 LFELEAVQIELGGCYLGRGFPQESGLCSILSNIYFNGFDKELQQIRLEKNEENPKFSLDG 300
           LFE  AV+IELG C LGRGFPQES L SIL NIYFNGFDKE+Q++RL KN+E+PKF  + 
Sbjct: 241 LFECGAVRIELGSCCLGRGFPQESXLTSILMNIYFNGFDKEIQEMRLRKNQEHPKFVSNE 300

Query: 301 TVS----FHNPVKIYAVRYLDEILVITSGSKMLIMELKSQVLRYLEGNLELEVDRMNTAI 360
            VS    F+ PVKIYAVRYLDEIL+ITSGSKML M+LK+ V++YLEG LEL+VD + TAI
Sbjct: 301 LVSKHDVFYKPVKIYAVRYLDEILIITSGSKMLTMDLKNWVVKYLEGTLELKVDGLKTAI 360

Query: 361 HSAVSEKISFLGMELQAVPPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKL 420
           HSAVSEKI F+GMELQAVPPSVL+PPMSEKA RARKKYLRQKEV+A+ELRNARERNRKKL
Sbjct: 361 HSAVSEKIDFMGMELQAVPPSVLNPPMSEKAXRARKKYLRQKEVKALELRNARERNRKKL 420

Query: 421 GLKILGHVFKKLKRTDGLKYEFQIEKEVTEIFRNWADEVVRDFLESLEDNTEWHRPLSAG 480
           GLKI+ HV+KKLKR+ G K E QIE EV EIFR W  E V++FL SLE+  EW+  LSAG
Sbjct: 421 GLKIMSHVYKKLKRSSGFKSEHQIENEVREIFRTWGGETVQEFLGSLEEXWEWYHKLSAG 480

Query: 481 DFLSLKHIRNQLPVDLVNAYDRFQDQVNQHLNPLKAKKEKAREDEKKRLGEEERYAKRTV 540
           DFLSL+HIR+QLP +LV+AYD+FQ QV++HLNP+KA++    E+ + +  EE++YAK TV
Sbjct: 481 DFLSLRHIRDQLPQELVDAYDKFQGQVHKHLNPVKARRALEEEERRAKEEEEQKYAKATV 540

Query: 541 EDLTRLCIKVEAPIELVRKAVKMIGFTNKMGRPRPISSLIALEDTDIIKWYAGVGRRWLD 600
           EDL +LC+K +APIEL+RK V++IGFTN MGRP+PI+ L ALED DIIKWYAG+GRRWL+
Sbjct: 541 EDLAKLCVKADAPIELIRKMVRLIGFTNHMGRPQPITLLTALEDADIIKWYAGIGRRWLE 600

Query: 601 FFCCCHNYKTVKTIVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVFDLNGKEEMHFP 660
           F+CCCHN+K VKT+VTYH+RFSCILTLAEKHESTK+EAMKHY+KDLKVFDL+G EE+HFP
Sbjct: 601 FYCCCHNFKMVKTVVTYHMRFSCILTLAEKHESTKQEAMKHYTKDLKVFDLDGNEEVHFP 660

Query: 661 TEREVKMLGERNLADPYPVDGAFSLFLIRLVTDEDSYPCIAHFCNRTDSILYRVRLLQKT 720
           TEREVKM+G++NL+DP PVDGA SL LIRL +DE  Y C+AHFCNRTD+I+YRVRLLQ  
Sbjct: 661 TEREVKMMGDKNLSDPKPVDGALSLALIRLASDEPPYSCVAHFCNRTDTIVYRVRLLQNH 720

Query: 721 LNVNPSNGVEWVRGMGVIHESLNQRCLPLCADHISDLYMGKINLQDLDCTLSLDMD 769
           LNVNP +  +WV GMG I+ESLN +C P+C DH  DLYMG+I  QD+DCT  +++D
Sbjct: 721 LNVNPMDEKKWVPGMGAINESLNLKCFPVCPDHTHDLYMGRITFQDIDCTSFVEVD 766
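
The identity and positives percentages reported in each HSP header above are the ratio of matching (or conservatively substituted) positions to the alignment length. The following is a minimal sketch, not part of the database output, that parses one such header line and recomputes both percentages. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical names chosen for illustration, and the pattern assumes the line layout shown on this page.

```python
import re

# Matches HSP summary lines such as
# "Identity = 192/753 (25.50%), Positives = 320/753 (42.50%), Query Frame = 1".
# The pattern also tolerates the "Postives" misspelling seen in some exports.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts plus recomputed percentages for one HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    ident, ident_len, pos, pos_len = map(int, (ident, ident_len, pos, pos_len))
    return {
        "identical": ident,
        "positive": pos,
        "alignment_length": ident_len,
        # Recomputed from the raw counts, rounded to two decimals.
        "identity_pct": round(100.0 * ident / ident_len, 2),
        "positives_pct": round(100.0 * pos / pos_len, 2),
        # Values as printed in the header, kept for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 192/753 (25.50%), Positives = 320/753 (42.50%), Query Frame = 1"
)
print(stats["identity_pct"], stats["reported_identity_pct"])
```

Comparing the recomputed value with the reported one is a cheap consistency check when headers are copied by hand or scraped from a page like this one.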

The following BLAST results are available for this feature:
Match Name   E-value   Identity   Description
AI2M_YEAST   4.6e-10   31.41      Putative COX1/OXI3 intron 2 protein OS=Saccharomyces cerevisiae (strain ATCC 204...
YMC6_SCHPO   1.3e-09   32.14      Uncharacterized 91 kDa protein in cob intron OS=Schizosaccharomyces pombe (strai...
LTRA_LACLM   4.3e-08   28.32      Group II intron-encoded protein LtrA OS=Lactococcus lactis subsp. cremoris (stra...
LTRA_LACLC   4.3e-08   28.32      Group II intron-encoded protein LtrA OS=Lactococcus lactis subsp. cremoris GN=lt...
NICA_PSEPU   1.6e-07   30.30      Putative nicotine oxidoreductase OS=Pseudomonas putida GN=nicA PE=3 SV=1

Match Name         E-value    Identity   Description
F6HLP1_VITVI       1.2e-294   67.44      Putative uncharacterized protein OS=Vitis vinifera GN=VIT_08s0007g05780 PE=4 SV=...
A0A061FIF3_THECC   7.7e-291   68.44      RNA-directed DNA polymerase OS=Theobroma cacao GN=TCM_033550 PE=4 SV=1
A0A067K380_JATCU   1.7e-285   64.92      Uncharacterized protein OS=Jatropha curcas GN=JCGZ_22465 PE=4 SV=1
B9T609_RICCO       1.6e-280   64.34      RNA binding protein, putative OS=Ricinus communis GN=RCOM_0160820 PE=4 SV=1
V4LD34_EUTSA       3.3e-265   62.40      Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10012522mg PE=4 SV=1

Match Name    E-value    Identity   Description
AT5G04050.2   3.9e-169   61.10      RNA-directed DNA polymerase (reverse transcriptase)
AT1G74350.1   3.7e-50    25.50      Intron maturase, type II family protein
ATMG00520.1   8.0e-13    36.00      ATMG00520.1 Intron maturase, type II family protein

Match Name                          E-value    Identity   Description
gi|778676093|ref|XP_011650528.1|    0.0e+00    86.23      PREDICTED: uncharacterized protein LOC101217546 [Cucumis sativus]
gi|659126896|ref|XP_008463418.1|    0.0e+00    85.55      PREDICTED: putative COX1/OXI3 intron 1 protein [Cucumis melo]
gi|359481896|ref|XP_002274379.2|    1.7e-294   67.44      PREDICTED: uncharacterized protein LOC100264128 [Vitis vinifera]
gi|1009159848|ref|XP_015898038.1|   4.1e-293   65.29      PREDICTED: uncharacterized protein LOC107431593 [Ziziphus jujuba]
gi|657987378|ref|XP_008385840.1|    7.7e-292   66.24      PREDICTED: uncharacterized protein LOC103448365 isoform X1 [Malus domestica]

The following terms have been associated with this gene:
Vocabulary: Biological Process
Term         Definition
GO:0006397   mRNA processing
Vocabulary: INTERPRO
Term         Definition
IPR024937    Domain_X
IPR000477    RT_dom
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0006397       mRNA processing
biological_process   GO:0006278       RNA-dependent DNA biosynthetic process
biological_process   GO:0006120       mitochondrial electron transport, NADH to ubiquinone
biological_process   GO:0015992       proton transport
biological_process   GO:0006814       sodium ion transport
biological_process   GO:0006744       ubiquinone biosynthetic process
cellular_component   GO:0005575       cellular_component
cellular_component   GO:0016020       membrane
molecular_function   GO:0003964       RNA-directed DNA polymerase activity
molecular_function   GO:0009055       electron carrier activity
molecular_function   GO:0008137       NADH dehydrogenase (ubiquinone) activity
molecular_function   GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cp4.1LG01g08150.1   Cp4.1LG01g08150.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term    IPR Description                Source    Source Term     Source Description                                    Alignment
IPR000477   Reverse transcriptase domain   PFAM      PF00078         RVT_1                                                 coord: 144..367; score: 1.
IPR024937   Domain X                       PFAM      PF01348         Intron_maturas2                                       coord: 541..648; score: 1.6
None        No IPR available               PANTHER   PTHR33642       FAMILY NOT NAMED                                      coord: 44..768; score:
None        No IPR available               PANTHER   PTHR33642:SF1   RNA-DIRECTED DNA POLYMERASE (REVERSE TRANSCRIPTASE)   coord: 44..768; score:
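
The `coord:` values in the InterPro table give the start and end of each hit on the protein. As a rough illustration, the sketch below converts them into span lengths. It assumes the coordinates are 1-based and inclusive, which is the usual convention for such annotations but is not stated on this page. The helper name `span_length` is hypothetical.

```python
import re

# Matches coordinate strings such as "coord: 144..367".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(coord_text):
    """Length in residues of a hit, assuming 1-based inclusive coordinates."""
    m = COORD.search(coord_text)
    if m is None:
        raise ValueError(f"no coordinates found in: {coord_text!r}")
    start, end = int(m.group(1)), int(m.group(2))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return end - start + 1  # inclusive range, so add one


# Coordinates copied from the InterPro table above.
hits = {
    "PF00078 (RVT_1)": "coord: 144..367",
    "PF01348 (Intron_maturas2)": "coord: 541..648",
    "PTHR33642": "coord: 44..768",
}
for name, coord in hits.items():
    print(name, span_length(coord))
```

If the coordinates turned out to be 0-based or end-exclusive, the `+ 1` would need to be dropped accordingly.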

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene              Organism                     Block
Cp4.1LG01g08150   Cucurbita pepo (Zucchini)    cpecpeB007
Cp4.1LG01g08150   Cucumber (Gy14) v1           cgycpeB0077
Cp4.1LG01g08150   Cucurbita maxima (Rimu)      cmacpeB350
Cp4.1LG01g08150   Cucurbita moschata (Rifu)    cmocpeB317
Cp4.1LG01g08150   Cucumber (Chinese Long) v2   cpecuB393
Cp4.1LG01g08150   Melon (DHL92) v3.5.1         cpemeB409