IVF0014015 (gene) Melon (IVF77) v1

Overview
NameIVF0014015
Typegene
OrganismCucumis melo L. ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionIntegrase
Locationchr05: 8508803 .. 8512848 (+)
RNA-Seq ExpressionIVF0014015
SyntenyIVF0014015
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTTCGAATGGTAACATGTTGCAACCCCAACTTCCAAGGTTCAGCGGAAAGAATTTTAATCAATGGAGTATTCAAATGAAAGTGTTATATGGCTCTCAAGAATTGTGGGATATTGTTGAAAGAGGATACACTGAAGTTGAGAATCAGAGTGAGCTCACAAATCAACAACTTGTTGAGTTAAGAGAAAATCGTAAGAAAGACAAAAAGGCTTTATTCTTCATTTATCAAGCTGTTGATGAATTTATTTTCGAGAGAATTTCAACAGCTACTTCTGCAAAGGCGGCTTGGGATATTCTAAGATCTACCTATCAAGGAGAAGATAAGGTAAAGATGATAAGGTTACAAGCTCTCAGATCCGAATTTGATTGCATTAAAATGAAAGAAACTGAAACTATTGAAGAATTTTTCAATCGTATTCTTGTAATTGTAAATAGTTTAAGATCAAATGGTGAAGAAGTAGGCGATCAAAGAGTTGTTGAAAAGATTCTTAGAAGTATGCCAAGAAAATTTGAGCATATCGTCGTTGCAATTGAAGAATCGAAAGACTTATCTACGTTGTCTATAAATAGCTTGATGGGTTCTCTTCAATCCCATGAGCTAAGATTAAAACAATTTGATGTTAACCCCGAGGAAGCTTTTCAAATGCAAACTTCATTTAGAGGCGGTTCACGTGGAAGGCGTGGTGGTCATGGAAGACGAGGAGGTGGAAGAAACTATGATAATAGAAGCGGTGCAAATTCTGAAAATTCACAAGAAAGCTCTTCTTTATCTCGAGGAAGAGGAAGCGGAAGAGGAAGAGGCTTTGGCAGAAACCAAGGAGGTGGTCGTGGTAATTTTCTCAAATTCAATGCTTTAATTGCGGAAAGTATGGTCATTTTCAAGCAAATTGTTGGGCACTAAAAAATGGAGTTGGAAATACCACCATGAATATGCATAAAGAACAAAAGAAAATTGATGAAGGCATTCTATTCCTCGCATGTAGTGTTCAAGACAATGTTGTAGAGCCTACATGGTATCTTGATAGTGGTTGTAGCAACCACATGACAGGAAATAGAAGTATATTTGTTACTTTGGATGAATCTTTCCAAAGTGAAGTGAAGACTGGTGATAATACCAGACTACAAGTCAAAGGCCAAGGTGATATTCTTGTGAAGACAAAGAAAGGGACAAAACGAGTTACAAATGTGTTCTATGTTCCAGGTCTAAAGCATAATCTTTTGAGTATTGGCCAACTGCTTCAACGAGGTTTAAAAGTTTCATTTGAAGGTGACATATGTGCAATCAAAGATCAGGCCGGTGTTCTTATTGCCAAGGTAAAAATGACTGCTAATAAGATGTTTCCTCTTAACTTTACATATGGTCAAATATCTTGCTTCAGCAGCATATTGAAGGATCCATCCTGGCTTTGGCATTTTAGATATGGTCACTTAAACTTCAAATCACTATCTTATTTGTGCAAAAATCATATGGTGAGAGGTATACAAAATATCAACCATGAGACAAATATTTGTGAAGTGTGTATTCTTGCAAAACATCATCGAGATTCATTTCCAACTGGGAAAGCTTGGAGAGCCTCTAAACCTCTCGAGTTGATTCATACAGATTTGTGTGGTCCTATGCGAACAACAACAAATGGAGGTAATCGATATTTCATAACCTTCATCGATGATTTCAGTAGAAAGTTGTGGATTTATTTTTTGAAAGAAAAGAGTGAAGCACTTGTATGTTTTAAATCCTTCAAAGCTTTTACTGAAAATCAAAGTGGTTACAAGATAAAAACTTTGAGATCTGACCGTGGTGGAGAATATATAGCTTTTGGTAATTTTTTCAAGGAGCAAGGAATTCATCATCAAATGACAGCTCGAATGACTCCACAGCAAAATGGAGTTGCAGAGAGAAAAAATAGAACAATCATGGAAATGGCGAGAAGTATGCTAAAAGCAAAAAATCTGCCAAACGAATTTTGGGGAGATGCTGTTGCATGTACTGTTTACATTCTAAATCGAGCTCCAACAAAGAGTGTTCCAGGTATGACTCCTTATGAAGCATGGTGTGGTGAGAAACCATCTGTTAGTCATTTGAGAGTGTTTGGGAGTATAGCTTATTCTCATATTCCAAATCAGCTAAGAGGCAAGCTTGATGATAAATCTGAAAAATGCATTATGGTAGGTTATAGTGAAAATTCTAAAGCTTATCGATTGTATAATCCTGTGTCAAGAAAAATTATTATCAGCAGAGATGTGATTTTCAGTGAAGATGAATCATGGAACTGGAATGACGACGTTGATGAAGCTAAAAGTCCATTTCATGTTAATATTGATGAAAATGAAGTTGCTCAAGAATTAGAGCAAGCAGAAATTCAAGCGATGGAGTCATCTTCGTCCTCAACGTCATCTTCCACAAGTAATGATGAAATCTCACCAAGGAGAATGAGGAGTATTCAAGAAATTTATAATACCACTAACAGGATTAATGATGATCATTTTGCTAATTTTGCATTATTTGCTGGTGTTGATCCTGTAACTTTTGATGAAGCCATCCAAGATGAGAAATGGAAGATTGCAATGGATCAAGAGATTGATGCGATAAGAAGAAATGAAACATGGGAGTTGATGGAGCTTCCGACAAACAAACAAGCTCTTGGAGTAAAATGGGTGTACAGAACAAAGTTGAAGTCAGATGGTAATGTTGAAAAATACAAGGCAAGACTTGTTGTAAAAGGCTACAAGCAGGAATATGGTGTGGATTATGAAGAAATATTTGCCCCTGTGACAAGAATTGAGACCATTCGATTGATTTTGTCATTAGCTGCTCAAAATGGATGGAAAGTTTATCAAATGGATGTAAAATCCGCTTTTTTGAATGGACACTTGAAGGAAGAGATATTTGTTGCACAACCTTTGGGCTATGTGCAAAGGGGAGAAGAAGAAAAAGTGTACAAGTTGAAAAAAGCTTTGTATGGATTGAAGCAAGCTCCGCGAGCTTGGTACAGTCGTATTGACAGTTTTTTTCTAAAGACAGGATTTCGAAGGTGTCCATATGAGCATGCACTCTATGTCAAAGAAGACAAGTATGGCAAATTTCTCATCGTTTCTCTTTACGTTGATGATTTACTTTTTACTGGAAATGATAAATTTTTGTGTGATGATTTTAAGAATTCCATGAAAAATGAATTCGAGATGAGTGATATGGGTCTCATCCATTACTTTCTCGGAATTGAAGTTAATCAAAATGAAGGAGAAATTGTCATTTCACAGCAAAAGTATGCTCATGATTTACTAAAAAAATTTCGGATGGAAAATGCTTCACCTTGCAACACTCCCATGGATGCAAATTTGAAATTGTGCAAGGATGATATTGGAGAAGCAGTCGATCCAAGTTTATATCGAAGCTTAGTTGGAAGCTTAATGTATTTGACAGCAACAAGACCTGATATTTTATTTGCTGTAAGTATGTTAAGCAGATTTATGACAAACCCGAAAAGAAGTCATTGGGAAGCAGGAAAAAGAGTTCTTCGTTATATTCTTGGCACCATTAATTTTGGAATTTATTACAAGAAAGTTTCAGAATCAGTGATGTTTGGTTTTTGTGATAGTGACTGGGGTGGTAATGTGGATGATCATAAAAGTACATCTGGTTATGTTTTTAGTATGGGTTCAGGTGTTTTTTCATGGACTTCAAAGAAACAATCTGTTGTTGCCCTTTCTACAACCGAAGCAGAATATATCTCGTTAGCTGCAGCTGGATGTCAAGCTTTATGGCTTCGGTGGATGTTAAAAGAATTGAAGTGTATTCAAAAATGTGAAACTGTTTTATTTTGTGATAATGGATCTGCCATAGCATTATCAAAGAATCCAGTTTTCCATGGAAGAAGCAAGCATATTAGAATCAAATATCATTTTATCAGAGACTTGGTTAAAGATGGAGAAGTGATAGTAAAATATTGCAAGACTCAAGATCAAGTGGCTGATATTTTTACAAAGGCGCTCAAGTTTGACTTATTTGTTAAATTCAGAGGAAAACTTGGAGTTGCTCAAGTTTAG

mRNA sequence

ATGGCTTCGAATGGTAACATGTTGCAACCCCAACTTCCAAGGTTCAGCGGAAAGAATTTTAATCAATGGAGTATTCAAATGAAAGTGTTATATGGCTCTCAAGAATTGTGGGATATTGTTGAAAGAGGATACACTGAAGTTGAGAATCAGAGTGAGCTCACAAATCAACAACTTGTTGAGTTAAGAGAAAATCGTAAGAAAGACAAAAAGGCTTTATTCTTCATTTATCAAGCTGTTGATGAATTTATTTTCGAGAGAATTTCAACAGCTACTTCTGCAAAGGCGGCTTGGGATATTCTAAGATCTACCTATCAAGGAGAAGATAAGGTAAAGATGATAAGGTTACAAGCTCTCAGATCCGAATTTGATTGCATTAAAATGAAAGAAACTGAAACTATTGAAGAATTTTTCAATCGTATTCTTGTAATTGTAAATAGTTTAAGATCAAATGGTGAAGAAGTAGGCGATCAAAGAGTTGTTGAAAAGATTCTTAGAAGTATGCCAAGAAAATTTGAGCATATCGTCGTTGCAATTGAAGAATCGAAAGACTTATCTACGTTGTCTATAAATAGCTTGATGGGTTCTCTTCAATCCCATGAGCTAAGATTAAAACAATTTGATGTTAACCCCGAGGAAGCTTTTCAAATGCAAACTTCATTTAGAGGCGGTTCACGTGGAAGGCGTGGTGGTCATGGAAGACGAGGAGGTGGAAGAAACTATGATAATAGAAGCGGTGCAAATTCTGAAAATTCACAAGAAAGCTCTTCTTTATCTCGAGGAAGAGGAAGCGGAAGAGGAAGAGGCTTTGGCAGAAACCAAGGAGGTGGTCGTGGTAATTTTCTCAAATTCAATGCTTTAATTGCGGAAACAAATTGTTGGGCACTAAAAAATGGAGTTGGAAATACCACCATGAATATGCATAAAGAACAAAAGAAAATTGATGAAGGCATTCTATTCCTCGCATGTAGTGTTCAAGACAATGTTGTAGAGCCTACATGGTATCTTGATAGTGGTTGTAGCAACCACATGACAGGAAATAGAAGTATATTTGTTACTTTGGATGAATCTTTCCAAAGTGAAGTGAAGACTGGTGATAATACCAGACTACAAGTCAAAGGCCAAGGTGATATTCTTGTGAAGACAAAGAAAGGGACAAAACGAGTTACAAATGTGTTCTATGTTCCAGGTCTAAAGCATAATCTTTTGAGTATTGGCCAACTGCTTCAACGAGGTTTAAAAGTTTCATTTGAAGGTGACATATGTGCAATCAAAGATCAGGCCGGTGTTCTTATTGCCAAGGTAAAAATGACTGCTAATAAGATGTTTCCTCTTAACTTTACATATGGTCAAATATCTTGCTTCAGCAGCATATTGAAGGATCCATCCTGGCTTTGGCATTTTAGATATGGTCACTTAAACTTCAAATCACTATCTTATTTGTGCAAAAATCATATGGTGAGAGGTATACAAAATATCAACCATGAGACAAATATTTGTGAAGTGTGTATTCTTGCAAAACATCATCGAGATTCATTTCCAACTGGGAAAGCTTGGAGAGCCTCTAAACCTCTCGAGTTGATTCATACAGATTTGTGTGGTCCTATGCGAACAACAACAAATGGAGGTAATCGATATTTCATAACCTTCATCGATGATTTCAGTAGAAAGTTGTGGATTTATTTTTTGAAAGAAAAGAGTGAAGCACTTGTATGTTTTAAATCCTTCAAAGCTTTTACTGAAAATCAAAGTGGTTACAAGATAAAAACTTTGAGATCTGACCGTGGTGGAGAATATATAGCTTTTGGTAATTTTTTCAAGGAGCAAGGAATTCATCATCAAATGACAGCTCGAATGACTCCACAGCAAAATGGAGTTGCAGAGAGAAAAAATAGAACAATCATGGAAATGGCGAGAAGTATGCTAAAAGCAAAAAATCTGCCAAACGAATTTTGGGGAGATGCTGTTGCATGTACTGTTTACATTCTAAATCGAGCTCCAACAAAGAGTGTTCCAGGTATGACTCCTTATGAAGCATGGTGTGGTGAGAAACCATCTGTTAGTCATTTGAGAGTGTTTGGGAGTATAGCTTATTCTCATATTCCAAATCAGCTAAGAGGCAAGCTTGATGATAAATCTGAAAAATGCATTATGGTAGGTTATAGTGAAAATTCTAAAGCTTATCGATTGTATAATCCTGTGTCAAGAAAAATTATTATCAGCAGAGATGTGATTTTCAGTGAAGATGAATCATGGAACTGGAATGACGACGTTGATGAAGCTAAAAGTCCATTTCATGTTAATATTGATGAAAATGAAGTTGCTCAAGAATTAGAGCAAGCAGAAATTCAAGCGATGGAGTCATCTTCGTCCTCAACGTCATCTTCCACAAGTAATGATGAAATCTCACCAAGGAGAATGAGGAGTATTCAAGAAATTTATAATACCACTAACAGGATTAATGATGATCATTTTGCTAATTTTGCATTATTTGCTGGTGTTGATCCTGTAACTTTTGATGAAGCCATCCAAGATGAGAAATGGAAGATTGCAATGGATCAAGAGATTGATGCGATAAGAAGAAATGAAACATGGGAGTTGATGGAGCTTCCGACAAACAAACAAGCTCTTGGAGTAAAATGGGTGTACAGAACAAAGTTGAAGTCAGATGGTAATGTTGAAAAATACAAGGCAAGACTTGTTGTAAAAGGCTACAAGCAGGAATATGGTGTGGATTATGAAGAAATATTTGCCCCTGTGACAAGAATTGAGACCATTCGATTGATTTTGTCATTAGCTGCTCAAAATGGATGGAAAGTTTATCAAATGGATGTAAAATCCGCTTTTTTGAATGGACACTTGAAGGAAGAGATATTTGTTGCACAACCTTTGGGCTATGTGCAAAGGGGAGAAGAAGAAAAAGTGTACAAGTTGAAAAAAGCTTTGTATGGATTGAAGCAAGCTCCGCGAGCTTGGTACAGTCGTATTGACAGTTTTTTTCTAAAGACAGGATTTCGAAGGTGTCCATATGAGCATGCACTCTATGTCAAAGAAGACAAGTATGGCAAATTTCTCATCGTTTCTCTTTACGTTGATGATTTACTTTTTACTGGAAATGATAAATTTTTGTGTGATGATTTTAAGAATTCCATGAAAAATGAATTCGAGATGAGTGATATGGGTCTCATCCATTACTTTCTCGGAATTGAAGTTAATCAAAATGAAGGAGAAATTGTCATTTCACAGCAAAAGTATGCTCATGATTTACTAAAAAAATTTCGGATGGAAAATGCTTCACCTTGCAACACTCCCATGGATGCAAATTTGAAATTGTGCAAGGATGATATTGGAGAAGCAGTCGATCCAAGTTTATATCGAAGCTTAGTTGGAAGCTTAATGTATTTGACAGCAACAAGACCTGATATTTTATTTGCTGTAAGTATGTTAAGCAGATTTATGACAAACCCGAAAAGAAGTCATTGGGAAGCAGGAAAAAGAGTTCTTCGTTATATTCTTGGCACCATTAATTTTGGAATTTATTACAAGAAAGTTTCAGAATCAGTGATGTTTGGTTTTTGTGATAGTGACTGGGGTGGTAATGTGGATGATCATAAAAGTACATCTGGTTATGTTTTTAGTATGGGTTCAGGTGTTTTTTCATGGACTTCAAAGAAACAATCTGTTGTTGCCCTTTCTACAACCGAAGCAGAATATATCTCGTTAGCTGCAGCTGGATGTCAAGCTTTATGGCTTCGGTGGATGTTAAAAGAATTGAAGTGTATTCAAAAATGTGAAACTGTTTTATTTTGTGATAATGGATCTGCCATAGCATTATCAAAGAATCCAGTTTTCCATGGAAGAAGCAAGCATATTAGAATCAAATATCATTTTATCAGAGACTTGGTTAAAGATGGAGAAGTGATAGTAAAATATTGCAAGACTCAAGATCAAGTGGCTGATATTTTTACAAAGGCGCTCAAGTTTGACTTATTTGTTAAATTCAGAGGAAAACTTGGAGTTGCTCAAGTTTAG

Coding sequence (CDS)

ATGGCTTCGAATGGTAACATGTTGCAACCCCAACTTCCAAGGTTCAGCGGAAAGAATTTTAATCAATGGAGTATTCAAATGAAAGTGTTATATGGCTCTCAAGAATTGTGGGATATTGTTGAAAGAGGATACACTGAAGTTGAGAATCAGAGTGAGCTCACAAATCAACAACTTGTTGAGTTAAGAGAAAATCGTAAGAAAGACAAAAAGGCTTTATTCTTCATTTATCAAGCTGTTGATGAATTTATTTTCGAGAGAATTTCAACAGCTACTTCTGCAAAGGCGGCTTGGGATATTCTAAGATCTACCTATCAAGGAGAAGATAAGGTAAAGATGATAAGGTTACAAGCTCTCAGATCCGAATTTGATTGCATTAAAATGAAAGAAACTGAAACTATTGAAGAATTTTTCAATCGTATTCTTGTAATTGTAAATAGTTTAAGATCAAATGGTGAAGAAGTAGGCGATCAAAGAGTTGTTGAAAAGATTCTTAGAAGTATGCCAAGAAAATTTGAGCATATCGTCGTTGCAATTGAAGAATCGAAAGACTTATCTACGTTGTCTATAAATAGCTTGATGGGTTCTCTTCAATCCCATGAGCTAAGATTAAAACAATTTGATGTTAACCCCGAGGAAGCTTTTCAAATGCAAACTTCATTTAGAGGCGGTTCACGTGGAAGGCGTGGTGGTCATGGAAGACGAGGAGGTGGAAGAAACTATGATAATAGAAGCGGTGCAAATTCTGAAAATTCACAAGAAAGCTCTTCTTTATCTCGAGGAAGAGGAAGCGGAAGAGGAAGAGGCTTTGGCAGAAACCAAGGAGGTGGTCGTGGTAATTTTCTCAAATTCAATGCTTTAATTGCGGAAACAAATTGTTGGGCACTAAAAAATGGAGTTGGAAATACCACCATGAATATGCATAAAGAACAAAAGAAAATTGATGAAGGCATTCTATTCCTCGCATGTAGTGTTCAAGACAATGTTGTAGAGCCTACATGGTATCTTGATAGTGGTTGTAGCAACCACATGACAGGAAATAGAAGTATATTTGTTACTTTGGATGAATCTTTCCAAAGTGAAGTGAAGACTGGTGATAATACCAGACTACAAGTCAAAGGCCAAGGTGATATTCTTGTGAAGACAAAGAAAGGGACAAAACGAGTTACAAATGTGTTCTATGTTCCAGGTCTAAAGCATAATCTTTTGAGTATTGGCCAACTGCTTCAACGAGGTTTAAAAGTTTCATTTGAAGGTGACATATGTGCAATCAAAGATCAGGCCGGTGTTCTTATTGCCAAGGTAAAAATGACTGCTAATAAGATGTTTCCTCTTAACTTTACATATGGTCAAATATCTTGCTTCAGCAGCATATTGAAGGATCCATCCTGGCTTTGGCATTTTAGATATGGTCACTTAAACTTCAAATCACTATCTTATTTGTGCAAAAATCATATGGTGAGAGGTATACAAAATATCAACCATGAGACAAATATTTGTGAAGTGTGTATTCTTGCAAAACATCATCGAGATTCATTTCCAACTGGGAAAGCTTGGAGAGCCTCTAAACCTCTCGAGTTGATTCATACAGATTTGTGTGGTCCTATGCGAACAACAACAAATGGAGGTAATCGATATTTCATAACCTTCATCGATGATTTCAGTAGAAAGTTGTGGATTTATTTTTTGAAAGAAAAGAGTGAAGCACTTGTATGTTTTAAATCCTTCAAAGCTTTTACTGAAAATCAAAGTGGTTACAAGATAAAAACTTTGAGATCTGACCGTGGTGGAGAATATATAGCTTTTGGTAATTTTTTCAAGGAGCAAGGAATTCATCATCAAATGACAGCTCGAATGACTCCACAGCAAAATGGAGTTGCAGAGAGAAAAAATAGAACAATCATGGAAATGGCGAGAAGTATGCTAAAAGCAAAAAATCTGCCAAACGAATTTTGGGGAGATGCTGTTGCATGTACTGTTTACATTCTAAATCGAGCTCCAACAAAGAGTGTTCCAGGTATGACTCCTTATGAAGCATGGTGTGGTGAGAAACCATCTGTTAGTCATTTGAGAGTGTTTGGGAGTATAGCTTATTCTCATATTCCAAATCAGCTAAGAGGCAAGCTTGATGATAAATCTGAAAAATGCATTATGGTAGGTTATAGTGAAAATTCTAAAGCTTATCGATTGTATAATCCTGTGTCAAGAAAAATTATTATCAGCAGAGATGTGATTTTCAGTGAAGATGAATCATGGAACTGGAATGACGACGTTGATGAAGCTAAAAGTCCATTTCATGTTAATATTGATGAAAATGAAGTTGCTCAAGAATTAGAGCAAGCAGAAATTCAAGCGATGGAGTCATCTTCGTCCTCAACGTCATCTTCCACAAGTAATGATGAAATCTCACCAAGGAGAATGAGGAGTATTCAAGAAATTTATAATACCACTAACAGGATTAATGATGATCATTTTGCTAATTTTGCATTATTTGCTGGTGTTGATCCTGTAACTTTTGATGAAGCCATCCAAGATGAGAAATGGAAGATTGCAATGGATCAAGAGATTGATGCGATAAGAAGAAATGAAACATGGGAGTTGATGGAGCTTCCGACAAACAAACAAGCTCTTGGAGTAAAATGGGTGTACAGAACAAAGTTGAAGTCAGATGGTAATGTTGAAAAATACAAGGCAAGACTTGTTGTAAAAGGCTACAAGCAGGAATATGGTGTGGATTATGAAGAAATATTTGCCCCTGTGACAAGAATTGAGACCATTCGATTGATTTTGTCATTAGCTGCTCAAAATGGATGGAAAGTTTATCAAATGGATGTAAAATCCGCTTTTTTGAATGGACACTTGAAGGAAGAGATATTTGTTGCACAACCTTTGGGCTATGTGCAAAGGGGAGAAGAAGAAAAAGTGTACAAGTTGAAAAAAGCTTTGTATGGATTGAAGCAAGCTCCGCGAGCTTGGTACAGTCGTATTGACAGTTTTTTTCTAAAGACAGGATTTCGAAGGTGTCCATATGAGCATGCACTCTATGTCAAAGAAGACAAGTATGGCAAATTTCTCATCGTTTCTCTTTACGTTGATGATTTACTTTTTACTGGAAATGATAAATTTTTGTGTGATGATTTTAAGAATTCCATGAAAAATGAATTCGAGATGAGTGATATGGGTCTCATCCATTACTTTCTCGGAATTGAAGTTAATCAAAATGAAGGAGAAATTGTCATTTCACAGCAAAAGTATGCTCATGATTTACTAAAAAAATTTCGGATGGAAAATGCTTCACCTTGCAACACTCCCATGGATGCAAATTTGAAATTGTGCAAGGATGATATTGGAGAAGCAGTCGATCCAAGTTTATATCGAAGCTTAGTTGGAAGCTTAATGTATTTGACAGCAACAAGACCTGATATTTTATTTGCTGTAAGTATGTTAAGCAGATTTATGACAAACCCGAAAAGAAGTCATTGGGAAGCAGGAAAAAGAGTTCTTCGTTATATTCTTGGCACCATTAATTTTGGAATTTATTACAAGAAAGTTTCAGAATCAGTGATGTTTGGTTTTTGTGATAGTGACTGGGGTGGTAATGTGGATGATCATAAAAGTACATCTGGTTATGTTTTTAGTATGGGTTCAGGTGTTTTTTCATGGACTTCAAAGAAACAATCTGTTGTTGCCCTTTCTACAACCGAAGCAGAATATATCTCGTTAGCTGCAGCTGGATGTCAAGCTTTATGGCTTCGGTGGATGTTAAAAGAATTGAAGTGTATTCAAAAATGTGAAACTGTTTTATTTTGTGATAATGGATCTGCCATAGCATTATCAAAGAATCCAGTTTTCCATGGAAGAAGCAAGCATATTAGAATCAAATATCATTTTATCAGAGACTTGGTTAAAGATGGAGAAGTGATAGTAAAATATTGCAAGACTCAAGATCAAGTGGCTGATATTTTTACAAAGGCGCTCAAGTTTGACTTATTTGTTAAATTCAGAGGAAAACTTGGAGTTGCTCAAGTTTAG

Protein sequence

MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRSEFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEESKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNYDNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFLKFNALIAETNCWALKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLDESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKVSFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNFKSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGPMRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDRGGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDAVACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEKCIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVAQELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEKYKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYLTATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWMLKELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQDQVADIFTKALKFDLFVKFRGKLGVAQV
Homology
BLAST of IVF0014015 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 677.2 bits (1746), Expect = 3.9e-193
Identity = 448/1370 (32.70%), Postives = 714/1370 (52.12%), Query Frame = 0

Query: 11   QLPRFSGKN-FNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKDK 70
            ++ +F+G N F+ W  +M+ L   Q L  +++    + +    +  +   +L      D+
Sbjct: 7    EVAKFNGDNGFSTWQRRMRDLLIQQGLHKVLD---VDSKKPDTMKAEDWADL------DE 66

Query: 71   KALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRSEFDCIKMKE 130
            +A   I   + + +   I    +A+  W  L S Y  +     +    L+ +   + M E
Sbjct: 67   RAASAIRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKL---YLKKQLYALHMSE 126

Query: 131  TETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEESKDLSTLSI 190
                    N    ++  L + G ++ ++     +L S+P  ++++   I   K  +T+ +
Sbjct: 127  GTNFLSHLNVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGK--TTIEL 186

Query: 191  NSLMGSLQSHELRLKQFDVNPEEAFQ-MQTSFRGGSRGR-RGGHGRRGGGRNYDNRSGAN 250
              +  +L  +E   K+    PE   Q + T  RG S  R    +GR G      NRS + 
Sbjct: 187  KDVTSALLLNEKMRKK----PENQGQALITEGRGRSYQRSSNNYGRSGARGKSKNRSKSR 246

Query: 251  SENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFLKFNALIAETNCWALKNGVGNTTMNMH 310
              N    +     +        G+ +  G+ N         + N  A+     N  + ++
Sbjct: 247  VRNCYNCNQPGHFKRDCPNPRKGKGETSGQKN---------DDNTAAMVQNNDNVVLFIN 306

Query: 311  KEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLDESFQSEVKTGDNT 370
            +E++            +  +  E  W +D+  S+H T  R +F          VK G+ +
Sbjct: 307  EEEE-----------CMHLSGPESEWVVDTAASHHATPVRDLFCRYVAGDFGTVKMGNTS 366

Query: 371  RLQVKGQGDILVKTKKG-TKRVTNVFYVPGLKHNLLSIGQLLQRGLKVSFEGDICAIKDQ 430
              ++ G GDI +KT  G T  + +V +VP L+ NL+S   L + G +  F      +   
Sbjct: 367  YSKIAGIGDICIKTNVGCTLVLKDVRHVPDLRMNLISGIALDRDGYESYFANQKWRL--T 426

Query: 431  AGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNFKSLSYLCKNHMV 490
             G L+    +    ++  N    Q    ++  +    LWH R GH++ K L  L K  ++
Sbjct: 427  KGSLVIAKGVARGTLYRTNAEICQGELNAAQDEISVDLWHKRMGHMSEKGLQILAKKSLI 486

Query: 491  RGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGPMRTTTNGGNRYF 550
               +        C+ C+  K HR SF T    R    L+L+++D+CGPM   + GGN+YF
Sbjct: 487  SYAKGTT--VKPCDYCLFGKQHRVSFQTSSE-RKLNILDLVYSDVCGPMEIESMGGNKYF 546

Query: 551  ITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDRGGEYIA--FGNF 610
            +TFIDD SRKLW+Y LK K +    F+ F A  E ++G K+K LRSD GGEY +  F  +
Sbjct: 547  VTFIDDASRKLWVYILKTKDQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEY 606

Query: 611  FKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDAVACTVYILNR 670
                GI H+ T   TPQ NGVAER NRTI+E  RSML+   LP  FWG+AV    Y++NR
Sbjct: 607  CSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINR 666

Query: 671  APTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEKCIMVGYSENS 730
            +P+  +    P   W  ++ S SHL+VFG  A++H+P + R KLDDKS  CI +GY +  
Sbjct: 667  SPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEE 726

Query: 731  KAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVAQELEQAEIQA 790
              YRL++PV +K+I SRDV+F E E     D  ++ K+    N              +  
Sbjct: 727  FGYRLWDPVKKKVIRSRDVVFRESEVRTAADMSEKVKNGIIPNF-------------VTI 786

Query: 791  MESSSSSTSSSTSNDEISPRRMR-------------SIQEIYNTTN-------------- 850
              +S++ TS+ ++ DE+S +  +              ++E+ + T               
Sbjct: 787  PSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQLDEGVEEVEHPTQGEEQHQPLRRSERP 846

Query: 851  RINDDHF--ANFALFA-GVDPVTFDEAI-QDEKWKI--AMDQEIDAIRRNETWELMELPT 910
            R+    +    + L +   +P +  E +   EK ++  AM +E++++++N T++L+ELP 
Sbjct: 847  RVESRRYPSTEYVLISDDREPESLKEVLSHPEKNQLMKAMQEEMESLQKNGTYKLVELPK 906

Query: 911  NKQALGVKWVYRTKLKSDGNVEKYKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSL 970
             K+ L  KWV++ K   D  + +YKARLVVKG++Q+ G+D++EIF+PV ++ +IR ILSL
Sbjct: 907  GKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTILSL 966

Query: 971  AAQNGWKVYQMDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAW 1030
            AA    +V Q+DVK+AFL+G L+EEI++ QP G+   G++  V KL K+LYGLKQAPR W
Sbjct: 967  AASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQW 1026

Query: 1031 YSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMK 1090
            Y + DSF     + +   +  +Y K      F+I+ LYVDD+L  G DK L    K  + 
Sbjct: 1027 YMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLKGDLS 1086

Query: 1091 NEFEMSDMGLIHYFLGIEV--NQNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLK 1150
              F+M D+G     LG+++   +   ++ +SQ+KY   +L++F M+NA P +TP+  +LK
Sbjct: 1087 KSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGHLK 1146

Query: 1151 LCKDDIGEAVDPS------LYRSLVGSLMY-LTATRPDILFAVSMLSRFMTNPKRSHWEA 1210
            L K      V+         Y S VGSLMY +  TRPDI  AV ++SRF+ NP + HWEA
Sbjct: 1147 LSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHWEA 1206

Query: 1211 GKRVLRYILGTINFGIYYKKVSESVMFGFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTS 1270
             K +LRY+ GT    + +   S+ ++ G+ D+D  G++D+ KS++GY+F+   G  SW S
Sbjct: 1207 VKWILRYLRGTTGDCLCFGG-SDPILKGYTDADMAGDIDNRKSSTGYLFTFSGGAISWQS 1266

Query: 1271 KKQSVVALSTTEAEYISLAAAGCQALWLRWMLKELKCIQKCETVLFCDNGSAIALSKNPV 1330
            K Q  VALSTTEAEYI+    G + +WL+  L+EL   QK E V++CD+ SAI LSKN +
Sbjct: 1267 KLQKCVALSTTEAEYIAATETGKEMIWLKRFLQELGLHQK-EYVVYCDSQSAIDLSKNSM 1318

BLAST of IVF0014015 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 590.1 bits (1520), Expect = 6.2e-167
Identity = 433/1472 (29.42%), Postives = 707/1472 (48.03%), Query Frame = 0

Query: 6    NMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYT--EVENQSELTNQQLVELRE 65
            N+    + + +  N+  WS Q+  L+   EL   ++   T       ++   +   +   
Sbjct: 17   NVNMSNVTKLTSTNYLMWSRQVHALFDGYELAGFLDGSTTMPPATIGTDAAPRVNPDYTR 76

Query: 66   NRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRSEFD 125
             +++DK     +  A+   +   +S AT+A   W+ LR  Y       + +L+    ++ 
Sbjct: 77   WKRQDKLIYSAVLGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQLRTQLKQW- 136

Query: 126  CIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEESKD 185
                K T+TI+++   ++   + L   G+ +     VE++L ++P +++ ++  I     
Sbjct: 137  ---TKGTKTIDDYMQGLVTRFDQLALLGKPMDHDEQVERVLENLPEEYKPVIDQIAAKDT 196

Query: 186  LSTLS-INSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNYDN 245
              TL+ I+  + + +S  L +    V P  A     S R  +      +G R     YDN
Sbjct: 197  PPTLTEIHERLLNHESKILAVSSATVIPITA--NAVSHRNTTTTNNNNNGNR--NNRYDN 256

Query: 246  RSGANSENSQESSSLSRGRGSGRGRGF-GRNQ-GGGRGNFLKFNALIAETNCWALKNGVG 305
            R+  N+    + SS +    + + + + G+ Q  G +G+  K         C  L++ + 
Sbjct: 257  RNNNNNSKPWQQSSTNFHPNNNQSKPYLGKCQICGVQGHSAK--------RCSQLQHFL- 316

Query: 306  NTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGN-RSIFVTLDESFQS 365
             +++N  +              ++        W LDSG ++H+T +  ++ +    +   
Sbjct: 317  -SSVNSQQPPSPFTPWQPRANLALGSPYSSNNWLLDSGATHHITSDFNNLSLHQPYTGGD 376

Query: 366  EVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQ-RGLKVSFEG 425
            +V   D + + +   G   + TK     + N+ YVP +  NL+S+ +L    G+ V F  
Sbjct: 377  DVMVADGSTIPISHTGSTSLSTKSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFP 436

Query: 426  DICAIKD-QAGVLIAKVKMTANKMFPLNFTYGQ-ISCFSS-ILKDPSWLWHFRYGHLNFK 485
                +KD   GV + + K T ++++       Q +S F+S   K     WH R GH    
Sbjct: 437  ASFQVKDLNTGVPLLQGK-TKDELYEWPIASSQPVSLFASPSSKATHSSWHARLGHPAPS 496

Query: 486  SLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDL-CGP 545
             L+ +  N+ +  + N +H+   C  C++ K ++  F +     +++PLE I++D+   P
Sbjct: 497  ILNSVISNYSL-SVLNPSHKFLSCSDCLINKSNKVPF-SQSTINSTRPLEYIYSDVWSSP 556

Query: 546  MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR 605
            + +  N   RY++ F+D F+R  W+Y LK+KS+    F +FK   EN+   +I T  SD 
Sbjct: 557  ILSHDN--YRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTRIGTFYSDN 616

Query: 606  GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA 665
            GGE++A   +F + GI H  +   TP+ NG++ERK+R I+E   ++L   ++P  +W  A
Sbjct: 617  GGEFVALWEYFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHASIPKTYWPYA 676

Query: 666  VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK 725
             A  VY++NR PT  +   +P++   G  P+   LRVFG   Y  +    + KLDDKS +
Sbjct: 677  FAVAVYLINRLPTPLLQLESPFQKLFGTSPNYDKLRVFGCACYPWLRPYNQHKLDDKSRQ 736

Query: 726  CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSE--------------------DESWNWN 785
            C+ +GYS    AY   +  + ++ ISR V F E                    + S  W+
Sbjct: 737  CVFLGYSLTQSAYLCLHLQTSRLYISRHVRFDENCFPFSNYLATLSPVQEQRRESSCVWS 796

Query: 786  D-----------DVDEAKSPFHV-------------------NIDE-------------- 845
                               P H                    N+D               
Sbjct: 797  PHTTLPTRTPVLPAPSCSDPHHAATPPSSPSAPFRNSQVSSSNLDSSFSSSFPSSPEPTA 856

Query: 846  -------------------------------NEVAQELEQAEIQAMESSSSS----TSSS 905
                                           NE   +L Q+     +SSSSS    TS+S
Sbjct: 857  PRQNGPQPTTQPTQTQTQTHSSQNTSQNNPTNESPSQLAQSLSTPAQSSSSSPSPTTSAS 916

Query: 906  TSNDEISPRRM-----RSIQEIYNTTNRI------------------NDDHFANFALFAG 965
            +S+   +P  +       + +I N  N+                   N  +    +L A 
Sbjct: 917  SSSTSPTPPSILIHPPPPLAQIVNNNNQAPLNTHSMGTRAKAGIIKPNPKYSLAVSLAAE 976

Query: 966  VDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELM-ELPTNKQALGVKWVYRTKLKSDGN 1025
             +P T  +A++DE+W+ AM  EI+A   N TW+L+   P++   +G +W++  K  SDG+
Sbjct: 977  SEPRTAIQALKDERWRNAMGSEINAQIGNHTWDLVPPPPSHVTIVGCRWIFTKKYNSDGS 1036

Query: 1026 VEKYKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNG 1085
            + +YKARLV KGY Q  G+DY E F+PV +  +IR++L +A    W + Q+DV +AFL G
Sbjct: 1037 LNRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQG 1096

Query: 1086 HLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEH 1145
             L ++++++QP G++ +     V KL+KALYGLKQAPRAWY  + ++ L  GF     + 
Sbjct: 1097 TLTDDVYMSQPPGFIDKDRPNYVCKLRKALYGLKQAPRAWYVELRNYLLTIGFVNSVSDT 1156

Query: 1146 ALYVKEDKYGKFLIVSL-YVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEV 1205
            +L+V +   GK ++  L YVDD+L TGND  L  +  +++   F + D   +HYFLGIE 
Sbjct: 1157 SLFVLQ--RGKSIVYMLVYVDDILITGNDPTLLHNTLDNLSQRFSVKDHEELHYFLGIEA 1216

Query: 1206 NQNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGS 1265
             +    + +SQ++Y  DLL +  M  A P  TPM  + KL      +  DP+ YR +VGS
Sbjct: 1217 KRVPTGLHLSQRRYILDLLARTNMITAKPVTTPMAPSPKLSLYSGTKLTDPTEYRGIVGS 1276

Query: 1266 LMYLTATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFG 1325
            L YL  TRPDI +AV+ LS+FM  P   H +A KR+LRY+ GT N GI+ KK +   +  
Sbjct: 1277 LQYLAFTRPDISYAVNRLSQFMHMPTEEHLQALKRILRYLAGTPNHGIFLKKGNTLSLHA 1336

Query: 1326 FCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWL 1343
            + D+DW G+ DD+ ST+GY+  +G    SW+SKKQ  V  S+TEAEY S+A    +  W+
Sbjct: 1337 YSDADWAGDKDDYVSTNGYIVYLGHHPISWSSKKQKGVVRSSTEAEYRSVANTSSEMQWI 1396

BLAST of IVF0014015 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 562.8 bits (1449), Expect = 1.1e-158
Identity = 427/1457 (29.31%), Postives = 683/1457 (46.88%), Query Frame = 0

Query: 15   FSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVELRENRKKD--KKAL 74
            F G+ +  W  +++ L   Q++  +V+                   L  N   D  KKA 
Sbjct: 11   FDGEKYAIWKFRIRALLAEQDVLKVVD------------------GLMPNEVDDSWKKAE 70

Query: 75   FFIYQAVDEFIFER-ISTATSAKAAWDILRSTYQGEDKVKMIRLQALRSEFDCIKMKETE 134
                  + E++ +  ++ ATS   A  IL +     ++  +    ALR     +K+    
Sbjct: 71   RCAKSTIIEYLSDSFLNFATSDITARQILENLDAVYERKSLASQLALRKRLLSLKLSSEM 130

Query: 135  TIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEESKDLSTLSINS 194
            ++   F+    +++ L + G ++ +   +  +L ++P  ++ I+ AIE   +   L++  
Sbjct: 131  SLLSHFHIFDELISELLAAGAKIEEMDKISHLLITLPSCYDGIITAIETLSE-ENLTLAF 190

Query: 195  LMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNYDNRSGANSENS 254
            +   L   E+++K  D N      M       +               Y N    N    
Sbjct: 191  VKNRLLDQEIKIKN-DHNDTSKKVMNAIVHNNN-------------NTYKNNLFKNRVTK 250

Query: 255  QESSSLSRGRGSGRGRGFGRNQGGGRGNFLKFNALIAETNCWALKNGVGNTTMNMHKE-Q 314
             +       +   +    GR      G+  K        +C+  K  + N      K+ Q
Sbjct: 251  PKKIFKGNSKYKVKCHHCGR-----EGHIKK--------DCFHYKRILNNKNKENEKQVQ 310

Query: 315  KKIDEGILFLACSVQDNVVEPT--WYLDSGCSNHMTGNRSIFVTLDESFQSEVKTGDNTR 374
                 GI F+   V +  V     + LDSG S+H+  + S++          V+     +
Sbjct: 311  TATSHGIAFMVKEVNNTSVMDNCGFVLDSGASDHLINDESLYT-------DSVEVVPPLK 370

Query: 375  LQVKGQGDILVKTKKGTKRVTN--------VFYVPGLKHNLLSIGQLLQRGLKVSFEGDI 434
            + V  QG+ +  TK+G  R+ N        V +      NL+S+ +L + G+ + F+   
Sbjct: 371  IAVAKQGEFIYATKRGIVRLRNDHEITLEDVLFCKEAAGNLMSVKRLQEAGMSIEFDKSG 430

Query: 435  CAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNFKSLSYL 494
              I  + G+++ K     N +  +NF   Q    ++  K+   LWH R+GH++   L  +
Sbjct: 431  VTI-SKNGLMVVKNSGMLNNVPVINF---QAYSINAKHKNNFRLWHERFGHISDGKLLEI 490

Query: 495  CKNHMVRG---IQNINHETNICEVCILAKHHRDSFPTGK-AWRASKPLELIHTDLCGPMR 554
             + +M      + N+     ICE C+  K  R  F   K      +PL ++H+D+CGP+ 
Sbjct: 491  KRKNMFSDQSLLNNLELSCEICEPCLNGKQARLPFKQLKDKTHIKRPLFVVHSDVCGPIT 550

Query: 555  TTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDRGG 614
              T     YF+ F+D F+     Y +K KS+    F+ F A +E     K+  L  D G 
Sbjct: 551  PVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGR 610

Query: 615  EYIA--FGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA 674
            EY++     F  ++GI + +T   TPQ NGV+ER  RTI E AR+M+    L   FWG+A
Sbjct: 611  EYLSNEMRQFCVKKGISYHLTVPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEA 670

Query: 675  VACTVYILNRAPTKSV--PGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKS 734
            V    Y++NR P++++     TPYE W  +KP + HLRVFG+  Y HI N+ +GK DDKS
Sbjct: 671  VLTATYLINRIPSRALVDSSKTPYEMWHNKKPYLKHLRVFGATVYVHIKNK-QGKFDDKS 730

Query: 735  EKCIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWN---------WNDDVDEAKSP 794
             K I VGY  N   ++L++ V+ K I++RDV+  E    N         +  D  E+++ 
Sbjct: 731  FKSIFVGYEPN--GFKLWDAVNEKFIVARDVVVDETNMVNSRAVKFETVFLKDSKESENK 790

Query: 795  FHVN----IDENEVAQELEQAE-IQAMESSSSSTSSSTSNDEIS------PRRMRSIQEI 854
               N    I + E   E ++ + IQ ++ S  S + +  ND         P   +    I
Sbjct: 791  NFPNDSRKIIQTEFPNESKECDNIQFLKDSKESENKNFPNDSRKIIQTEFPNESKECDNI 850

Query: 855  Y--------------NTTNRINDDHF-----------------ANFALFAGVD------- 914
                            +  R  DDH                  A      G+D       
Sbjct: 851  QFLKDSKESNKYFLNESKKRKRDDHLNESKGSGNPNESRESETAEHLKEIGIDNPTKNDG 910

Query: 915  ----------------------------------------PVTFDE-AIQDEK--WKIAM 974
                                                    P +FDE   +D+K  W+ A+
Sbjct: 911  IEIINRRSERLKTKPQISYNEEDNSLNKVVLNAHTIFNDVPNSFDEIQYRDDKSSWEEAI 970

Query: 975  DQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEKYKARLVVKGYKQEYGVD 1034
            + E++A + N TW + + P NK  +  +WV+  K    GN  +YKARLV +G+ Q+Y +D
Sbjct: 971  NTELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQID 1030

Query: 1035 YEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLKEEIFVAQPLGYVQRGEE 1094
            YEE FAPV RI + R ILSL  Q   KV+QMDVK+AFLNG LKEEI++  P G       
Sbjct: 1031 YEETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGI--SCNS 1090

Query: 1095 EKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALYV-KEDKYGKFLIVSLYV 1154
            + V KL KA+YGLKQA R W+   +    +  F     +  +Y+  +    + + V LYV
Sbjct: 1091 DNVCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIYILDKGNINENIYVLLYV 1150

Query: 1155 DDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNEGEIVISQQKYAHDLLK 1214
            DD++    D    ++FK  +  +F M+D+  I +F+GI +   E +I +SQ  Y   +L 
Sbjct: 1151 DDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQSAYVKKILS 1210

Query: 1215 KFRMENASPCNTPMDA--NLKLCKDDIGEAVDPSLYRSLVGSLMY-LTATRPDILFAVSM 1274
            KF MEN +  +TP+ +  N +L   D  +   P   RSL+G LMY +  TRPD+  AV++
Sbjct: 1211 KFNMENCNAVSTPLPSKINYELLNSD-EDCNTPC--RSLIGCLMYIMLCTRPDLTTAVNI 1270

Query: 1275 LSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKK--VSESVMFGFCDSDWGGNVDDHKS 1334
            LSR+ +      W+  KRVLRY+ GTI+  + +KK    E+ + G+ DSDW G+  D KS
Sbjct: 1271 LSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDWAGSEIDRKS 1330

Query: 1335 TSGYVFSM-GSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWMLKELKCIQKCE 1342
            T+GY+F M    +  W +K+Q+ VA S+TEAEY++L  A  +ALWL+++L  +    +  
Sbjct: 1331 TTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYMALFEAVREALWLKFLLTSINIKLENP 1390

BLAST of IVF0014015 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 560.8 bits (1444), Expect = 4.1e-158
Identity = 431/1477 (29.18%), Postives = 683/1477 (46.24%), Query Frame = 0

Query: 6    NMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQ---SELTNQQLVELR 65
            N+    + + +  N+  WS Q+  L+   EL   ++ G T +      ++   +   +  
Sbjct: 17   NVNMSNVTKLTSTNYLMWSRQVHALFDGYELAGFLD-GSTPMPPATIGTDAVPRVNPDYT 76

Query: 66   ENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRSEF 125
              R++DK     I  A+   +   +S AT+A   W+ LR  Y       + +L       
Sbjct: 77   RWRRQDKLIYSAILGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQL------- 136

Query: 126  DCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEESK 185
                            R +   + L   G+ +     VE++L ++P  ++ ++  I  +K
Sbjct: 137  ----------------RFITRFDQLALLGKPMDHDEQVERVLENLPDDYKPVIDQI-AAK 196

Query: 186  DL--STLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY 245
            D   S   I+  + + +S  L L   +V P  A     + R  +  R      RG  RNY
Sbjct: 197  DTPPSLTEIHERLINRESKLLALNSAEVVPITA--NVVTHRNTNTNR--NQNNRGDNRNY 256

Query: 246  DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQ-GGGRGNFLKFNALIAETNCWALKNGV 305
            +N +   S + Q SSS SR          GR Q    +G+  K         C  L    
Sbjct: 257  NNNNN-RSNSWQPSSSGSRSDNRQPKPYLGRCQICSVQGHSAK--------RCPQLHQ-- 316

Query: 306  GNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGN-RSIFVTLDESFQ 365
              +T N  +              +V        W LDSG ++H+T +  ++      +  
Sbjct: 317  FQSTTNQQQSTSPFTPWQPRANLAVNSPYNANNWLLDSGATHHITSDFNNLSFHQPYTGG 376

Query: 366  SEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRG-LKVSFE 425
             +V   D + + +   G   + T   +  +  V YVP +  NL+S+ +L     + V F 
Sbjct: 377  DDVMIADGSTIPITHTGSASLPTSSRSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVEFF 436

Query: 426  GDICAIKD-QAGVLIAKVKMTANKMFPLNFTYGQ-ISCFSS-ILKDPSWLWHFRYGHLNF 485
                 +KD   GV + + K T ++++       Q +S F+S   K     WH R GH + 
Sbjct: 437  PASFQVKDLNTGVPLLQGK-TKDELYEWPIASSQAVSMFASPCSKATHSSWHSRLGHPSL 496

Query: 486  KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDL-CG 545
              L+ +  NH +  + N +H+   C  C + K H+  F +     +SKPLE I++D+   
Sbjct: 497  AILNSVISNHSL-PVLNPSHKLLSCSDCFINKSHKVPF-SNSTITSSKPLEYIYSDVWSS 556

Query: 546  PMRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSD 605
            P+ +  N   RY++ F+D F+R  W+Y LK+KS+    F  FK+  EN+   +I TL SD
Sbjct: 557  PILSIDN--YRYYVIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENRFQTRIGTLYSD 616

Query: 606  RGGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGD 665
             GGE++   ++  + GI H  +   TP+ NG++ERK+R I+EM  ++L   ++P  +W  
Sbjct: 617  NGGEFVVLRDYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASVPKTYWPY 676

Query: 666  AVACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSE 725
            A +  VY++NR PT  +   +P++   G+ P+   L+VFG   Y  +    R KL+DKS+
Sbjct: 677  AFSVAVYLINRLPTPLLQLQSPFQKLFGQPPNYEKLKVFGCACYPWLRPYNRHKLEDKSK 736

Query: 726  KCIMVGYSENSKAYRLYNPVSRKIIISRDVIFSE--------------------DESWNW 785
            +C  +GYS    AY   +  + ++  SR V F E                    D + NW
Sbjct: 737  QCAFMGYSLTQSAYLCLHIPTGRLYTSRHVQFDERCFPFSTTNFGVSTSQEQRSDSAPNW 796

Query: 786  NDDVDEAKSPF----------HVNIDENEVAQELEQAEIQAMESS--SSSTSSSTSNDEI 845
                    +P           H++      +        Q   S+  SSS SS +S++  
Sbjct: 797  PSHTTLPTTPLVLPAPPCLGPHLDTSPRPPSSPSPLCTTQVSSSNLPSSSISSPSSSEPT 856

Query: 846  SP--------------RRMRSIQEIYNTTN------------------------------ 905
            +P              +   S   I N  N                              
Sbjct: 857  APSHNGPQPTAQPHQTQNSNSNSPILNNPNPNSPSPNSPNQNSPLPQSPISSPHIPTPST 916

Query: 906  --------------------------------------------------RINDDHFANF 965
                                                              + N  +    
Sbjct: 917  SISEPNSPSSSSTSTPPLPPVLPAPPIIQVNAQAPVNTHSMATRAKDGIRKPNQKYSYAT 976

Query: 966  ALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELM-ELPTNKQALGVKWVYRTKL 1025
            +L A  +P T  +A++D++W+ AM  EI+A   N TW+L+   P +   +G +W++  K 
Sbjct: 977  SLAANSEPRTAIQAMKDDRWRQAMGSEINAQIGNHTWDLVPPPPPSVTIVGCRWIFTKKF 1036

Query: 1026 KSDGNVEKYKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKS 1085
             SDG++ +YKARLV KGY Q  G+DY E F+PV +  +IR++L +A    W + Q+DV +
Sbjct: 1037 NSDGSLNRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNN 1096

Query: 1086 AFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRR 1145
            AFL G L +E++++QP G+V +   + V +L+KA+YGLKQAPRAWY  + ++ L  GF  
Sbjct: 1097 AFLQGTLTDEVYMSQPPGFVDKDRPDYVCRLRKAIYGLKQAPRAWYVELRTYLLTVGFVN 1156

Query: 1146 CPYEHALYVKEDKYGKFLIVSL-YVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYF 1205
               + +L+V +   G+ +I  L YVDD+L TGND  L     +++   F + +   +HYF
Sbjct: 1157 SISDTSLFVLQ--RGRSIIYMLVYVDDILITGNDTVLLKHTLDALSQRFSVKEHEDLHYF 1216

Query: 1206 LGIEVNQNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYR 1265
            LGIE  +    + +SQ++Y  DLL +  M  A P  TPM  + KL      +  DP+ YR
Sbjct: 1217 LGIEAKRVPQGLHLSQRRYTLDLLARTNMLTAKPVATPMATSPKLTLHSGTKLPDPTEYR 1276

Query: 1266 SLVGSLMYLTATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSE 1325
             +VGSL YL  TRPD+ +AV+ LS++M  P   HW A KRVLRY+ GT + GI+ KK + 
Sbjct: 1277 GIVGSLQYLAFTRPDLSYAVNRLSQYMHMPTDDHWNALKRVLRYLAGTPDHGIFLKKGNT 1336

Query: 1326 SVMFGFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGC 1343
              +  + D+DW G+ DD+ ST+GY+  +G    SW+SKKQ  V  S+TEAEY S+A    
Sbjct: 1337 LSLHAYSDADWAGDTDDYVSTNGYIVYLGHHPISWSSKKQKGVVRSSTEAEYRSVANTSS 1396

BLAST of IVF0014015 vs. ExPASy Swiss-Prot
Match: P25600 (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY5A PE=5 SV=2)

HSP 1 Score: 179.1 bits (453), Expect = 3.3e-43
Identity = 105/301 (34.88%), Postives = 163/301 (54.15%), Query Frame = 0

Query: 942  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLK 1001
            MDV +AFLN  + E I+V QP G+V     + V++L   +YGLKQAP  W   I++   K
Sbjct: 1    MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 1002 TGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGL 1061
             GF R   EH LY +    G  + +++YVDDLL       + D  K  +   + M D+G 
Sbjct: 61   IGFCRHEGEHGLYFRSTSDGP-IYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGK 120

Query: 1062 IHYFLGIEVNQ-NEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVD 1121
            +  FLG+ ++Q + G+I +S Q Y      +  +       TP+  +  L +       D
Sbjct: 121  VDKFLGLNIHQSSNGDITLSLQDYIAKAASESEINTFKLTQTPLCNSKPLFETTSPHLKD 180

Query: 1122 PSLYRSLVGSLMYLTAT-RPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIY 1181
             + Y+S+VG L++   T RPDI + VS+LSRF+  P+  H E+ +RVLRY+  T +  + 
Sbjct: 181  ITPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTTRSMCLK 240

Query: 1182 YKKVSESVMFGFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKK-QSVVALSTTEAEYI 1240
            Y+  S+  +  +CD+  G   D   ST GYV  +     +W+SKK + V+ + +TEAEYI
Sbjct: 241  YRSGSQLALTVYCDASHGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVPSTEAEYI 300

BLAST of IVF0014015 vs. ExPASy TrEMBL
Match: A0A5D3E3T2 (Integrase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold156G00030 PE=4 SV=1)

HSP 1 Score: 2669.0 bits (6917), Expect = 0.0e+00
Identity = 1332/1348 (98.81%), Postives = 1334/1348 (98.96%), Query Frame = 0

Query: 1    MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE 60
            MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE
Sbjct: 1    MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE 60

Query: 61   LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS 120
            LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS
Sbjct: 61   LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS 120

Query: 121  EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE 180
            EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE
Sbjct: 121  EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE 180

Query: 181  SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY 240
            SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY
Sbjct: 181  SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY 240

Query: 241  DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFLKFNALIA------ETNCWA 300
            DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNF +            + NCWA
Sbjct: 241  DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFSQIQCFNCGKYGHFQANCWA 300

Query: 301  LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD 360
            LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD
Sbjct: 301  LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD 360

Query: 361  ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV 420
            ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV
Sbjct: 361  ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV 420

Query: 421  SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF 480
            SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF
Sbjct: 421  SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF 480

Query: 481  KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP 540
            KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP
Sbjct: 481  KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP 540

Query: 541  MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR 600
            MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR
Sbjct: 541  MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR 600

Query: 601  GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA 660
            GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA
Sbjct: 601  GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA 660

Query: 661  VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK 720
            VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK
Sbjct: 661  VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK 720

Query: 721  CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA 780
            CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA
Sbjct: 721  CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA 780

Query: 781  QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD 840
            QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD
Sbjct: 781  QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD 840

Query: 841  PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK 900
            PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK
Sbjct: 841  PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK 900

Query: 901  YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK 960
            YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK
Sbjct: 901  YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK 960

Query: 961  EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY 1020
            EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY
Sbjct: 961  EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY 1020

Query: 1021 VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE 1080
            VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE
Sbjct: 1021 VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE 1080

Query: 1081 GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL 1140
            GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL
Sbjct: 1081 GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL 1140

Query: 1141 TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS 1200
            TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS
Sbjct: 1141 TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS 1200

Query: 1201 DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML 1260
            DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML
Sbjct: 1201 DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML 1260

Query: 1261 KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ 1320
            KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ
Sbjct: 1261 KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ 1320

Query: 1321 DQVADIFTKALKFDLFVKFRGKLGVAQV 1343
            DQVADIFTKALKFDLFVKFRGKLGVAQV
Sbjct: 1321 DQVADIFTKALKFDLFVKFRGKLGVAQV 1348

BLAST of IVF0014015 vs. ExPASy TrEMBL
Match: A0A5D3CLV1 (Integrase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold132G00180 PE=4 SV=1)

HSP 1 Score: 2667.5 bits (6913), Expect = 0.0e+00
Identity = 1331/1348 (98.74%), Postives = 1334/1348 (98.96%), Query Frame = 0

Query: 1    MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE 60
            MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE
Sbjct: 1    MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE 60

Query: 61   LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS 120
            LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS
Sbjct: 61   LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS 120

Query: 121  EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE 180
            EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE
Sbjct: 121  EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE 180

Query: 181  SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY 240
            SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY
Sbjct: 181  SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY 240

Query: 241  DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFLKFNALIA------ETNCWA 300
            DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNF +            + NCWA
Sbjct: 241  DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFSQIQCFNCGKYGHFQANCWA 300

Query: 301  LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD 360
            LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD
Sbjct: 301  LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD 360

Query: 361  ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV 420
            ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV
Sbjct: 361  ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV 420

Query: 421  SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF 480
            SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF
Sbjct: 421  SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF 480

Query: 481  KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP 540
            KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP
Sbjct: 481  KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP 540

Query: 541  MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR 600
            MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR
Sbjct: 541  MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR 600

Query: 601  GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA 660
            GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA
Sbjct: 601  GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA 660

Query: 661  VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK 720
            VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK
Sbjct: 661  VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK 720

Query: 721  CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA 780
            CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA
Sbjct: 721  CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA 780

Query: 781  QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD 840
            QELEQAEIQA+ESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD
Sbjct: 781  QELEQAEIQAVESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD 840

Query: 841  PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK 900
            PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK
Sbjct: 841  PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK 900

Query: 901  YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK 960
            YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK
Sbjct: 901  YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK 960

Query: 961  EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY 1020
            EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY
Sbjct: 961  EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY 1020

Query: 1021 VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE 1080
            VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE
Sbjct: 1021 VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE 1080

Query: 1081 GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL 1140
            GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL
Sbjct: 1081 GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL 1140

Query: 1141 TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS 1200
            TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS
Sbjct: 1141 TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS 1200

Query: 1201 DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML 1260
            DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML
Sbjct: 1201 DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML 1260

Query: 1261 KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ 1320
            KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ
Sbjct: 1261 KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ 1320

Query: 1321 DQVADIFTKALKFDLFVKFRGKLGVAQV 1343
            DQVADIFTKALKFDLFVKFRGKLGVAQV
Sbjct: 1321 DQVADIFTKALKFDLFVKFRGKLGVAQV 1348

BLAST of IVF0014015 vs. ExPASy TrEMBL
Match: A0A5D3CXM6 (Integrase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold209G001540 PE=4 SV=1)

HSP 1 Score: 2667.5 bits (6913), Expect = 0.0e+00
Identity = 1331/1347 (98.81%), Postives = 1333/1347 (98.96%), Query Frame = 0

Query: 1    MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE 60
            MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE
Sbjct: 466  MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE 525

Query: 61   LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS 120
            LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS
Sbjct: 526  LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS 585

Query: 121  EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE 180
            EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE
Sbjct: 586  EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE 645

Query: 181  SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY 240
            SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY
Sbjct: 646  SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY 705

Query: 241  DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFLKFNALIA------ETNCWA 300
            DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNF +            + NCWA
Sbjct: 706  DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFSQIQCFNCGKYGHFQANCWA 765

Query: 301  LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD 360
            LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD
Sbjct: 766  LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD 825

Query: 361  ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV 420
            ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV
Sbjct: 826  ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV 885

Query: 421  SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF 480
            SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF
Sbjct: 886  SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF 945

Query: 481  KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP 540
            KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP
Sbjct: 946  KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP 1005

Query: 541  MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR 600
            MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR
Sbjct: 1006 MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR 1065

Query: 601  GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA 660
            GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA
Sbjct: 1066 GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA 1125

Query: 661  VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK 720
            VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK
Sbjct: 1126 VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK 1185

Query: 721  CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA 780
            CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA
Sbjct: 1186 CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA 1245

Query: 781  QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD 840
            QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD
Sbjct: 1246 QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD 1305

Query: 841  PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK 900
            PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK
Sbjct: 1306 PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK 1365

Query: 901  YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK 960
            YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK
Sbjct: 1366 YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK 1425

Query: 961  EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY 1020
            EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY
Sbjct: 1426 EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY 1485

Query: 1021 VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE 1080
            VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE
Sbjct: 1486 VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE 1545

Query: 1081 GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL 1140
            GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL
Sbjct: 1546 GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL 1605

Query: 1141 TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS 1200
            TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS
Sbjct: 1606 TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS 1665

Query: 1201 DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML 1260
            DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML
Sbjct: 1666 DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML 1725

Query: 1261 KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ 1320
            KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ
Sbjct: 1726 KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ 1785

Query: 1321 DQVADIFTKALKFDLFVKFRGKLGVAQ 1342
            DQVADIFTKALKFDLFVKFRGKLGVAQ
Sbjct: 1786 DQVADIFTKALKFDLFVKFRGKLGVAQ 1812

BLAST of IVF0014015 vs. ExPASy TrEMBL
Match: A0A5A7UDP7 (Integrase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold174G001450 PE=4 SV=1)

HSP 1 Score: 2667.5 bits (6913), Expect = 0.0e+00
Identity = 1331/1348 (98.74%), Postives = 1334/1348 (98.96%), Query Frame = 0

Query: 1    MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE 60
            MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYT+VENQSELTNQQLVE
Sbjct: 1    MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTKVENQSELTNQQLVE 60

Query: 61   LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS 120
            LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS
Sbjct: 61   LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS 120

Query: 121  EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE 180
            EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE
Sbjct: 121  EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE 180

Query: 181  SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY 240
            SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY
Sbjct: 181  SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY 240

Query: 241  DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFLKFNALIA------ETNCWA 300
            DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNF +            + NCWA
Sbjct: 241  DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFSQIQCFNCGKYGHFQANCWA 300

Query: 301  LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD 360
            LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD
Sbjct: 301  LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD 360

Query: 361  ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV 420
            ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV
Sbjct: 361  ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV 420

Query: 421  SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF 480
            SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF
Sbjct: 421  SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF 480

Query: 481  KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP 540
            KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP
Sbjct: 481  KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP 540

Query: 541  MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR 600
            MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR
Sbjct: 541  MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR 600

Query: 601  GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA 660
            GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA
Sbjct: 601  GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA 660

Query: 661  VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK 720
            VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK
Sbjct: 661  VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK 720

Query: 721  CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA 780
            CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA
Sbjct: 721  CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA 780

Query: 781  QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD 840
            QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD
Sbjct: 781  QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD 840

Query: 841  PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK 900
            PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK
Sbjct: 841  PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK 900

Query: 901  YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK 960
            YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK
Sbjct: 901  YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK 960

Query: 961  EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY 1020
            EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY
Sbjct: 961  EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY 1020

Query: 1021 VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE 1080
            VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE
Sbjct: 1021 VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE 1080

Query: 1081 GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL 1140
            GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL
Sbjct: 1081 GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL 1140

Query: 1141 TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS 1200
            TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS
Sbjct: 1141 TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS 1200

Query: 1201 DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML 1260
            DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML
Sbjct: 1201 DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML 1260

Query: 1261 KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ 1320
            KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ
Sbjct: 1261 KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ 1320

Query: 1321 DQVADIFTKALKFDLFVKFRGKLGVAQV 1343
            DQVADIFTKALKFDLFVKFRGKLGVAQV
Sbjct: 1321 DQVADIFTKALKFDLFVKFRGKLGVAQV 1348

BLAST of IVF0014015 vs. ExPASy TrEMBL
Match: A0A5A7TEJ0 (Integrase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold122G002270 PE=4 SV=1)

HSP 1 Score: 2667.5 bits (6913), Expect = 0.0e+00
Identity = 1331/1348 (98.74%), Postives = 1333/1348 (98.89%), Query Frame = 0

Query: 1    MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE 60
            MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE
Sbjct: 1    MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE 60

Query: 61   LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS 120
            LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS
Sbjct: 61   LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS 120

Query: 121  EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE 180
            EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE
Sbjct: 121  EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE 180

Query: 181  SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY 240
            SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY
Sbjct: 181  SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY 240

Query: 241  DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFLKFNALIA------ETNCWA 300
            DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNF +            + NCWA
Sbjct: 241  DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFSQIQCFNCGKYGHFQANCWA 300

Query: 301  LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD 360
            LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD
Sbjct: 301  LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD 360

Query: 361  ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV 420
            ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV
Sbjct: 361  ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV 420

Query: 421  SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF 480
            SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF
Sbjct: 421  SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF 480

Query: 481  KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP 540
            KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP
Sbjct: 481  KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP 540

Query: 541  MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR 600
            MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR
Sbjct: 541  MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR 600

Query: 601  GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA 660
            GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA
Sbjct: 601  GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA 660

Query: 661  VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK 720
            VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK
Sbjct: 661  VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK 720

Query: 721  CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA 780
            CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA
Sbjct: 721  CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA 780

Query: 781  QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD 840
            QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD
Sbjct: 781  QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD 840

Query: 841  PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK 900
            PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK
Sbjct: 841  PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK 900

Query: 901  YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK 960
            YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK
Sbjct: 901  YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK 960

Query: 961  EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY 1020
            EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY
Sbjct: 961  EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY 1020

Query: 1021 VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE 1080
            VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE
Sbjct: 1021 VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE 1080

Query: 1081 GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL 1140
            GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL
Sbjct: 1081 GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL 1140

Query: 1141 TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS 1200
            TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS
Sbjct: 1141 TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS 1200

Query: 1201 DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML 1260
            DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML
Sbjct: 1201 DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML 1260

Query: 1261 KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ 1320
            KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ
Sbjct: 1261 KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ 1320

Query: 1321 DQVADIFTKALKFDLFVKFRGKLGVAQV 1343
            DQVADIFTKA KFDLFVKFRGKLGVAQV
Sbjct: 1321 DQVADIFTKAFKFDLFVKFRGKLGVAQV 1348

BLAST of IVF0014015 vs. NCBI nr
Match: KAA0057291.1 (integrase [Cucumis melo var. makuwa] >KAA0060890.1 integrase [Cucumis melo var. makuwa] >KAA0062702.1 integrase [Cucumis melo var. makuwa] >TYJ98712.1 integrase [Cucumis melo var. makuwa] >TYK13441.1 integrase [Cucumis melo var. makuwa])

HSP 1 Score: 2655 bits (6882), Expect = 0.0
Identity = 1332/1348 (98.81%), Postives = 1334/1348 (98.96%), Query Frame = 0

Query: 1    MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE 60
            MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE
Sbjct: 1    MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE 60

Query: 61   LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS 120
            LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS
Sbjct: 61   LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS 120

Query: 121  EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE 180
            EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE
Sbjct: 121  EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE 180

Query: 181  SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY 240
            SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY
Sbjct: 181  SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY 240

Query: 241  DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFLKFNALIA------ETNCWA 300
            DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNF +            + NCWA
Sbjct: 241  DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFSQIQCFNCGKYGHFQANCWA 300

Query: 301  LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD 360
            LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD
Sbjct: 301  LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD 360

Query: 361  ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV 420
            ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV
Sbjct: 361  ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV 420

Query: 421  SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF 480
            SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF
Sbjct: 421  SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF 480

Query: 481  KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP 540
            KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP
Sbjct: 481  KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP 540

Query: 541  MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR 600
            MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR
Sbjct: 541  MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR 600

Query: 601  GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA 660
            GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA
Sbjct: 601  GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA 660

Query: 661  VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK 720
            VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK
Sbjct: 661  VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK 720

Query: 721  CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA 780
            CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA
Sbjct: 721  CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA 780

Query: 781  QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD 840
            QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD
Sbjct: 781  QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD 840

Query: 841  PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK 900
            PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK
Sbjct: 841  PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK 900

Query: 901  YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK 960
            YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK
Sbjct: 901  YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK 960

Query: 961  EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY 1020
            EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY
Sbjct: 961  EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY 1020

Query: 1021 VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE 1080
            VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE
Sbjct: 1021 VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE 1080

Query: 1081 GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL 1140
            GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL
Sbjct: 1081 GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL 1140

Query: 1141 TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS 1200
            TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS
Sbjct: 1141 TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS 1200

Query: 1201 DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML 1260
            DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML
Sbjct: 1201 DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML 1260

Query: 1261 KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ 1320
            KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ
Sbjct: 1261 KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ 1320

Query: 1321 DQVADIFTKALKFDLFVKFRGKLGVAQV 1342
            DQVADIFTKALKFDLFVKFRGKLGVAQV
Sbjct: 1321 DQVADIFTKALKFDLFVKFRGKLGVAQV 1348

BLAST of IVF0014015 vs. NCBI nr
Match: TYJ95504.1 (integrase [Cucumis melo var. makuwa] >TYK08522.1 integrase [Cucumis melo var. makuwa] >TYK12883.1 integrase [Cucumis melo var. makuwa] >TYK20599.1 integrase [Cucumis melo var. makuwa])

HSP 1 Score: 2654 bits (6878), Expect = 0.0
Identity = 1331/1348 (98.74%), Postives = 1334/1348 (98.96%), Query Frame = 0

Query: 1    MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE 60
            MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE
Sbjct: 1    MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE 60

Query: 61   LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS 120
            LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS
Sbjct: 61   LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS 120

Query: 121  EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE 180
            EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE
Sbjct: 121  EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE 180

Query: 181  SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY 240
            SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY
Sbjct: 181  SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY 240

Query: 241  DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFLKFNALIA------ETNCWA 300
            DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNF +            + NCWA
Sbjct: 241  DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFSQIQCFNCGKYGHFQANCWA 300

Query: 301  LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD 360
            LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD
Sbjct: 301  LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD 360

Query: 361  ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV 420
            ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV
Sbjct: 361  ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV 420

Query: 421  SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF 480
            SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF
Sbjct: 421  SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF 480

Query: 481  KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP 540
            KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP
Sbjct: 481  KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP 540

Query: 541  MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR 600
            MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR
Sbjct: 541  MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR 600

Query: 601  GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA 660
            GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA
Sbjct: 601  GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA 660

Query: 661  VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK 720
            VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK
Sbjct: 661  VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK 720

Query: 721  CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA 780
            CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA
Sbjct: 721  CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA 780

Query: 781  QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD 840
            QELEQAEIQA+ESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD
Sbjct: 781  QELEQAEIQAVESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD 840

Query: 841  PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK 900
            PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK
Sbjct: 841  PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK 900

Query: 901  YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK 960
            YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK
Sbjct: 901  YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK 960

Query: 961  EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY 1020
            EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY
Sbjct: 961  EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY 1020

Query: 1021 VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE 1080
            VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE
Sbjct: 1021 VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE 1080

Query: 1081 GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL 1140
            GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL
Sbjct: 1081 GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL 1140

Query: 1141 TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS 1200
            TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS
Sbjct: 1141 TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS 1200

Query: 1201 DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML 1260
            DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML
Sbjct: 1201 DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML 1260

Query: 1261 KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ 1320
            KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ
Sbjct: 1261 KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ 1320

Query: 1321 DQVADIFTKALKFDLFVKFRGKLGVAQV 1342
            DQVADIFTKALKFDLFVKFRGKLGVAQV
Sbjct: 1321 DQVADIFTKALKFDLFVKFRGKLGVAQV 1348

BLAST of IVF0014015 vs. NCBI nr
Match: KAA0039947.1 (integrase [Cucumis melo var. makuwa])

HSP 1 Score: 2654 bits (6878), Expect = 0.0
Identity = 1331/1348 (98.74%), Postives = 1333/1348 (98.89%), Query Frame = 0

Query: 1    MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE 60
            MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE
Sbjct: 1    MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE 60

Query: 61   LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS 120
            LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS
Sbjct: 61   LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS 120

Query: 121  EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE 180
            EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE
Sbjct: 121  EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE 180

Query: 181  SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY 240
            SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY
Sbjct: 181  SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY 240

Query: 241  DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFLKFNALIA------ETNCWA 300
            DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNF +            + NCWA
Sbjct: 241  DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFSQIQCFNCGKYGHFQANCWA 300

Query: 301  LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD 360
            LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD
Sbjct: 301  LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD 360

Query: 361  ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV 420
            ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV
Sbjct: 361  ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV 420

Query: 421  SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF 480
            SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF
Sbjct: 421  SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF 480

Query: 481  KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP 540
            KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP
Sbjct: 481  KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP 540

Query: 541  MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR 600
            MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR
Sbjct: 541  MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR 600

Query: 601  GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA 660
            GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA
Sbjct: 601  GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA 660

Query: 661  VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK 720
            VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK
Sbjct: 661  VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK 720

Query: 721  CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA 780
            CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA
Sbjct: 721  CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA 780

Query: 781  QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD 840
            QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD
Sbjct: 781  QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD 840

Query: 841  PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK 900
            PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK
Sbjct: 841  PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK 900

Query: 901  YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK 960
            YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK
Sbjct: 901  YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK 960

Query: 961  EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY 1020
            EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY
Sbjct: 961  EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY 1020

Query: 1021 VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE 1080
            VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE
Sbjct: 1021 VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE 1080

Query: 1081 GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL 1140
            GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL
Sbjct: 1081 GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL 1140

Query: 1141 TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS 1200
            TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS
Sbjct: 1141 TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS 1200

Query: 1201 DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML 1260
            DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML
Sbjct: 1201 DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML 1260

Query: 1261 KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ 1320
            KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ
Sbjct: 1261 KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ 1320

Query: 1321 DQVADIFTKALKFDLFVKFRGKLGVAQV 1342
            DQVADIFTKA KFDLFVKFRGKLGVAQV
Sbjct: 1321 DQVADIFTKAFKFDLFVKFRGKLGVAQV 1348

BLAST of IVF0014015 vs. NCBI nr
Match: KAA0051599.1 (integrase [Cucumis melo var. makuwa])

HSP 1 Score: 2654 bits (6878), Expect = 0.0
Identity = 1331/1348 (98.74%), Postives = 1334/1348 (98.96%), Query Frame = 0

Query: 1    MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE 60
            MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYT+VENQSELTNQQLVE
Sbjct: 1    MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTKVENQSELTNQQLVE 60

Query: 61   LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS 120
            LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS
Sbjct: 61   LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS 120

Query: 121  EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE 180
            EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE
Sbjct: 121  EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE 180

Query: 181  SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY 240
            SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY
Sbjct: 181  SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY 240

Query: 241  DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFLKFNALIA------ETNCWA 300
            DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNF +            + NCWA
Sbjct: 241  DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFSQIQCFNCGKYGHFQANCWA 300

Query: 301  LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD 360
            LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD
Sbjct: 301  LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD 360

Query: 361  ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV 420
            ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV
Sbjct: 361  ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV 420

Query: 421  SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF 480
            SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF
Sbjct: 421  SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF 480

Query: 481  KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP 540
            KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP
Sbjct: 481  KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP 540

Query: 541  MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR 600
            MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR
Sbjct: 541  MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR 600

Query: 601  GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA 660
            GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA
Sbjct: 601  GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA 660

Query: 661  VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK 720
            VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK
Sbjct: 661  VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK 720

Query: 721  CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA 780
            CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA
Sbjct: 721  CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA 780

Query: 781  QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD 840
            QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD
Sbjct: 781  QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD 840

Query: 841  PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK 900
            PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK
Sbjct: 841  PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK 900

Query: 901  YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK 960
            YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK
Sbjct: 901  YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK 960

Query: 961  EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY 1020
            EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY
Sbjct: 961  EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY 1020

Query: 1021 VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE 1080
            VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE
Sbjct: 1021 VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE 1080

Query: 1081 GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL 1140
            GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL
Sbjct: 1081 GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL 1140

Query: 1141 TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS 1200
            TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS
Sbjct: 1141 TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS 1200

Query: 1201 DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML 1260
            DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML
Sbjct: 1201 DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML 1260

Query: 1261 KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ 1320
            KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ
Sbjct: 1261 KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ 1320

Query: 1321 DQVADIFTKALKFDLFVKFRGKLGVAQV 1342
            DQVADIFTKALKFDLFVKFRGKLGVAQV
Sbjct: 1321 DQVADIFTKALKFDLFVKFRGKLGVAQV 1348

BLAST of IVF0014015 vs. NCBI nr
Match: TYK16225.1 (integrase [Cucumis melo var. makuwa])

HSP 1 Score: 2654 bits (6878), Expect = 0.0
Identity = 1331/1347 (98.81%), Postives = 1333/1347 (98.96%), Query Frame = 0

Query: 1    MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE 60
            MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE
Sbjct: 466  MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE 525

Query: 61   LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS 120
            LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS
Sbjct: 526  LRENRKKDKKALFFIYQAVDEFIFERISTATSAKAAWDILRSTYQGEDKVKMIRLQALRS 585

Query: 121  EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE 180
            EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE
Sbjct: 586  EFDCIKMKETETIEEFFNRILVIVNSLRSNGEEVGDQRVVEKILRSMPRKFEHIVVAIEE 645

Query: 181  SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY 240
            SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY
Sbjct: 646  SKDLSTLSINSLMGSLQSHELRLKQFDVNPEEAFQMQTSFRGGSRGRRGGHGRRGGGRNY 705

Query: 241  DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFLKFNALIA------ETNCWA 300
            DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNF +            + NCWA
Sbjct: 706  DNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFSQIQCFNCGKYGHFQANCWA 765

Query: 301  LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD 360
            LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD
Sbjct: 766  LKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSGCSNHMTGNRSIFVTLD 825

Query: 361  ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV 420
            ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV
Sbjct: 826  ESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLKHNLLSIGQLLQRGLKV 885

Query: 421  SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF 480
            SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF
Sbjct: 886  SFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSILKDPSWLWHFRYGHLNF 945

Query: 481  KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP 540
            KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP
Sbjct: 946  KSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAWRASKPLELIHTDLCGP 1005

Query: 541  MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR 600
            MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR
Sbjct: 1006 MRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAFTENQSGYKIKTLRSDR 1065

Query: 601  GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA 660
            GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA
Sbjct: 1066 GGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMARSMLKAKNLPNEFWGDA 1125

Query: 661  VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK 720
            VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK
Sbjct: 1126 VACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYSHIPNQLRGKLDDKSEK 1185

Query: 721  CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA 780
            CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA
Sbjct: 1186 CIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVDEAKSPFHVNIDENEVA 1245

Query: 781  QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD 840
            QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD
Sbjct: 1246 QELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNRINDDHFANFALFAGVD 1305

Query: 841  PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK 900
            PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK
Sbjct: 1306 PVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVEK 1365

Query: 901  YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK 960
            YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK
Sbjct: 1366 YKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHLK 1425

Query: 961  EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY 1020
            EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY
Sbjct: 1426 EEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPYEHALY 1485

Query: 1021 VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE 1080
            VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE
Sbjct: 1486 VKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNE 1545

Query: 1081 GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL 1140
            GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL
Sbjct: 1546 GEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYL 1605

Query: 1141 TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS 1200
            TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS
Sbjct: 1606 TATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDS 1665

Query: 1201 DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML 1260
            DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML
Sbjct: 1666 DWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALWLRWML 1725

Query: 1261 KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ 1320
            KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ
Sbjct: 1726 KELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRDLVKDGEVIVKYCKTQ 1785

Query: 1321 DQVADIFTKALKFDLFVKFRGKLGVAQ 1341
            DQVADIFTKALKFDLFVKFRGKLGVAQ
Sbjct: 1786 DQVADIFTKALKFDLFVKFRGKLGVAQ 1812

BLAST of IVF0014015 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 362.1 bits (928), Expect = 2.0e-99
Identity = 188/470 (40.00%), Postives = 287/470 (61.06%), Query Frame = 0

Query: 834  DPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGVKWVYRTKLKSDGNVE 893
            +P T++EA +   W  AMD EI A+    TWE+  LP NK+ +G KWVY+ K  SDG +E
Sbjct: 85   EPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTIE 144

Query: 894  KYKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQNGWKVYQMDVKSAFLNGHL 953
            +YKARLV KGY Q+ G+D+ E F+PV ++ +++LIL+++A   + ++Q+D+ +AFLNG L
Sbjct: 145  RYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFLNGDL 204

Query: 954  KEEIFVAQPLGYVQRGEE----EKVYKLKKALYGLKQAPRAWYSRIDSFFLKTGFRRCPY 1013
             EEI++  P GY  R  +      V  LKK++YGLKQA R W+ +     +  GF +   
Sbjct: 205  DEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFVQSHS 264

Query: 1014 EHALYVKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIE 1073
            +H  ++K      FL V +YVDD++   N+    D+ K+ +K+ F++ D+G + YFLG+E
Sbjct: 265  DHTYFLKITA-TLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFLGLE 324

Query: 1074 VNQNEGEIVISQQKYAHDLLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVG 1133
            + ++   I I Q+KYA DLL +  +    P + PMD ++       G+ VD   YR L+G
Sbjct: 325  IARSAAGINICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAHSGGDFVDAKAYRRLIG 384

Query: 1134 SLMYLTATRPDILFAVSMLSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMF 1193
             LMYL  TR DI FAV+ LS+F   P+ +H +A  ++L YI GT+  G++Y   +E  + 
Sbjct: 385  RLMYLQITRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQGLFYSSQAEMQLQ 444

Query: 1194 GFCDSDWGGNVDDHKSTSGYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALW 1253
             F D+ +    D  +ST+GY   +G+ + SW SKKQ VV+ S+ EAEY +L+ A  + +W
Sbjct: 445  VFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEYRALSFATDEMMW 504

Query: 1254 LRWMLKELKCIQKCETVLFCDNGSAIALSKNPVFHGRSKHIRIKYHFIRD 1300
            L    +EL+      T+LFCDN +AI ++ N VFH R+KHI    H +R+
Sbjct: 505  LAQFFRELQLPLSKPTLLFCDNTAAIHIATNAVFHERTKHIESDCHSVRE 553

BLAST of IVF0014015 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 169.5 bits (428), Expect = 1.9e-41
Identity = 87/222 (39.19%), Postives = 131/222 (59.01%), Query Frame = 0

Query: 1028 LYVDDLLFTGNDKFLCDDFKNSMKNEFEMSDMGLIHYFLGIEVNQNEGEIVISQQKYAHD 1087
            LYVDD+L TG+   L +     + + F M D+G +HYFLGI++  +   + +SQ KYA  
Sbjct: 5    LYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYAEQ 64

Query: 1088 LLKKFRMENASPCNTPMDANLKLCKDDIGEAVDPSLYRSLVGSLMYLTATRPDILFAVSM 1147
            +L    M +  P +TP+   L        +  DPS +RS+VG+L YLT TRPDI +AV++
Sbjct: 65   ILNNAGMLDCKPMSTPLPLKLN-SSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYAVNI 124

Query: 1148 LSRFMTNPKRSHWEAGKRVLRYILGTINFGIYYKKVSESVMFGFCDSDWGGNVDDHKSTS 1207
            + + M  P  + ++  KRVLRY+ GTI  G+Y  K S+  +  FCDSDW G     +ST+
Sbjct: 125  VCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRSTT 184

Query: 1208 GYVFSMGSGVFSWTSKKQSVVALSTTEAEYISLAAAGCQALW 1250
            G+   +G  + SW++K+Q  V+ S+TE EY +LA    +  W
Sbjct: 185  GFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of IVF0014015 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 103.6 bits (257), Expect = 1.3e-21
Identity = 50/118 (42.37%), Postives = 75/118 (63.56%), Query Frame = 0

Query: 817 NRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQAL 876
           N++N  +          +P +   A++D  W  AM +E+DA+ RN+TW L+  P N+  L
Sbjct: 10  NKLNPKYSLTITTTIKKEPKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNIL 69

Query: 877 GVKWVYRTKLKSDGNVEKYKARLVVKGYKQEYGVDYEEIFAPVTRIETIRLILSLAAQ 935
           G KWV++TKL SDG +++ KARLV KG+ QE G+ + E ++PV R  TIR IL++A Q
Sbjct: 70  GCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNVAQQ 127

BLAST of IVF0014015 vs. TAIR 10
Match: AT1G48720.1 (unknown protein; Has 229 Blast hits to 229 proteins in 10 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 0; Plants - 228; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 86.7 bits (213), Expect = 1.6e-16
Identity = 43/94 (45.74%), Postives = 65/94 (69.15%), Query Frame = 0

Query: 1  MASNGNMLQPQLPRFSGKNFNQWSIQMKVLYGSQELWDIVERGYTEVENQSELTNQQLVE 60
          MASN   +  Q+P  +  N++ WS++MK + G+ ++W+IVE+G+ E EN+  L+  Q   
Sbjct: 1  MASNN--VPFQVPVLTKSNYDNWSLRMKAILGAHDVWEIVEKGFIEPENEGSLSQTQKDG 60

Query: 61 LRENRKKDKKALFFIYQAVDEFIFERISTATSAK 95
          LR++RK+DKKAL  IYQ +DE  FE++  ATSAK
Sbjct: 61 LRDSRKRDKKALCLIYQGLDEDTFEKVVEATSAK 92

BLAST of IVF0014015 vs. TAIR 10
Match: AT3G20980.1 (Gag-Pol-related retrotransposon family protein )

HSP 1 Score: 72.4 bits (176), Expect = 3.1e-12
Identity = 48/132 (36.36%), Postives = 67/132 (50.76%), Query Frame = 0

Query: 330 EPTWYLDSGCSNHMTGNRSIFVTLDESFQSEVK--TGD---NTRLQVKGQGDILVKTKKG 389
           E  W + S  SNHMT +   F TLD S + +VK  +GD    T   V+G GD+   T +G
Sbjct: 266 ENIWLISSTNSNHMTPHVKFFTTLDRSRKCKVKFISGDKSETTVAMVEGIGDVTFITNEG 325

Query: 390 TKRVTNVFYVPGLKHNLLSIGQLLQRGLKVSFEGDI-CAIKDQAGVLIAKVKMTANKMFP 449
            K + NV YVPG++ N LS+ QL + G +VS E    C + D+    +    M   + F 
Sbjct: 326 NKTIKNVLYVPGIEGNALSVSQLKRNGFEVSMERRTGCFVWDRTTGKMFGKNMWEKRGFC 385

Query: 450 LNFTY--GQISC 454
           L F+   G   C
Sbjct: 386 LRFSVIEGNFQC 397

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P109783.9e-19332.70Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
Q94HW26.2e-16729.42Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
P041461.1e-15829.31Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
Q9ZT944.1e-15829.18Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
P256003.3e-4334.88Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT... [more]
Match NameE-valueIdentityDescription
A0A5D3E3T20.0e+0098.81Integrase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold156G00030 PE=4... [more]
A0A5D3CLV10.0e+0098.74Integrase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold132G00180 PE=4... [more]
A0A5D3CXM60.0e+0098.81Integrase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold209G001540 PE=... [more]
A0A5A7UDP70.0e+0098.74Integrase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold174G001450 PE=... [more]
A0A5A7TEJ00.0e+0098.74Integrase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold122G002270 PE=... [more]
Match NameE-valueIdentityDescription
KAA0057291.10.098.81integrase [Cucumis melo var. makuwa] >KAA0060890.1 integrase [Cucumis melo var. ... [more]
TYJ95504.10.098.74integrase [Cucumis melo var. makuwa] >TYK08522.1 integrase [Cucumis melo var. ma... [more]
KAA0039947.10.098.74integrase [Cucumis melo var. makuwa][more]
KAA0051599.10.098.74integrase [Cucumis melo var. makuwa][more]
TYK16225.10.098.81integrase [Cucumis melo var. makuwa][more]
Match NameE-valueIdentityDescription
AT4G23160.12.0e-9940.00cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [more]
ATMG00810.11.9e-4139.19DNA/RNA polymerases superfamily protein [more]
ATMG00820.11.3e-2142.37Reverse transcriptase (RNA-dependent DNA polymerase) [more]
AT1G48720.11.6e-1645.74unknown protein; Has 229 Blast hits to 229 proteins in 10 species: Archae - 0; B... [more]
AT3G20980.13.1e-1236.36Gag-Pol-related retrotransposon family protein [more]
InterPro
Analysis Name: InterPro Annotations of Melon (IVF77) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 44..64
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 65..205
e-value: 4.0E-27
score: 94.7
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 241..261
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 217..274
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 781..805
NoneNo IPR availablePANTHERPTHR45895FAMILY NOT NAMEDcoord: 333..1217
NoneNo IPR availableCDDcd09272RNase_HI_RT_Ty1coord: 1190..1325
e-value: 2.39762E-77
score: 249.693
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 452..507
e-value: 1.6E-13
score: 50.2
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 861..1104
e-value: 1.5E-86
score: 290.0
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 516..695
e-value: 6.3E-45
score: 154.9
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 521..620
e-value: 5.1E-15
score: 55.6
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 519..683
score: 26.701679
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 517..680
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 860..1296

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
IVF0014015.1IVF0014015.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0006397 mRNA processing
molecular_function GO:0003729 mRNA binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003676 nucleic acid binding