{"id":59,"date":"2025-06-13T12:14:11","date_gmt":"2025-06-13T09:14:11","guid":{"rendered":"https:\/\/sisu.ut.ee\/bioinfo\/?page_id=59"},"modified":"2025-06-19T15:25:38","modified_gmt":"2025-06-19T12:25:38","slug":"tools-and-datasets","status":"publish","type":"page","link":"https:\/\/sisu.ut.ee\/bioinfo\/tools-and-datasets\/","title":{"rendered":"Applications"},"content":{"rendered":"<p><strong>WEB-BASED APPLICATIONS<\/strong><br><strong><a href=\"http:\/\/primer3.ut.ee\/\" target=\"_blank\" rel=\"noreferrer noopener\">Primer3 version 4.1.0<\/a><br><\/strong>Primer3 is the most popular software for PCR primer design.<\/p>\n\n\n\n<p><strong>Primer3 version 0.4<br><\/strong>Older version of Primer3.<\/p>\n\n\n\n<p><strong>PhenotypeSeeker<br><\/strong>Predicts antimicrobial resistance and other phenotypes of bacterial isolates, using WMS or WGS data.<\/p>\n\n\n\n<p><strong>StrainSeeker 1.1<br><\/strong>Identification of bacterial species and strains from Illumina sequencing (WGS or WMS) data.<\/p>\n\n\n\n<p><strong>SNPmasker 1.1<\/strong><br>Masks known SNPS and other genomic regions that should be avoided when designing PCR primers<\/p>\n\n\n\n<p><strong><a href=\"https:\/\/bioinfo.ut.ee\/genometester\/\">GenomeTester 1.3<\/a><\/strong><br>Fast e-PCR program for detecting binding sites of primers and potential PCR products<\/p>\n\n\n\n<p><strong><a href=\"https:\/\/bioinfo.ut.ee\/multiplx\/index.php\">MultiPLX 2.1<\/a><\/strong><br>Software for grouping existing PCR primers into multiplex groups<\/p>\n\n\n\n<p><strong><a href=\"https:\/\/bioinfo.ut.ee\/oligotm\/\">OligoTM 1.0<\/a><\/strong><br>Predicts Tm of short oligonucleotides<\/p>\n\n\n\n<p><strong>cRegions<\/strong><br><br><strong>PhyloCube<\/strong><br><br><strong>Y-mer<\/strong><br>Predicts human chrY haplogroups, using a novel algorithm<\/p>\n\n\n\n<p><br><br><strong>SOFTWARE and DATASETS FOR DOWNLOAD<\/strong><br><br><strong>KATK package version 4.2<\/strong><br>KATK is a toolkit for discovering\u00a0and calling SNV and indel genotypes from human personal genomes. Unlike FastGT, KATK can detect\u00a0<strong>all<\/strong>\u00a0variants, even rare and de novo variants. It can process a deep-sequenced human genome in 1-3 hours.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"http:\/\/bioinfo.ut.ee\/KATK\/\" target=\"_blank\" rel=\"noreferrer noopener\">K-mer databases, user manuals and support.<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/github.com\/bioinfo-ut\/GenomeTester4\" target=\"_blank\" rel=\"noreferrer noopener\">Source and precompiled binaries of KATK<\/a>\u00a0(gmer_counter and gassembler)\u00a0are available in a subfolder under the GenomeTester4 project in GitHub.<\/li>\n<\/ul>\n\n\n\n<p><strong>Citation:<\/strong>\u00a0Kaplinski L, M\u00f6ls M, Puurand T, Pajuste FD, Remm M\u00a0(2021).\u00a0KATK: Fast genotyping of rare variants directly from unmapped sequencing reads.\u00a0<em>Human Mutation<\/em>, 42(6):777-786. doi: 10.1002\/humu.24197.\u00a0<a href=\"https:\/\/onlinelibrary.wiley.com\/doi\/10.1002\/humu.24197\" target=\"_blank\" rel=\"noreferrer noopener\">[Full Text]<\/a><br><br><\/p>\n\n\n\n<p><strong>AluMine package<\/strong><br>AluMine is a toolkit for discovery and genotyping of polymorphic Alu element insertions in personal genomes. It can do discovery in 4 hours and genotyping in 20 minutes (from FASTQ of 30x personal genome).<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/github.com\/bioinfo-ut\/AluMine\" target=\"_blank\" rel=\"noreferrer noopener\">Source and precompiled binaries of AluMine from GitHub<\/a>.<\/li>\n\n\n\n<li><a href=\"http:\/\/bioinfo.ut.ee\/AluMine\/Alu.REF-plus.filtered.fas\" target=\"_blank\" rel=\"noreferrer noopener\">Sequences of 13,396 potentially polymorphic Alu elements.<\/a>\u00a0FASTA file with full-length sequences of potentially polymorphic REF+ elements. (REF+ are those elements that are present in the reference genome).<\/li>\n\n\n\n<li><a href=\"http:\/\/bioinfo.ut.ee\/AluMine\/ALU.REF-plus_v1.kmer.db\" target=\"_blank\" rel=\"noreferrer noopener\">List of 13,396 potentially polymorphic ALU elements.<\/a>\u00a0Coordinates and signature sequences of discovered REF+ Alu element insertions.<\/li>\n\n\n\n<li><a href=\"http:\/\/bioinfo.ut.ee\/AluMine\/ALU_only_v1.kmer.db\" target=\"_blank\" rel=\"noreferrer noopener\">List of potentially polymorphic ALU elements.<\/a>\u00a0Coordinates and signature sequences of discovered 23,108 REF+ and REF- Alu element insertions.<\/li>\n\n\n\n<li><a href=\"http:\/\/bioinfo.ut.ee\/FastGT\/downloads\/ALU_v1.kmer.db\" target=\"_blank\" rel=\"noreferrer noopener\">Combined set of ALU and SNP marker\u00a0<em>k<\/em>-mers for genotyping<\/a>.\u00a0<em>K<\/em>-mers for simultaneous genotyping of 23,108 polymorphic Alu element insertions and 30 million SNVs. FastGT format.<\/li>\n\n\n\n<li><a href=\"http:\/\/bioinfo.ut.ee\/AluMine\/filter_gt_markers.txt\" target=\"_blank\" rel=\"noreferrer noopener\">Stats of markers.<\/a>\u00a0Based on genotyping these markers on 2200 individual genomes. Contains allele frequencies, fraction of expected genotypes, fraction of markers violating HWE, etc. for each marker. See also Table S2 in Additional_File_2.xlsx.<\/li>\n\n\n\n<li><a href=\"http:\/\/bioinfo.ut.ee\/AluMine\/filter_gt_stats.txt\" target=\"_blank\" rel=\"noreferrer noopener\">Stats of tested individuals.<\/a>\u00a0Based on genotyping all Alu markers on 2200 individual genomes. Contains depth of coverage (based on 25-mer coverage), number of discovered REF+ and REF- elements per individual, and other information.<\/li>\n\n\n\n<li><a href=\"http:\/\/bioinfo.ut.ee\/AluMine\/pan_troglodytes_32.list\" target=\"_blank\" rel=\"noreferrer noopener\">pan_troglodytes_32.list<\/a>\u00a0Chimp 32-mer list used during the REF-plus element discovery.<\/li>\n\n\n\n<li><a href=\"http:\/\/bioinfo.ut.ee\/AluMine\/human_37_25.index\" target=\"_blank\" rel=\"noreferrer noopener\">human_37_25.index<\/a>\u00a0Human 25-mer list used during the REF-minus element discovery to localize potential novel elements in the human reference genome.<\/li>\n\n\n\n<li><a href=\"http:\/\/bioinfo.ut.ee\/AluMine\/human_37.names\" target=\"_blank\" rel=\"noreferrer noopener\">human_37.names<\/a>\u00a0Additional file required for gtester. This is used during the REF-minus element discovery.<\/li>\n<\/ul>\n\n\n\n<p><strong>Citation:<\/strong>\u00a0Puurand T, Kuku\u0161kina V, Pajuste F-D, Remm M. (2019). AluMine: alignment-free method for the discovery of polymorphic Alu element insertions. Mobile DNA, 10: 31.<br>doi:\u00a0<a href=\"http:\/\/dx.doi.org\/10.1186\/s13100-019-0174-3\" target=\"_blank\" rel=\"noreferrer noopener\">10.1186\/s13100-019-0174-3<\/a><\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\">\n\n\n\n<p><strong>FastGT package version 4.2<\/strong><br>FastGT is a toolkit for counting\u00a0<em>k<\/em>-mers from FASTQ files and calling 30 millions human SNV genotypes. It can process 30x human genome in less than an hour.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"http:\/\/bioinfo.ut.ee\/FastGT\/\" target=\"_blank\" rel=\"noreferrer noopener\">K-mer databases, user manuals and support.<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/github.com\/bioinfo-ut\/GenomeTester4\" target=\"_blank\" rel=\"noreferrer noopener\">Source and precompiled binaries of FastGT<\/a>\u00a0are available in a subfolder under the GenomeTester4 project in GitHub.<\/li>\n<\/ul>\n\n\n\n<p><\/p>\n\n\n\n<p><strong>Citation:\u00a0<\/strong>Pajuste F-D, Kaplinski L, M\u00f6ls M, Puurand T, Lepamets M, Remm M. (2017). FastGT: an alignment-free method for calling common SNVs directly from raw sequencing reads. Scientific Reports, 7:2537.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\">\n\n\n\n<p><strong>GenomeTester package version 4.2<\/strong><br>GenomeTester4 is a toolkit for counting\u00a0<em>k<\/em>-mers from nucleic acid sequences and performing basic set operations (union, intersection and difference) on\u00a0<em>k<\/em>-mer lists. It can be used in many bioinformatic analyses. GenomeTester4 is released under GNU General Public License version 3.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/github.com\/bioinfo-ut\/GenomeTester4\" target=\"_blank\" rel=\"noreferrer noopener\">Source and precompiled binaries<\/a><\/li>\n<\/ul>\n\n\n\n<p><\/p>\n\n\n\n<p><strong>Citation:<\/strong>\u00a0Kaplinski L, Lepamets M, Remm M. (2015). GenomeTester4: a toolkit for performing basic set operations \u2013 union, intersection and complement on k-mer lists. GigaScience; 4:58<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\">\n\n\n\n<p><strong>GenomeTester package version 1.3<\/strong><br>GenomeMasker (gmasker) masks over-represented words in the FASTA file, preventing design of primers in repeated regions.\u00a0GenomeTester (gtester) is the program that tests 1) whether PCR primers have excessive number of binding sites on template sequence and 2) how many PCR products would be amplified from the template DNA and where are they located.\u00a0Package contains also modified the PRIMER3 program (gm_primer3), to be able to design primers from lowercase-masked sequences.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/bioinfo.ut.ee\/download\/dl.php?file=10\">32-bit Linux (x86)<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/bioinfo.ut.ee\/download\/dl.php?file=22\">64-bit linux (x86)<\/a><\/li>\n\n\n\n<li><a href=\"http:\/\/bioinfo.ut.ee\/download\/dl.php?file=26\">64-bit MacOS X<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/bioinfo.ut.ee\/download\/dl.php?file=11\">Example files<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/bioinfo.ut.ee\/download\/dl.php?file=12\">README<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/bioinfo.ut.ee\/download\/dl.php?file=14\">Release notes<\/a><\/li>\n<\/ul>\n\n\n\n<p><\/p>\n\n\n\n<p><strong>Citation:<\/strong>\u00a0Andreson R, Reppo E, Kaplinski L, Remm M. (2006). GENOMEMASKER package for designing unique genomic PCR primers. BMC Bioinformatics; 7:172.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\">\n\n\n\n<p><strong>StrainSeeker<\/strong><br>StrainSeeker is a program for identification of bacterial strains from raw sequencing reads. StrainSeeker database contains custom-built phylogenetic tree of all bacterial strains and\u00a0<em>k<\/em>-mers specific to each node of the tree.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"http:\/\/bioinfo.ut.ee\/strainseeker\/index.php?r=site\/page&amp;view=downloadable\">StrainSeeker binaries and databases<\/a><\/li>\n<\/ul>\n\n\n\n<p><\/p>\n\n\n\n<p><strong>Citation<\/strong><strong>:\u00a0<\/strong>Roosaare M, Vaher M, Kaplinski L, M\u00f6ls M, Andreson R, Lepamets M, Koressaar T, Naaber P, Koljalg S, Remm M. (2016). StrainSeeker: fast identification of bacterial strains from unassembled sequencing reads using user-provided guide trees.\u00a0PeerJ\u00a05:e3353. doi: 10.7717\/peerj.3353.<\/p>\n\n\n\n<p><a href=\"http:\/\/biorxiv.org\/content\/early\/2016\/02\/19\/040261\" target=\"_blank\" rel=\"noreferrer noopener\">\u00a0<\/a><\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\">\n\n\n\n<p><strong>Primer3_masker<\/strong><br>Generates k-mer lists for masking genomic sequences with Primer3_masker. <br>The list files are available from the Primer3_masker webpage at\u00a0<a href=\"http:\/\/primer3.ut.ee\/lists.htm\" target=\"_blank\" rel=\"noreferrer noopener\">http:\/\/primer3.ut.ee\/lists.htm<\/a>.<\/p>\n\n\n\n<p><strong>Citation<\/strong>:\u00a0K\u00f5ressaar T, Lepamets M, Kaplinski L, Raime K, Andreson R, Remm M. (2018).\u00a0Primer3_masker: integrating masking of template sequence with primer design software.\u00a0Bioinformatics\u00a0<em>34(11):1937-1938. doi: 10.1093\/bioinformatics\/bty036.<\/em><\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\">\n\n\n\n<p><strong>MultiPLX version 2.0<\/strong><br>MultiPLX is a tool for analyzing PCR primer compatibility and automatically finding optimal multiplexing (grouping) solution. It uses state-of-the-art nearest neighbour DNA binding thermodynamics to estimate possible unwanted pairings between PCR samples.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/bioinfo.ut.ee\/download\/dl.php?file=23\">32-bit Linux (x86)<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/bioinfo.ut.ee\/download\/dl.php?file=24\">64-bit Linux (x86)<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/bioinfo.ut.ee\/download\/dl.php?file=25\">32-bit Windows<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/bioinfo.ut.ee\/download\/dl.php?file=27\">64-bit MacOS X<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/bioinfo.ut.ee\/download\/dl.php?file=4\">Demo primers for MultiPLX<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/bioinfo.ut.ee\/download\/dl.php?file=5\">README<\/a><\/li>\n<\/ul>\n\n\n\n<p><\/p>\n\n\n\n<p><strong>Citation<\/strong><strong>:\u00a0<\/strong>Kaplinski L, Andreson R, Puurand T, Remm M. (2005). MultiPLX: automatic grouping and evaluation of PCR primers. Bioinformatics. 2005 Apr 15;21(8):1701-2.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\">\n\n\n\n<p><strong>SLICSel\u00a0<\/strong>version 1.1<br>SLICSel is a program for designing specific oligonucleotide probes for microbial detection and identification. To obtain maximal specificity of designed oligonucleotides, SLICSel uses the Nearest-Neighbor thermodynamics-based approach for probe design.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/bioinfo.ut.ee\/download\/dl.php?file=19\">32-bit Linux (x86)<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/bioinfo.ut.ee\/download\/dl.php?file=20\">64-bit Linux (x86)<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/bioinfo.ut.ee\/download\/dl.php?file=21\">32-bit Windows<\/a><\/li>\n<\/ul>\n\n\n\n<p><br><strong>Citation:\u00a0<\/strong>Scheler O, Kaplinski L, Glynn B, Palta P, Parkel S, Toome K, Maher M, Barry T, Remm M, Kurg A. (2011). Detection of NASBA amplified bacterial tmRNA molecules on SLICSel designed microarray probes. BMC Biotechnology 2011;11:17.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\">\n\n\n\n<p><strong>FastaGrep\u00a0<\/strong>version 2.0<br>FastaGrep is a tool for searching oligonucleotide binding sites from FastA genomic sequences. It can do both match\/mismatch based and thermodynamic binding energy searches.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/bioinfo.ut.ee\/download\/dl.php?file=6\">32-bit Linux (x86)<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/bioinfo.ut.ee\/download\/dl.php?file=28\">64-bit Linux<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/bioinfo.ut.ee\/download\/dl.php?file=7\">README<\/a><\/li>\n<\/ul>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>WEB-BASED APPLICATIONSPrimer3 version 4.1.0Primer3 is the most popular software for PCR primer design. Primer3 version 0.4Older version of Primer3. PhenotypeSeekerPredicts antimicrobial resistance and other phenotypes of bacterial isolates, using WMS or WGS data. StrainSeeker 1.1Identification of bacterial species and strains &#8230;<\/p>\n","protected":false},"author":535,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_acf_changed":false,"inline_featured_image":false,"footnotes":""},"class_list":["post-59","page","type-page","status-publish","hentry"],"acf":[],"_links":{"self":[{"href":"https:\/\/sisu.ut.ee\/bioinfo\/wp-json\/wp\/v2\/pages\/59","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sisu.ut.ee\/bioinfo\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/sisu.ut.ee\/bioinfo\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/sisu.ut.ee\/bioinfo\/wp-json\/wp\/v2\/users\/535"}],"replies":[{"embeddable":true,"href":"https:\/\/sisu.ut.ee\/bioinfo\/wp-json\/wp\/v2\/comments?post=59"}],"version-history":[{"count":3,"href":"https:\/\/sisu.ut.ee\/bioinfo\/wp-json\/wp\/v2\/pages\/59\/revisions"}],"predecessor-version":[{"id":82,"href":"https:\/\/sisu.ut.ee\/bioinfo\/wp-json\/wp\/v2\/pages\/59\/revisions\/82"}],"wp:attachment":[{"href":"https:\/\/sisu.ut.ee\/bioinfo\/wp-json\/wp\/v2\/media?parent=59"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}