Python-based bioinformatics application for DNA motif analysis in nucleotide sequences with IUPAC motif support, FASTA parsing, NCBI integration and data visualization.
DNA Motif Analyzer is a desktop bioinformatics application developed in Python for identifying and analyzing nucleotide motifs in DNA sequences.
The application enables:
- loading and validation of FASTA sequence files
- motif searching with full IUPAC nucleotide code support
- quantitative analysis of motif occurrences
- visualization of results using heatmaps and bar plots
- export of results to CSV, HTML and PDF formats
- retrieval of nucleotide sequences directly from the NCBI database using accession numbers or UIDs
The project combines biological sequence analysis with modular software architecture and graphical data visualization.
- IUPAC nucleotide code support (e.g. N, R, Y)
- overlapping motif matching
- normalized motif occurrence analysis (per 1000 nt)
- graphical user interface (Tkinter)
- export of analytical results and visualizations
- integration with NCBI sequence retrieval
- matrix-based representation of motif analysis results
- Python 3
- Tkinter
- NumPy
- Matplotlib
- Requests
- NCBI E-utilities API
main.py– application entry pointgui_app.py– graphical user interfaceanalysis_engine.py– motif analysis enginefasta_parser.py– FASTA parsing and validationiupac.py– IUPAC motif matching logicexport_manager.py– export functionalityexport_tab.py– export GUI modulencbi_client.py– communication with NCBI databasefigures/– application screenshots and visualizationsresults/– example exported analysis reports
- Python 3.10+
Required libraries:
numpy
matplotlib
requests
pip install -r requirements.txtpython main.py- Load FASTA sequences or retrieve data from NCBI
- Define nucleotide motifs (e.g. ATG, TATAAA)
- Run motif analysis
- Review graphical and tabular results
- Export results and visualizations
This project demonstrates:
- biological sequence analysis
- motif searching algorithms
- FASTA data processing
- IUPAC ambiguity handling
- bioinformatics software development
- scientific data visualization
- integration with biological databases
- modular Python application architecture
Example exported reports generated by the application are available in:
results/
The repository includes example HTML reports demonstrating motif analysis and export functionality.
Alicja Stachura-Matyjewicz



