Blog Archive

Check out all blog posts in my blog archive. Click on a headline to read the teaser.

Bioinformatics › Simulating Protein-Ligand Complexes using Open Source tools
With the rise of computational modelling in the biotech industry, an emphasis is being put on the development of open-source toolkits for the simulation of drug-like molecular compounds in complex with proteins, fueling a rapid increase in computationally aided drug design. However, these workflows are often computationally expensive and are being migrated to the cloud. So what does cloud based drug discovery look like? Read More ›

Data Science › An investigation on wine
Can machine learning determine whether a wine is red or white based on its chemical composition? Can it predict whether a wine is good or bad? Using a dataset of 6,497 wines from the UCI Machine Learning Repository, this project explored wine classification and quality prediction. Binary Logistic Regression acheived a 97.3% accuracy in distinguishing between red and white wines. Further investigation using bayesian inference revealed optimal citric acid levels of 0.29 and 0.34 g.dm⁻³. Read More ›

Photochemistry › The Nuclear Ensemble Approach
The Nuclear Ensemble Approach (NEA) is a computational chemistry approach used in photochemistry to model photoabsorption spectra and simulate nonadiabatic dynamics. By generating a representative set of molecular geometries (ensemble) from a Boltzmann-weighted distribution, NEM captures the effects of vibrational broadening and conformational flexibility on electronic transitions. Recent advancements, such as simulated annealing and optimal sampling, improve efficiency by reducing redundant configurations, enhancing accuracy in predicting UV-Vis absorption spectra and excited-state properties of gas-phase molecules and biomolecules. Read More ›

Atmospheric Chemistry › AtmoSpec: A Revolutionary Tool for Atmospheric Chemists
AtmoSpec is an innovative UV/Vis spectroscopy tool designed to streamline the calculation of photoabsorption cross-sections for volatile organic compounds (VOCs). Developed by the InSilico Photochemistry group at the University of Bristol, AtmoSpec enables users to input a molecule's SMILES code, select a theoretical framework, and perform quantum chemistry calculations using ORCA. Read More ›

Masters project - Universtiy of Bristol › Representative sampling in the Nuclear Ensemble Approach - full report
Comprehensive analysis of unknown datasets › Biscuit dunking analysis report
A deep learning data mining project. Read More ›

An interactive graphing module › Atmospec data visualization module report
A cloud based processing approach › ATLAS data processing report
A demonstration of scalability potential using cloud technologies Read More ›

Web development › This Website
This website is an ongoing project, developing my abities in HTML and CSS. It aims to showcase my work, and help me track my development as a programmer and a chemist. Read More ›

Data Science › Representative Samping for the Nuclear Ensemble Approach
The Nuclear Ensemble Method allows for exploring UV vis spectroscopy properties of small molecules accurately, however, it suffers from high computational burden due to large ensemble sizes, which become increasingly limiting with larger, more complex organic compounds. As such, methods for reducing intensity of calculations have been proposed. The Representative sampling method, aiming to reduce the subset while maintaining a high quality spectrum, was implemented as an automated workflow using Python. Read More ›

Cloud paradigms › Improving scalability using cloud paradigms
Scaling an ATLAS data processing script, going from a monolithic script to a containerized, service-based architecture. Functionality demonstrated locally using Docker swarm, with dynamic scaling of containers for fetching and processing tasks. Read More ›

Data Science › Biscuit Dunking Data Analysis
How do you dunk your biscuit? Everyone seems to have an opinion, sometimes a strong one. Well it turns out that science does too, the Washburn equation for capillary action, has been shown to have great applicability to biscuits! This analysis presents a comprehensive description of biscuit dunking data to distinguish and predict brand of biscuit, as well as soaking time using physical data. Read More ›

Data Visualization › Atmospec Visualization Module
An interactive data visualization module, allowing for the exploration of photolysis properties in Atmospec. The photolysis rate constant was calculated using dynamic inputs from the user, and the graph is updated in real-time, allowing for researchers to interact with their results, and explore changes in data. Read More ›

Increasing Efficiency › Accelerating Python code
Exploring tools such as OpenMP, MPI and C++, as well as Python modules Cython and Numba to increase the efficiency of the computationally intensive Lebwhol-Lasher fluid model coded in Python Read More ›

Computational Chemistry › Predicting molecular properties: Solubility
Save the tedious lab work! Read More ›