Projects

Here you can find my repositories; some of them are projects worth mentioning. Some of these projects are part of my daily work, while others I enjoy developing in my own time. They are not in chronological or any other particular order. Each title links to the project's webpage or repository.

Bioconductor stats

Purpose: Analyze Bioconductor packages by their downloads, IPs and support questions.

This project compares how well the packages in Bioconductor work, tracks the rise and fall of package usage, and looks at how much having good support affects it.

WGCNA

Purpose: Adding examples, tests and using roxygen in the WGCNA package.

WGCNA lets you perform weighted correlation network analysis. However, it doesn't have tests, and I sometimes found the documentation confusing (although abundant). With this project I learned how to contribute to a project (avoid big modifications!) and how to read someone else's code.

topGO

Purpose: Correcting some errors in the topGO package of Bioconductor.

topGO is the only tool I found that takes into account the DAG structure of GO to calculate the enrichment scores. This is a fork of a previous version, where I tried to update the package and understand how it is implemented in order to correct bugs. The author's GitHub page for the package is here.

BioCor

Purpose: A package to calculate functional similarities through pathways.

I developed this package as part of my master's thesis. You can find how to use it in one vignette and its applications in another. Some other ideas for working with the package can be found here.

GSEAdv

Purpose: A package to analyse gene set collections.

It comes from the previous comments and holds lots of code to analyze, simulate and compare different gene set collections; see also the blog post about the history of the package. Using this software to compare pathway databases will probably require another article.

BaseSet

Purpose: A package to store set collections.

After finding some limitations in the GSEAdv project, I started this project to test how sets could be stored efficiently. Later I found that a new system for holding gene sets was being considered. My idea is to end up merging the GSEAdv methods into BaseSet. In this project I am using NSE (or tidy evaluation) for the first time. Some discussion and considerations about the new gene set class are being written here.

GOSemSim

Purpose: Calculate similarities between genes using gene ontologies

I contributed by speeding up calculations in several functions and improving the implementation of the algorithms. I am officially a contributor to the official repository.

BrokerMaster

Purpose: A website to link projects and investigators with master's students to improve the process in our master's program.

This was a project (for a subject in the master's program) to learn how to use databases and create websites.

Analysis of the thyroid carcinoma data from the TCGA project

Purpose: Find out why women have thyroid carcinoma more often.

This analysis was part of a subject for my master's. We (it was teamwork) explored several methods of batch correction and contrast. The repository with the data and output is here.

Rhea

Purpose: Fix bugs in the main repository.

We are using this software as part of a collaboration. As it aims to be user-friendly, I introduce changes to help my teammates use it.

Thesis

Purpose: Write my thesis along the way, instead of waiting until the final months.

My thesis is about data integration: how the microbiome is related to the transcriptome in inflammatory bowel disease.

experDesign

Purpose: A package for preparing the samples to be sequenced in batches

It divides the samples into equal batches and provides methods to select samples randomly while keeping the batches balanced. The name comes from expert design and experiment design.
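
To illustrate the idea of random but balanced batches, here is a minimal sketch in Python. It is not the algorithm experDesign implements, and the function name `balanced_batches`, the `"case"`/`"control"` labels and the sample ids are all made up for this example. It assumes a single categorical variable per sample: samples are shuffled within each group and then dealt out round-robin across the batches.

```python
import random
from collections import Counter, defaultdict


def balanced_batches(samples, n_batches, seed=None):
    """Assign (sample_id, group) pairs to n_batches batches.

    Hypothetical helper for illustration only. Samples are shuffled within
    each group and dealt out round-robin, so batch sizes differ by at most
    one and each group's count differs by at most one between batches.
    """
    rng = random.Random(seed)
    by_group = defaultdict(list)
    for sample_id, group in samples:
        by_group[group].append(sample_id)

    batches = [[] for _ in range(n_batches)]
    position = 0  # keep dealing where the previous group stopped
    for group in sorted(by_group):
        ids = by_group[group]
        rng.shuffle(ids)  # random order within the group
        for sample_id in ids:
            batches[position % n_batches].append((sample_id, group))
            position += 1
    return batches


# Made-up data: 12 samples, 4 controls and 8 cases, split into 3 batches.
samples = [(f"s{i}", "control" if i % 3 == 0 else "case") for i in range(12)]
batches = balanced_batches(samples, n_batches=3, seed=1)
for batch in batches:
    print(len(batch), Counter(group for _, group in batch))
```

With several variables to balance at once (for example sex, age and treatment), a simple round-robin like this is no longer enough, which is presumably where a dedicated package becomes useful.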

RGCCA

Purpose: Better understand the implemented methods and improve their efficiency.

I used a fork of the CRAN version to add tests to the package, speed up the functions and add some checks of the input parameters (using this fork has already saved me some time). I also added a function to check an input parameter.

BOE

Purpose: Help retrieve the data from the official journal of the government in Spain

It uses the government's API to retrieve the publications and tidies the output for easy exploration. This project was meant to help a journalist who reads the relevant news each day. I ended up creating a website with some of my analysis: Histórico del BOE.