Open source tools for open data
Louhos: Since 2011 I have been developing open source tools for processing, analyzing and visualizing open data with a few Finnish friends in the Louhos project. It all started with our sorvi R package winning the Apps4Finland 2011 data opening category. Since then, our blog (in Finnish; migrated from Wordpress in June 2014) has received more than 30000 views and we’ve won also other prices in Apps4Finland.
rOpenGov: Currently we are actively developing the rOpenGov R ecosystem for open goverment data. We already have a lot of packages involved all around Europe and the US, and even more are being developed. In Finland, the original sorvi package has been split to several more focused packages. Personally I am most actively developing the helsinki and gisfin packages.
R packages for open data
- helsinki - Helsinki open data R tools
- gisfin - Finland GIS R tools
- sorvi - Finnish open government data R tools
Check out R-bloggers for interesting blogs related to the R language!
- Datademo funding for the helsinki and gisfin packages. Details (in Finnish) here and here. I was also interviewed in Citiscope and in a video.
- Apps4Finland 2013: Third place in the visualization category with the Alueellinen hyvinvointi-visualization that was developed together with Juha Leppänen and Satu Korhonen from Demos Helsinki think tank. Details (in Finnish).
- Apps4Finland 2012: Victory in the data opening category (official and audience votes): Data elections (Datavaalit). Details (in Finnish).
- Apps4Finland 2012: Victory in the data guide category (audience votes): Finnish data wiki. Details (in Finnish).
- Apps4Finland 2011: Victory in the data opening category (both official and audience votes): sorvi - R toolbox for open data. Details (in Finnish).
Open knowledges cover a wide set of topics from open data and science to democracy. Open Knowledge Finland (OKFFI) was founded few years ago to represent the emerging Finnish openness community. It works through working groups, and I’m also actively involved in some of these as listed below.
My Data has emerged as an important topic on the side of open data. The working group recently produced a My data survey that clarifies the related concepts nicely. An English version is coming in the future. I have also developed some code for analyzing personal data from various sources.
Open science - the openness of research publications, data and code - is a really hot topic now. Problems in research reproducibility have raised the need for better and more open ways to conduct science.
Research on machine learning and bioinformatics
I finished my PhD studies in Aalto University in 2014. In my thesis I developed machine learning methods for molecular biology and medicine applications.
- ProbCMap - Source code related to article Probabilistic drug connectivity mapping
- ICMg - Interation component model for gene modules. R packge related to article Searching for functional gene modules with interaction component models. ICMg is also included in the Netresponse package.
- Eyediagram - Code for creating eyediagram visualizations
Here some selected publications. See all of my publications in Google Scholar.
Comprehensive data-driven analysis of the impact of chemoinformatic structure on the genome-wide biological response profiles of cancer cells to 1159 drugs
Suleiman A Khan, Ali Faisal, John P Mpindi, Juuso A Parkkinen, Tuomo Kalliokoski, Antti Poso, Olli P Kallioniemi, Krister Wennerberg and Samuel Kaski
BMC Bioinformatics 2012, 13:112
Graph Visualization With Latent Variable Models
Juuso A Parkkinen, Kristian Nybo, Jaakko Peltonen and Samuel Kaski
In MLG ‘10: Proceedings of the Eighth Workshop on Mining and Learning with Graphs. Washington, D.C. 2010
Äänenkorkeuden havaitsemisen hermostollinen perusta
Bachelor’s thesis (in Finnish)
Helsinki University of Technology, Department of Electrical and Communications Engineering