In case you are a talented student interested in a summer project, we are looking for you! In Summer of Code 2013 we aim at:
Soeren Sonnenburg, Gunnar Raetsch, Sebastian Henschel, Christian Widmer, Jonas Behr, Alexander Zien, Fabio de Bona, Alexander Binder, Christian Gehl, and Vojtech Franc. The SHOGUN Machine Learning Toolbox. Journal of Machine Learning Research, 11:1799-1802, June 2010.
Stay tuned on Melange for news regarding the accepted organizations (announcement on April 8th). In the meantime, you can check out our ideas list and see if you find something interesting. Of course, do not hesitate to contact us if you have a cool idea for a project of your own; we may well be interested in it.
We are looking for interested students to join us in improving the SHOGUN machine learning toolbox in this year's Google Summer of Code.
The application deadline is May 3, 19:00 UTC, and the program will run from May to the end of September (cf. http://www.google-melange.com/gsoc/events/google/gsoc2013 ).
Google Summer of Code is a global program that offers students stipends ($5000 per student) to write code for open source projects.
SHOGUN is a machine learning toolbox, which is designed for unified large-scale learning for a broad range of feature types and learning settings. It offers a considerable number of machine learning models such as support vector machines for classification and regression, hidden Markov models, multiple kernel learning, linear discriminant analysis, linear programming machines, and perceptrons. Most of the specific algorithms are able to deal with several different data classes, including dense and sparse vectors and sequences using floating point or discrete data types. We have used this toolbox in several applications from computational biology, some of them coming with no less than 10 million training examples and others with 7 billion test examples. With more than a thousand installations worldwide, SHOGUN is already widely adopted in the machine learning community and beyond.
SHOGUN is implemented in C++ and interfaces to all important languages like MATLAB, R, Octave, Python, Lua, Java, C#, Ruby and has a stand-alone command line interface. The source code is freely available under the GNU General Public License, Version 3 at http://www.shogun-toolbox.org.
During Summer of Code 2013 we intend to improve the accessibility of the library. Additionally we will be complementing it with promising (new) machine learning algorithms.
If you have further questions, don't hesitate to ask on the SHOGUN mailing list (firstname.lastname@example.org; please note that you have to be subscribed in order to post) or on the irc.freenode.net channel #shogun.
Almost a month has passed since SHOGUN was accepted for Google Summer of Code 2013. The student application deadline was today (May 6), and SHOGUN received 57 proposals from 52 students. This is quite an increase compared to 2012 (48 applications from 38 students). What is interesting, though, is that it didn't look that good in the very beginning (see the figure below):
Compared to 2012, this curve is much flatter in the beginning but increases exponentially towards the end. Why is that? We didn't change the way we engaged with students (even though we tried to improve the instructions and added lots of entrance-tagged tasks to the GitHub issues). We still require patches to be submitted for a student to even be considered. So it is similarly tough to get into GSoC 2013 with us as it was in the previous year.
What is interesting, though, is that various organizations complained about a slow uptake in the beginning. It turns out that Google lowered the maximum number of applications per student from 20 (last year) to 5 (in 2013). This might explain the shape of the curve: students are more cautious about applying, but once the deadline is near they apply to the maximum of 5 to improve their chances. This goes hand in hand with the observation that the quality of newly submitted student applications tends to decrease towards the deadline.
So did this new limit hurt? On the contrary! In the end the quality of proposals increased a lot, and we were able to start scoring and ranking students well before the student application deadline. We are happy to have many very strong candidates again this year. Let's hope we get enough slots to accommodate all of the excellent students, and then let's start the fun :)