IRC logs of #shogun for Friday, 2011-07-15

--- Log opened Fri Jul 15 00:00:21 2011
-!- blackburn [~blackburn@188.122.238.99] has quit [Quit: Leaving.]00:07
-!- f-x [~user@117.192.194.58] has quit [Ping timeout: 260 seconds]00:34
-!- in3xes_ [~in3xes@180.149.49.230] has joined #shogun07:00
-!- in3xes_ [~in3xes@180.149.49.230] has quit [Quit: Leaving]07:38
[1]warpymorning08:31
[1]warpyanyone here yet ?08:31
-!- gsomix [~gsomix@85.26.232.4] has joined #shogun08:41
@sonney2k[1]warpy, yes sure :)08:45
@sonney2ksaw your email - it is a good starting point08:46
CIA-32shogun: Baozeng Ding master * ra615b6e / (8 files): add remaining kernel examples - http://bit.ly/n9YeIT08:57
CIA-32shogun: Soeren Sonnenburg master * rf511a42 / (8 files):08:57
CIA-32shogun: Merge pull request #199 from sploving/master08:57
CIA-32shogun: add remaining kernel java examples - http://bit.ly/oh4mLl08:57
[1]warpythanks09:01
[1]warpyemail me back if you need a followup09:01
[1]warpyalso daniel is doing some work this weekend09:01
@sonney2k[1]warpy, I don't have much time today - so it would be great if daniel could just get the vector / array stuff to work. matrices and all that are then probably much more easy.09:06
-!- sploving1 [~sploving@124.16.139.134] has joined #shogun09:10
sploving1hi sonney2k09:10
@sonney2khi sploving109:10
@sonney2kdo the examples work for you?09:10
@sonney2kwork as in compile at least09:11
CIA-32shogun: Soeren Sonnenburg master * r98d5021 / (2 files): fix compile errors due to *kernel SGVector changes - http://bit.ly/rd31RG09:11
sploving1javac works09:11
sploving1some examples, java not work.09:12
@sonney2kwhich don't work?09:12
@sonney2kand did you now translate all or which are still missing?09:12
sploving1still working. maybe miss about some.09:13
sploving1for exmaple, structure_dynprog_modular.py09:14
sploving1I do not know how to translate it. as it use scipy a lot09:14
@sonney2ksploving1, yeah ignore that09:14
sploving1another thing, maybe other people recently add some examples, I have not add them in time now09:15
sploving1there are 13 sparse examples also not added09:16
@sonney2kbut these you can add09:16
sploving1no. java does not support it now09:17
@sonney2kyou can write sth like09:17
@sonney2kx=SparseRealFeatures()09:17
@sonney2kx.obtain_from_dense(y)09:17
@sonney2kdoesn't matter09:17
@sonney2kyou can just write09:18
@sonney2kfeat=RealFeatures(fm_test_real);09:18
@sonney2kfeats_test=SparseRealFeatures();09:18
@sonney2kfeats_test.obtain_from_simple(feat);09:18
sploving1oh. I should add others first.09:18
@sonney2kwhich others?09:19
sploving1some missing. I am still working now09:19
-!- gsomix [~gsomix@85.26.232.4] has quit [Ping timeout: 255 seconds]09:20
sploving1serialization_svmlight_modular like this09:20
@sonney2kyeah please do so09:21
-!- sploving1 [~sploving@124.16.139.134] has left #shogun []09:22
-!- gsomix [~gsomix@85.26.232.4] has joined #shogun09:22
-!- sploving1 [~sploving@124.16.139.134] has joined #shogun09:22
sploving1sonney2k, SerializableHdf5File has the same meaning as pickle.dump?09:30
[1]warpybbl09:30
-!- [2]warpy [~warpy@bzq-79-181-19-104.red.bezeqint.net] has joined #shogun09:31
-!- [1]warpy [~warpy@bzq-79-181-19-104.red.bezeqint.net] has quit [Ping timeout: 255 seconds]09:35
-!- [2]warpy [~warpy@bzq-79-181-19-104.red.bezeqint.net] has quit [Ping timeout: 276 seconds]09:37
CIA-32shogun: Baozeng Ding master * r899890c / examples/undocumented/java_modular/serialization_svmlight_modular.java : add serialization_svmlight_modular - http://bit.ly/roMVm509:41
CIA-32shogun: Soeren Sonnenburg master * r150cddb / examples/undocumented/java_modular/serialization_svmlight_modular.java :09:41
CIA-32shogun: Merge pull request #200 from sploving/master09:41
CIA-32shogun: add serialization example - http://bit.ly/mQhAEA09:41
-!- heiko [~heiko@main.uni-duisburg.de] has joined #shogun09:42
@sonney2khi heiko :)09:44
heikohi sonney2k :)09:44
heikono sleeping at all? :)09:44
@sonney2ksleep-what?09:45
heikohehe :)09:45
bettyboointeresting09:45
heikoindeed bettyboo09:46
bettybooheiko: in the classifier_averaged_perceptron_modular.py example, why parameter_list = ((traindat,testdat,label_traindat,1.,1000,1),(traindat,testdat,label_traindat,1.,1000,1))09:46
@sonney2kheiko, now that you example works - what are your next plans?09:46
heikooh, one little exaple works09:46
heikobut i still have to put together the other ones09:46
heikoespecially the grid-search09:47
heikobut this should be straight-forward09:47
heikoso, as for future plans09:47
heikoI still have the list: more subset classes, kernel machine sv stuff, ModelSelection Parameters generic09:47
@sonney2kok and what else?09:48
@sonney2kI mean anything else you can think of?09:48
-!- warpy2 [~androirc@bzq-79-181-19-104.red.bezeqint.net] has quit [Ping timeout: 252 seconds]09:48
heiko one more thing is the seralization version stuff, but the gsoc project is ready then :)09:49
heikodo you have anything in mind?09:49
heikoI mean, still possible:09:49
sploving1sonney2k, fstream = SerializableHdf5File("blaah.h5", "w").  no such .java class?09:49
sploving1why?09:50
heikomore modelSelection algos, gradient descent, bisection09:50
@sonney2kheiko, no I am just asking what you woudl be interested in doing later09:50
heikomore splitting strategies09:50
@sonney2ksploving1, you probably dont' have hdf5 installed09:50
@sonney2kheiko, ok...09:50
@sonney2kheiko, one thing that comes to my mind is of course easy illustrative examples09:50
@sonney2kheiko, and even a more compact way to write things at least for the python side09:51
sploving1sonney2k, anotherthing, SerializableAsciiFile fstream = new SerializableAsciiFile("blaah.asc", 'w');  svm.save_serializable(fstream); but class SerializableAsciiFile have no method save_serializable09:51
heikoyes more real-life examples would be nie09:51
heikowhat do you mean with a more compact way?09:51
sploving1GMNPSVM have no method save_serializable09:52
@sonney2kheiko, the example you have is long and complex already - hard to digest for outsiders, so sth like { 'C' : [1,2,3,], 'kernel' : [PowerKernel ...] would be easier to understand09:53
@sonney2kI guess that can be done with python dictionaries and lists09:53
heikoah ok, know what you mean09:53
@sonney2kand so one could write a typemap that translates this dictionary/list into ModelSelectionParameters09:54
heikoyes, this would be quite cool09:54
@sonney2kheiko, this is actually not even so difficult once one has a way to formulate it with dictionaries/lists09:54
heikook, sounds good09:55
heikoprobably makes sense09:55
heikothe examples are very long indeed09:55
@sonney2kheiko, I guess when you do just an example where you select the C of an SVm it would be much easier to digest09:56
heikoyes, but sonney2k, the current example is only an example for a complex parameter tree09:56
heikoi am building one where only the C is selected, this is much shorter an easier09:57
heikohave a look at the create_param_tree in libshogun/modelselection_grid_search_simple.cpp example09:57
-!- alesis-novik [~alesis@188.74.87.206] has joined #shogun09:58
-!- gsomix [~gsomix@85.26.232.4] has quit [Ping timeout: 252 seconds]10:15
sploving1Exception in thread "main" java.lang.UnsatisfiedLinkError: org.shogun.PreprocessorJNI.new_SortUlongString()J10:20
sploving1at org.shogun.PreprocessorJNI.new_SortUlongString(Native Method)10:20
sploving1at org.shogun.SortUlongString.<init>(Unknown Source)10:20
sploving1at preprocessor_sortwordstring_modular.run(preprocessor_sortwordstring_modular.java:36)10:20
sploving1at preprocessor_sortwordstring_modular.main(preprocessor_sortwordstring_modular.java:58)10:20
sploving1when I compile "java preprocessor_sortwordstring_modular "10:20
heikosonney2k, how to checkout shogun-data again? forgot it :)10:23
bettyboorotfl10:23
sploving1oh. I forgot loadLibrary("Preprocessor")10:25
-!- gsomix [~gsomix@85.26.235.71] has joined #shogun10:28
heikosonney2k, ok forgot it, i found it in the mailing list :)10:30
-!- sploving1 [~sploving@124.16.139.134] has left #shogun []10:37
-!- warpy2 [~androirc@bzq-218-206-2.red.bezeqint.net] has joined #shogun10:53
warpy2back10:55
warpy2anyone here?10:55
@sonney2kheiko, OK11:05
heikosonney2k, grid-search example runs :)11:05
@sonney2kvery good11:10
heikois there any real-life data in data?11:11
heikobecause accuracy is 1.-0 for fm_train_real11:11
@sonney2kheiko, you could generate some overlapping gaussians11:12
@sonney2kactually some example is already doing this11:12
heikoanother qeustion:11:12
@sonney2kclassifier_libsvm_minimal_modular.py11:12
@sonney2kdoes this11:12
heikomost of the examples define a function which takes parameters11:12
heikodo i have to do this?11:12
heikoor can I just construct a script?11:12
warpy2soeren do i need the data repo for running  examples?11:13
heikowarpy2 yes you do11:16
heikofor most of them11:16
-!- warpy2 [~androirc@bzq-218-206-2.red.bezeqint.net] has quit [Remote host closed the connection]11:17
heikosonney2k, just thought of another features: really cool would be using multiple threads for the grid search,11:17
@sonney2kheiko, please do it in the function way - these will be automagic tests11:21
heikooh ok11:21
heikothen wait with the merge of my pull request :)11:21
heikoalso with this parameter list?11:22
@sonney2kheiko, yes11:24
heikosonney2k, pushed11:29
CIA-32shogun: Soeren Sonnenburg master * r8567b11 / (10 files in 4 dirs):11:33
CIA-32shogun: Merge pull request #201 from karlnapf/master11:33
CIA-32shogun: working grid-search example in python_modular (+9 more commits...) - http://bit.ly/nJs0U611:33
@sonney2kheiko, looks great11:34
heikosonney2k, thanks :) finally it runs :)11:34
CIA-32shogun: Baozeng Ding master * rffb86ed / (11 files): add remaining examples - http://bit.ly/p1HlKd11:48
CIA-32shogun: Soeren Sonnenburg master * r747718f / (11 files):11:48
CIA-32shogun: Merge pull request #202 from sploving/master11:48
CIA-32shogun: add all the remaining examples - http://bit.ly/qLg7kk11:48
-!- gsomix [~gsomix@85.26.235.71] has quit [Ping timeout: 255 seconds]12:13
-!- gsomix [~gsomix@85.26.234.14] has joined #shogun12:25
-!- Sabrina [~sabrina@124.16.139.134] has joined #shogun12:29
-!- Sabrina [~sabrina@124.16.139.134] has left #shogun []12:29
-!- gsomix [~gsomix@85.26.234.14] has quit [Ping timeout: 260 seconds]12:53
CIA-32shogun: Heiko Strathmann master * rb454101 / (2 files): some work/interface changes towards computing confidence intervals - http://bit.ly/ovbPTz12:56
CIA-32shogun: Heiko Strathmann master * rae9e30f / (2 files in 2 dirs): apllied interface changes in other cross-validation class - http://bit.ly/pqUe7U12:56
CIA-32shogun: Heiko Strathmann master * r11771cc / examples/undocumented/libshogun/modelselection_grid_search_simple.cpp : applied interface changes in cross-validation class - http://bit.ly/nUe4Sa12:56
CIA-32shogun: Soeren Sonnenburg master * raf60362 / (5 files in 4 dirs):12:56
CIA-32shogun: Merge pull request #203 from karlnapf/master12:56
CIA-32shogun: some work towards computing confidence intervals and more precise cross-validation - http://bit.ly/q3DhSZ12:56
-!- sploving1 [~sploving@124.16.139.134] has joined #shogun13:06
sploving1features_string_file_modular: cannot apply org.shogun.BaseFeatures : load(org.shogun.SWIGTYPE_p_shogun__CFile) to  (org.shogun.AsciiFile)13:07
sploving1sonney2k, can you fix it?13:07
@sonney2klooking at it...13:09
sploving1sonney2k, another thing: public HistogramWordStringKernel(int size, SWIGTYPE_p_shogun__CPluginEstimate pie)13:10
sploving1it should be public HistogramWordStringKernel(int size, PluginEstimate pie)13:10
sploving1sonney2k, another thing,  I think  KernelPCACut(SWIGTYPE_p_shogun__CKernel k, double thresh) should be KernelPCACut(Kernel k, double thresh).13:21
sploving1whant does this error  mean:  number of symbols in observation (512) larger than M (64)13:39
@sonney2ksploving1, does classifier_averaged_perceptron_modular.java compile for you?14:19
@sonney2kmine here says14:19
@sonney2kclassifier_averaged_perceptron_modular.java:7: cannot find symbol14:19
@sonney2ksymbol: class test14:19
@sonney2kpublic class classifier_averaged_perceptron_modular implements test {14:19
@sonney2k                                                               ^14:19
@sonney2kkernel_histogram_word_string_modular.java:45: cannot find symbol14:19
-!- cwidmer [~quassel@connect.tuebingen.mpg.de] has joined #shogun14:22
-!- f-x [~user@117.192.197.185] has joined #shogun14:37
-!- blackburn [~blackburn@188.122.238.99] has joined #shogun14:37
blackburnhave I missed anything? ;)14:37
@sonney2kblackburn, yes - more includes!14:37
blackburnmore includes??14:38
-!- in3xes_ [~in3xes@210.212.58.111] has joined #shogun14:39
blackburnone would better include everything14:39
CIA-32shogun: Soeren Sonnenburg master * rdb900c3 / (7 files): fix org.shogun.SWIGTYPE_p_shogun__CFile issue in Features - http://bit.ly/n9Y2OK14:40
CIA-32shogun: Soeren Sonnenburg master * r97083bb / examples/undocumented/java_modular/Load.java : fix warnings in Load.java - http://bit.ly/ouBAWR14:40
CIA-32shogun: Soeren Sonnenburg master * r1d5de60 / (5 files in 4 dirs): Merge branch 'master' of github.com:shogun-toolbox/shogun - http://bit.ly/n1KkSn14:40
-!- in3xes [~in3xes@180.149.49.227] has quit [Ping timeout: 258 seconds]14:41
@sonney2ksploving1, what are all these Note: distribution_histogram_modular.java uses unchecked or unsafe operations.14:42
@sonney2kNote: Recompile with -Xlint:unchecked for details.14:42
@sonney2kerrors?14:42
CIA-32shogun: Soeren Sonnenburg master * rc28e5bf / (3 files):14:45
CIA-32shogun: Merge pull request #204 from sploving/master14:45
CIA-32shogun: fix some bug - http://bit.ly/nlyDAM14:45
@sonney2kblackburn, the downside of doing this is that it will take ages to recompile - as soon as you change something somewhere *everything* needs to be recompiled14:46
blackburnphew!14:49
blackburnhuh shogun is kinda active today14:50
sploving1I have no idea. Note: Recompile with -Xlint:unchecked for details14:53
@sonney2ksploving1, then please recompile with that option...14:53
-!- mikiobraun [~mikio@squid.ml.tu-berlin.de] has joined #shogun14:57
mikiobraunbetty: hi14:57
bettyboomikiobraun: you modified something related to14:57
mikiobraunhello all. Am I late ;)14:59
@sonney2kalesis-novik, do you know where Vojtech is?15:00
sploving1mikiobraun, not yet15:00
heikohi all15:00
@sonney2kalesis-novik, ping?15:00
alesis-noviksonney2k, pong15:00
@sonney2kalesis-novik, hmmhh15:01
@sonney2kalesis-novik, Any idea if Vojtech will join today?15:01
* blackburn closes WoW and Lineage 215:01
alesis-noviksonney2k, no idea really.15:01
@sonney2kok then lets start...15:02
@sonney2kSo now that everyone of you entered the midterm evals (thanks for that btw!) lets have a look where we want to be in 1 month in the future - which is the suggested coding deadline15:03
-!- VojtechFranc [~quassel@gw-101.scnet.cz] has joined #shogun15:03
@sonney2kalright15:03
@sonney2kSo now that everyone of you entered the midterm evals (thanks for that btw!) lets have a look where we want to be in 1 month in the future - which is the suggested coding deadline15:03
@sonney2k^Vojtech15:03
@sonney2k(was the first sentence)15:03
@sonney2kSo ideally everyone would be finished with whatever was planned and we would all work on polishing shogun to get it into release ready conditions15:04
@sonney2ksuch that by the end of August we can release shogun 1.0 that includes all your contributions15:04
@sonney2kSo it would be great if any student (or mentor - one of you is sufficient) just briefly says what the current state is and what his future plans are15:06
@sonney2klet me start with heiko :)15:06
@sonney2kheiko did the model selection stuff15:06
heikome or you? :)15:06
@sonney2kand it is already working pretty well15:06
heikook go ahead :)15:06
@sonney2kheiko, please continue - you know what you want to do :)15:06
heikook then, first model selection is working from python side15:07
@sonney2kand C++ side of course :)15:07
heikoyes, a grid search model selection is implemented15:08
heikoevaluation is done via cross-validation (currently n-fold-stratified)15:08
heikoand any parameter of an CSGObject may be selected for model selection15:08
heikoso now simple parameters as SVM-C may be searched for, as well as CSGObjects, so different kernels may be chosen for example15:09
@sonney2k..and there is an example / test for that all already.15:09
@sonney2kso now about the future :)15:09
heikoyes, I plan to write some real-life examples of typical usage15:09
heikothe current example does not use all possible features15:09
heikothen, some work has to be done on the subset selection side (implement it for more classes)15:10
heikothen, currently, only float64_t and CSGObject are possible for model selection, this should be generic15:10
heikoand Soeren also suggested to have a more simple interface from the python side15:11
heikocurrently, its a bit complicated to specify a complex combination of parameters15:11
heikoyou may all have a look at the examples:15:11
heikoundocumented/libshogun/modelselection_grid_search_simple.cpp15:11
heikoundocumented/python_modular/modelselection_grid_search_simple_modulay.py15:12
heikoalso another big issue:15:12
@sonney2kok we don't need it that detailed now - just highlevel is sufficient15:12
heikocross-validation currently only works with LibLinear15:12
heikoKernel/Distance machines currently store their model by storing indices to features15:12
heikothis has to be changed, they need to store feature copies to make cross-val work15:13
heikoso much so far15:13
heikoI think last thing has highest priority now15:13
@sonney2kcertainly enough work for the next month I would say - but we will see15:13
heikoyes, indeed :)15:13
@sonney2kalright, thanks heiko15:13
cwidmerjust a side-note, I found the cross-validation stuff in scikits learn quite useful (http://scikit-learn.sourceforge.net/cross_validation.html). Might be worth a look for design considerations.15:14
@sonney2kcwidmer, yes we looked at this15:14
cwidmerok, great15:14
* blackburn thinks that chris is scikit-learn spy15:14
cwidmer;)15:14
@sonney2kI think we will create typemaps inspired by that for convenience15:14
heikoyes, we also lend some inspirations there ;)15:14
bettyboogreat15:14
@sonney2kany further comments15:15
@sonney2kif not then who wants to be next? f-x ?15:15
f-xsonney2k: sure15:15
f-xcurrent status is:15:15
f-xthe online framework is fine and working, able to load arbitrarily large datasets15:15
f-xthough optimizations will surely have to be made for speed15:16
f-xAlso, versions of SGD, LibLinear, and most of SGD-QN have also been implemented using this framework15:16
f-xThe next step would be to complete SGD-QN and begin integration of Vowpal Wabbit15:17
f-xWhile benchmarking using larger datasets at the same time15:17
-!- gsomix [~gsomix@85.26.233.175] has joined #shogun15:17
f-xPlus, examples using the newly implemented algorithms as and when they're ready15:18
f-xVW should take a good deal of time, however15:18
f-xsonney2k: Any specifics?15:18
@sonney2kf-x, I agree - I guess you shouldn't forget that a lot of time will go into benchmarking / identifying and ironing out bottlenecks15:19
@sonney2kand fixing bugs of course15:19
@sonney2kbut again certainly quite a lot of work again15:19
f-xDefinitely.15:19
@sonney2kand it is important here that we stop adding features when we start realizing that it might be too much to fully test/support/document them15:20
@sonney2kdoes anyone have questions to f-x?15:20
@sonney2kif not thanks for your work!15:20
@sonney2kand alesis-novik do you want to continue?15:20
alesis-novikSure15:21
alesis-novikEssentially, the first part of my project is done and I have implemented, tested and created examples for the simple EM for GMM15:21
@sonney2kalesis-novik, btw you wanted to send me some python examples - maybe I've missed them15:22
alesis-noviksonney2k, I'll send a pull request later today15:22
@sonney2kalesis-novik, thanks - please continue15:23
alesis-novikit uses the 3 "improvements" over the vanilla one: doing computations in log domain, having a minimal covariance to avoid singular gaussians15:23
alesis-novikand using svd to decompose covariances15:24
alesis-novikFor multivariate gaussians it also allows you to choose one of 3 types of covariances you want to use15:24
alesis-novikfull, diagonal or spherical15:24
alesis-novikThis implementation has been tested with both small examples (1d and 2d Gaussians, 200 points)15:25
alesis-novikand a large dataset with 256d Gaussians and thousands of points if I remember correctly15:26
alesis-novikCurrently I'm working on implementing a simple Split-Merge EM15:26
alesis-novikI should finish implementing it by the end of the week, then I'll test and work out any bugs15:27
alesis-novikHopefully it will be able to produce better fits and remove the problem of EM sometimes deciding to fit one Gaussian over all of the data (sometimes happens in the small examples(15:28
@sonney2kyeah it would be great to see any illustrative examples for that15:28
@sonney2kVojtechFranc, do you want to add anything?15:28
VojtechFrancjust a small comment15:29
VojtechFranc1. I would like to add some other examples on real data15:29
VojtechFranc2. if there is time it would be cool to implement support for other cov type- Factor analysis15:30
VojtechFrancotheriwise, we need testing/debuggin/tuning and writting documentation15:30
VojtechFrancthat's it15:30
@sonney2kOK, does anyone else have questiosn?15:31
alesis-novikI've been meaning to ask, what should I do about the large example? The small ones are self sufficient but the large one uses the dataset VojtechFranc provided15:31
VojtechFrancit is USPS database of handwritten characters, it is publicly available15:32
@sonney2kI guess we provide a download link then15:32
VojtechFrancI'll collect other data which can be made public15:32
blackburn+1 for USPS nums - I'll make an example for dimreduction15:32
blackburnanyway we could share some data of that kind15:32
@sonney2kif we are allowed to distribute it we can add it to shogun-data15:32
@sonney2kI fully agree15:32
blackburnjust licence issues15:33
@sonney2kwe need some real-world but intuitive data sets15:33
@sonney2kcould very well be....15:33
@sonney2kalright15:33
@sonney2ksploving1 - do you want to be next?15:33
sploving1okay15:33
sploving1I mainly finised java_modular: Now it can support jblas library well, includes vector, matrix, and string features. In addition, add java_modular 131 examples, and docs.15:34
sploving1also, it can support ujmp matrix well15:34
sploving1that is the current state15:35
@sonney2ksploving1, as far as I can see some of the examples don't compile yet though15:35
sploving1it maybe some reason by shogun15:36
sploving1some bug not fixed yet15:36
@sonney2kI guess you will need to do a bit of testing for them too (compare if python modular returns the same result)15:36
@sonney2kyes I am working on some of them15:36
@sonney2ksploving1, please continue15:37
sploving1about the testsuit, it cannot work now.15:38
sploving1I think the object need implement seriazation15:38
@sonney2ksploving1, yes lets ignore the test suite for now15:38
sploving1javac can work for examples, but java *.java not work for all. some bugs still need fix15:40
mikiobraunI'm just answering your last email. Sorry for my replying so late.15:40
sploving1it doesnot matter15:40
@sonney2ksploving1, mikiobraun so what about future?15:41
blackburn.. it will be utterly fantastic :D15:41
@sonney2kand shogun world domination of course ;-)15:42
mikiobraunExisting tests should compile and run, of course. Otherwise we should probably use the rest of the time for the Lua thing. (assuming that sploving has more fun on Lua) ;)15:42
@sonney2kIIRC sploving1 did many typemaps for lua and ruby too already - right?15:43
@sonney2kso I think they mostly need examples again and some missing typemaps15:43
@sonney2kthough this is a lot easier to do this time with a scripting language than java15:43
sploving1yeap. I would like to work on lua. The main plan is to support lua  matrix15:44
-!- f-x [~user@117.192.197.185] has quit [Ping timeout: 260 seconds]15:44
sploving1and add lua examples, docs and clean up java/lua binding15:44
blackburnman down15:44
@sonney2kmy suggestion would be to this time start earlier with adding examples15:44
@sonney2kthis way we can test for breakage more early15:44
@sonney2kjava caused a lot of unforeseen problems15:45
mikiobraunI agree15:45
@sonney2ksploving1, like when typemaps for vectors are there - illustrate how Labels() work15:45
blackburnwe will have a Kernel problem in ruby ;)15:45
blackburnI even would say the Kernel Problem15:45
@sonney2kwhen the ones for matirx are there, e.g. use gaussian kernel15:46
@sonney2kblackburn, but that is ruby :) lua first15:46
blackburnyeah I just recalled The Kernel Problem serialhex reported month ago or so15:46
@sonney2ksploving1, again a lot of work but still - it is do-able.15:47
sploving1I have no idea how to make Labels work now. I write them nearly a month ago15:47
sploving1and forgot it15:47
@sonney2ksploving1, copy and paste the example from python :)15:47
@sonney2kalright, does anyone else have questions?15:47
sploving1I suggest not write so many examples15:48
sploving1it is borning15:48
sploving1My main advantage is to write typemap15:48
sploving1there will be no much time to debug15:49
blackburnI can help writing examples if it is needed ;)15:49
blackburnor gsomix can too15:49
@sonney2ksploving1, well the examples will tell us if your typemaps work15:49
@sonney2kso we definitely will need some for each feature type15:49
@sonney2kof course lua is much more close to python15:49
@sonney2kso it will be a lot easier now15:50
sploving1thx. blackburn.15:50
@sonney2kbut sploving1, yes if you want to e.g. write more typemaps for other languages like c# instead we can arrange sth15:50
-!- in3xes_ is now known as in3xes15:51
blackburnI have an suggestion - I could finish up work with java examples15:51
@sonney2kblackburn, that would be great...15:51
blackburnokay so let sploving1 just work on typemaps15:52
@sonney2kthis way sploving1 can have some more fun with typemaps15:52
-!- f-x [~user@27.57.28.101] has joined #shogun15:52
@sonney2kso sploving1 what do you think - if you now just do lua / ruby /c# typemaps - no examples?15:52
@sonney2kbetter15:52
@sonney2k?15:52
sploving1no examples? that is impossible, how to test?15:53
blackburnwe could help with examples, you could work only on typemaps15:53
sploving1I suggest I write some of them, not all15:53
@sonney2ksploving1, well then port 1 example for each datatype - the others we will do15:53
sploving1that souds good15:53
sploving1okay.15:54
sploving1I will try my best. lua/ruby/c#15:54
@sonney2kok - is everyone (more) happy now?15:54
cwidmerone happy family15:55
blackburnmore communism for shogun15:55
@sonney2kthat's how it should be15:55
@sonney2kwhich brings us to sweet soviet russia15:55
@sonney2kblackburn, your turn15:55
blackburnokay I have finished working on three base algorithms with some modifications15:55
blackburnin particular it is15:56
blackburnLocallyLinearEmbedding15:56
blackburnClassicMDS15:56
blackburnstands for Classic Multidimensional Scaling15:56
blackburnand LandmarkMDS15:56
blackburnin addition to both of MDSs I have implemented Isomaps15:56
blackburnso, ClassicIsomap and LandmarkIsomap15:56
blackburnduring work on these algos I have notified we should use some other solver for eigenproblems15:57
blackburnin all of them eigenproblem is stated15:57
blackburnso, I have integrated ARPACK (which is considered to be fastest for that kind of eigenproblems)15:58
blackburneven faster than lobpcg15:58
blackburnmy plan for next month is to implement three variations of LLE: Hessian LLE, Modified LLE and LTSA (local tangent space)15:59
blackburnand I would like to integrate SuperLU for sparse LU factorization15:59
blackburnit is needed in shift-invert mode of ARPACK15:59
* sonney2k googles superlu16:00
blackburnmoreover, some real data examples should be good - I have been thinking about it for a while16:00
blackburnhave some ideas, will add some of them during next month16:01
cwidmerblackburn, on the bioinf side, I can provide you with some examples from biological sequences16:01
blackburnyes it would be wonderful16:01
blackburnI'm still interested if manifold learning is applicable to biological data16:02
cwidmersame here ^^16:02
@sonney2kblackburn, ok thanks...16:03
@sonney2kdoes anyone else want to comment?16:03
blackburnabout superlu16:03
blackburnit is funny why I want to do it: scikit-learn use it16:04
blackburnit seems that I'm a scikit-learn spy too16:04
cwidmer:)16:04
bettyboo:)16:04
blackburnah not scikit-learn, scipy uses it16:04
cwidmerspying seems to pay off16:04
@sonney2k:D16:05
cwidmerare there any license issues with that?16:05
blackburnwith what?16:05
cwidmersuperlu16:05
cwidmeror is it not a lib16:05
blackburnit is a lib16:05
blackburne.g. it is available even in ubuntu16:06
cwidmersounds good16:06
blackburnokay I have no idea what licence do they use16:06
@sonney2kit seems to be 3-clause BSD16:07
@sonney2kso all good16:07
blackburnah ok16:07
heikowhen its in ubuntu they probably use some gpl stuff :)16:07
blackburnnot really16:07
* sonney2k just checked from the source 16:07
@sonney2kso good to go with superlu16:07
blackburnhttp://crd.lbl.gov/~xiaoye/SuperLU/License.txt16:07
blackburnI'm not sure it will speed it up really16:07
blackburn:D16:07
blackburni.e. it will but may be only for LARGE scale16:08
@sonney2kblackburn, I mean the alternative to doing optimizations is to do other algorithms...16:08
@sonney2kbut up to you and cwidmer :)16:08
blackburnI have chosen the most essential ones ;) if we will notify any very good algo - of course it should be implemented16:09
blackburnnotice*16:09
@sonney2kOK, so we will keep you busy if we do :)16:09
bettyboohe16:09
cwidmerwe have quite a few algorithms already, but blackburn, if you still have a few on the stack, it might be worth discussing doing these before the optimizations16:09
VojtechFrancsonney2k, is there any other global issue to discuss? I'll need to go.16:09
blackburnhehe sorry my report took a while16:10
@sonney2kVojtechFranc, yes next meeting16:10
@sonney2kI would say we do it in 2 weeks from today16:11
cwidmerfine with me16:11
@sonney2khow about Wednesday Jul 27 13:00 UTC?16:11
blackburnokay16:11
heikook16:11
cwidmerk16:11
VojtechFrancok16:11
@sonney2kmikiobraun, alesis-novik sploving1 ?16:11
f-xokay16:12
sploving1no problem16:12
@sonney2kand then we see how far everyone is16:12
@sonney2kand readjust16:12
@sonney2kre-adjust16:12
alesis-novikSounds fine16:12
@sonney2kif possible I'd like to see all the big changes in the next 2 weeks16:12
@sonney2kafterwards only incremental work16:13
@sonney2ksuch that we can stabalize everything in August16:13
@sonney2k...to make a release16:13
blackburnrelease race ;)16:13
@sonney2k(e.g. documenting, examples, minor bugfixes etc all in august)16:13
@sonney2kso we will probably have to have some bug squashing party at the end of august with everyone of us in the chat for some time16:14
@sonney2kanyway that's it form my side16:14
@sonney2kany questions?16:14
@sonney2kconcerns?16:14
@sonney2kvodka related issues?16:15
blackburnI have enough vodka16:15
@sonney2kif not then thanks again for attending16:15
blackburn:D16:15
@sonney2kand keep up the good work!!!16:15
alesis-novikblackburn, if you ever have vodka quantity related issues, we are all doomed16:15
blackburnyeeeeah16:16
blackburnlet the vodka begins okay16:16
blackburn:D16:16
* gsomix away16:17
@sonney2kblackburn, regarding the java examples16:18
-!- [1]warpy [~warpy@bzq-79-181-19-104.red.bezeqint.net] has joined #shogun16:18
alesis-novikit would probably mean that the Russian economy has collapsed. I remember seeing an ad where they use vodka to pay taxes in czar Russia16:18
blackburnhehe16:18
VojtechFrancso, I need to go. bye16:18
bettyboozzz16:18
-!- VojtechFranc [~quassel@gw-101.scnet.cz] has quit [Remote host closed the connection]16:18
@sonney2kVojtechFranc, cu!16:18
blackburnin fact russian economy is more related to oil and gas16:19
CIA-32shogun: Soeren Sonnenburg master * r377c55b / (8 files): fix a few more java examples ... even more includes - http://bit.ly/qR5IQE16:19
cwidmerok, I  gotta go, happy hacking!16:19
CIA-32shogun: Soeren Sonnenburg master * rd531d4c / (3 files): Merge branch 'master' of github.com:shogun-toolbox/shogun - http://bit.ly/omeALF16:19
blackburncwidmer: see ya16:19
@sonney2kCIA-32, cu16:19
@sonney2kcwidmer, cu16:19
blackburnsonney2k: CIA-32 is not leaving16:19
blackburnsonney2k: what about java examples?16:19
-!- CIA-32 was kicked from #shogun by sonney2k [CIA-32]16:19
@sonney2khe is :D16:19
blackburnoh16:19
-!- CIA-20 [~CIA@cia.atheme.org] has joined #shogun16:19
blackburn:D16:19
blackburnkiller16:20
@sonney2kI knew that this would happen16:20
@sonney2ksome of the java examples currently issue a couple of warnings or don't compile16:20
@sonney2kso just fix those16:20
blackburnokay, I'll take a look16:21
@sonney2kthen we still need to run every example one-by-one (that works using sploving1 check.sh script)16:21
@sonney2kand at some point create functions in similar spirit to java_modular16:21
@sonney2kthough I am not so sure about serialization at the moment...16:21
@sonney2kbecause I had the following idea16:22
@sonney2kone could use python to write down the serialized shogun objects16:22
@sonney2kand nothing more16:22
-!- cwidmer [~quassel@connect.tuebingen.mpg.de] has quit [Remote host closed the connection]16:22
@sonney2kif we do that, we could serialize the shogun objects from java too16:22
@sonney2kand load the ones from python and compare...16:22
blackburnI think there is no problem with serialization16:23
@sonney2kfor now I would just check if e.g. kernel matrices are the same by just printing them16:23
@sonney2kblackburn, in which respect?16:23
blackburnI mean it is possible to compare python and java outputs16:24
-!- f-x` [~user@117.192.203.179] has joined #shogun16:24
blackburnnow compiling java_modular :)16:25
alesis-novikblackburn, I don't know why I decided to look this up, but "The taxes on vodka became a key element of government finances in Tsarist Russia, providing at times up to 40% of state revenue."16:26
blackburnhehe16:26
blackburnna samom dele eto strashno :)16:26
-!- f-x [~user@27.57.28.101] has quit [Ping timeout: 260 seconds]16:26
blackburnsonney2k: will you have some fancy lecture about shogun 1.0? ;) just like for older ones16:28
f-x`sonney2k: any ideas about the SGD-QN?16:28
@sonney2kf-x`, lets ask the mighty masters, leaon and antoine16:29
@sonney2kleon16:29
@sonney2kf-x`, I will email them16:29
f-x`sonney2k: cool :) thanks!16:29
@sonney2kf-x`, I know them pretty well so they might even respond :D16:30
heikosonney2k, does shogun actually use boost?16:33
@sonney2kheiko, no16:33
f-x`great :) they should surely know what modifications will be needed for online behaviour. i'm done implementing most of the functions except that determine_t0()16:33
heikoand you probably do not want to add it ? :)16:33
@sonney2kheiko, boost is a huge dependency so one has to be very careful about that16:33
@sonney2kheiko, what do you need?16:33
heikook then I'll compute confidence intervals myself16:33
@sonney2kf-x`, sent the email16:33
heikobut now, off for some coffee :)16:34
heikosee you later16:34
@sonney2kcoffee?! vodka!16:34
blackburncoffee is not that shoguneers drink16:35
-!- mikiobraun [~mikio@squid.ml.tu-berlin.de] has left #shogun ["Leaving."]16:39
-!- sploving1 [~sploving@124.16.139.134] has left #shogun []16:41
@sonney2kblackburn, do you see a way to modify the SGObject.java to have an implements serializable?16:47
blackburnI have to take a look on it first ;)16:47
blackburnsonney2k: where it is?16:49
@sonney2kblackburn, recall that we can pickle objects from python?16:50
blackburnyes16:50
@sonney2kwouldn't it be cool to have serializable shogun objects in java too?16:50
blackburnI mean where to find SGObject.java??16:50
@sonney2kso all that is needed for that is that the generated SGObject.java derives from serializable16:50
blackburnфр16:51
blackburnah16:51
@sonney2kblackburn, in java_modular/org/shogun16:51
@sonney2k(org/shogun is being generated)16:51
blackburnah okay, compiling16:52
f-x`sonney2k: do you know of any scripts to readily convert between common input formats? like convert.py from the lsl site did from dense->SVMLight16:54
f-x`would be great for dense<->libsvm<->svmlight etc16:54
@sonney2kf-x`, you could use shogun for that, i.e. load in svmlight format, convert to dense features and write out16:56
@sonney2kand vice versa16:56
@sonney2kblackburn there seems to be a "%pragma(java) jniclassinterfaces=x"16:56
blackburnno idea wtf is it :D16:57
@sonney2kI will have a look at this - would be too cute if that worked :)16:59
@sonney2k%pragma(java) moduleimports=%{17:05
@sonney2k    import java.io.*; // For Serializable17:05
@sonney2k%}17:05
@sonney2k%pragma(java) jniclassinterfaces="Serializable"17:05
@sonney2k%pragma(java) moduleinterfaces="Serializable"17:05
@sonney2klets see what happens17:05
@sonney2kkaboom of course17:09
blackburnI guess readObject and writeObject should be implemented somehow..17:12
blackburnhow it works with python?17:12
@sonney2kblackburn, yes sure17:19
@sonney2kthat would be easy17:19
@sonney2kthe only problem is that we need SGObject to implement Serializable17:19
blackburnokay compiles now17:19
@sonney2kand currently I don't see how to do this (if not using sed  on the swig'd file)17:19
@sonney2kthe  jniclassinterface indeed did add a implements17:20
@sonney2kbut not sure what the *.java class has to say about this17:20
@sonney2kanyway17:20
@sonney2kgtg now17:20
@sonney2kl8r17:20
blackburnokay17:20
blackburnsee you17:20
-!- in3xes is now known as gpk17:33
-!- gpk is now known as in3xes17:33
-!- heiko [~heiko@main.uni-duisburg.de] has quit [Ping timeout: 258 seconds]18:31
-!- gsomix [~gsomix@85.26.233.175] has quit [Ping timeout: 255 seconds]18:33
blackburnsonney2k: have problem with java modular installation :)18:43
bettyboonot bad18:43
blackburnbettyboo: it is bad18:43
bettybooblackburn: ah, you mean that some kernels are 'bad'? sure it is18:43
-!- gsomix [~gsomix@85.26.232.71] has joined #shogun18:45
blackburninstall -m755 java_modular/shogun.jar "//share/java"19:03
blackburninstall: cannot stat `java_modular/shogun.jar': No such file or directory19:03
blackburnsonney2k: ^^ am I doing something wrong?19:03
-!- gsomix [~gsomix@85.26.232.71] has quit [Ping timeout: 255 seconds]19:07
-!- gsomix [~gsomix@85.26.232.71] has joined #shogun19:09
-!- blackburn1 [~blackburn@188.122.238.99] has joined #shogun19:36
-!- blackburn [~blackburn@188.122.238.99] has quit [Ping timeout: 255 seconds]19:38
-!- blackburn1 [~blackburn@188.122.238.99] has quit [Client Quit]19:38
-!- blackburn [~blackburn@188.122.238.99] has joined #shogun19:38
-!- [1]warpy [~warpy@bzq-79-181-19-104.red.bezeqint.net] has quit [Ping timeout: 246 seconds]20:16
-!- gsomix [~gsomix@85.26.232.71] has quit [Ping timeout: 250 seconds]21:17
-!- in3xes_ [~in3xes@180.149.49.230] has joined #shogun21:41
-!- warpy2 [~androirc@bzq-219-227-204.pop.bezeqint.net] has joined #shogun21:52
CIA-20shogun: Sergey Lisitsyn master * r32a8bd0 / (12 files in 4 dirs): Renamed and cleaned PCA - http://bit.ly/pdXwNP22:04
-!- in3xes_ [~in3xes@180.149.49.230] has quit [Quit: Leaving]22:14
-!- warpy2 [~androirc@bzq-219-227-204.pop.bezeqint.net] has quit [Read error: Connection reset by peer]22:36
-!- in3xes_ [~in3xes@180.149.49.230] has joined #shogun22:50
-!- CIA-87 [~CIA@cia.atheme.org] has joined #shogun23:01
-!- Netsplit *.net <-> *.split quits: CIA-2023:07
-!- [1]warpy [~warpy@bzq-79-181-19-104.red.bezeqint.net] has joined #shogun23:09
-!- f-x` [~user@117.192.203.179] has quit [Read error: Connection reset by peer]23:33
-!- f-x` [~user@117.192.203.179] has joined #shogun23:42
-!- in3xes_ [~in3xes@180.149.49.230] has quit [Quit: Leaving]23:43
-!- Todd_ [4526dd82@gateway/web/freenode/ip.69.38.221.130] has joined #shogun23:48
Todd_Hi-I am organizer for the San Francisco Data Mining meetup...was wondering to any of you (contributors to shogun) are located near San Francisco, and would be interested in giving a talk about shogun to our group on Aug 16?23:50
blackburnHi, Todd_23:51
blackburnI don't really know but I guess no, most of us are located in europe23:52
Todd_thats too bad...I'm a fan of the project...thanks anyhow!23:53
blackburnTodd_: you could leave some email or so, I'll ask 'a boss', sonney2k23:53
Todd_that would be great...my email is todd.holloway@gmail.com23:53
blackburnokay, thanks for interest anyway :)23:54
-!- Todd_ [4526dd82@gateway/web/freenode/ip.69.38.221.130] has quit [Quit: Page closed]23:54
blackburnserialhex: give a talk at san francisco :D23:55
--- Log closed Sat Jul 16 00:00:25 2011

Generated by irclog2html.py 2.10.0 by Marius Gedminas - find it at mg.pov.lt!