IRC logs of #shogun for Sunday, 2013-06-02

--- Log opened Sun Jun 02 00:00:27 2013
-!- hushell [~hushell@c-67-189-100-116.hsd1.or.comcast.net] has left #shogun ["WeeChat 0.3.7"]04:51
-!- zxtx [~zv@cpe-75-83-151-252.socal.res.rr.com] has joined #shogun07:41
-!- Netsplit *.net <-> *.split quits: flxb_08:10
-!- Netsplit over, joins: flxb_08:21
-!- nube [~rho@49.244.74.63] has quit [Ping timeout: 276 seconds]08:33
-!- Netsplit *.net <-> *.split quits: flxb_08:36
-!- nube [~rho@49.244.80.228] has joined #shogun08:47
-!- Netsplit over, joins: flxb_08:49
-!- lambday [67157d37@gateway/web/cgi-irc/kiwiirc.com/ip.103.21.125.55] has joined #shogun09:04
lambdaygood morning sonney2k09:16
lambdaygood morning (in Hindi) lisitsyn :P09:16
-!- gsomix [~gsomix@83.234.54.61] has joined #shogun09:19
gsomixgood morning09:19
-!- sanyam [uid10602@gateway/web/irccloud.com/x-cgbsdbswjbrxmjzw] has quit [Quit: Connection closed for inactivity]09:22
lambdaygsomix: morning :)09:27
-!- foulwall [~foulwall@2001:da8:215:503:c5a2:4030:17a3:c8ce] has joined #shogun09:39
@lisitsynlambday: yes lambday exactly!09:43
gsomixlisitsyn, hi, listing.09:46
lambdaylisitsyn: :P09:49
@lisitsyngsomix: hi mogsix09:49
gsomixlisitsyn, lol http://www.freelancer.com/u/mogsix.html09:51
@lisitsynshalom09:52
* gsomix sings "hava nagila, hava nagila"09:59
-!- iglesiasg [d58f3264@gateway/web/freenode/ip.213.143.50.100] has joined #shogun10:20
-!- mode/#shogun [+o iglesiasg] by ChanServ10:21
-!- foulwall [~foulwall@2001:da8:215:503:c5a2:4030:17a3:c8ce] has quit [Remote host closed the connection]10:31
gsomixsonney2k, around?10:32
-!- vgorbati [~vgorbati@212.2.159.34] has joined #shogun10:48
-!- ipondering [~iponderin@202.112.128.135] has joined #shogun10:57
-!- ipondering [~iponderin@202.112.128.135] has quit [Client Quit]11:01
-!- ipondering [~iponderin@202.112.128.135] has joined #shogun11:02
vgorbatilisitsyn: hello, present?11:17
@lisitsynvgorbati: yes11:17
vgorbatilisitsyn: well, at least on the datasets I had, the algorithm performed better than a simple NN and GMDH :) I guess I will use it as a baseline: if some very complex algorithm that involves a lot of cool math has worse performance than this one, then I probably won't use it :)11:21
@lisitsynvgorbati: the one with derivatives?11:21
vgorbatilisitsyn: yes11:21
@lisitsynvgorbati: good11:22
vgorbatilisitsyn: it usually does not grow beyond the 1st derivative, but this is still enough to beat the cool NNs and GMDH :) (for the data I have)11:23
@lisitsynvgorbati: which NN do you compare to?11:24
vgorbatilisitsyn: multi-layer perceptron, built-in matlab11:24
@lisitsynI see11:24
vgorbatilisitsyn: it is pretty good there, they use levenberg-marquardt training method, logistic hidden layer, random division of data set and all that stuff11:26
vgorbatilisitsyn: by default11:26
@lisitsynvgorbati: I see11:26
@lisitsynvgorbati: that's good for you then :)11:26
vgorbatilisitsyn: :) btw, if, say, I have such parameters of SVM as: support vectors, weight vector, bias, kernel function, scaling parameters and classes for support vectors, then I can calc output as:11:29
vgorbatilisitsyn: sign(sum_over_i(w_i*output_i*kernel_func(supp_vec_i, scaling(input_vector))) - bias)11:29
vgorbatiright?11:29
@lisitsynwhat is output_i?11:30
vgorbati-1 or 1 for support vector at index i11:30
@lisitsynto avoid confusion rather use \alpha_i to denote support vector coefficient11:30
vgorbatibut is the overall formula right?11:31
@lisitsyn\sign (\sum_i \alpha_i k(x_i, x_input) - b)11:31
@lisitsynthere is no need to put a sign of support vector there actually11:31
@lisitsynthey can go to alphas11:31
@lisitsynvgorbati: http://latex.codecogs.com/gif.latex?f(x_{input})%20=%20\mathrm{sign}%20\left(\sum_i%20\alpha_i%20k(x_i,%20x_{input})%20-%20b\right)11:32
@lisitsynvgorbati: I mean it is conventional that alphas are to be signed11:34
@lisitsynvgorbati: I guess you might have seen such notation in vorontsov's lectures11:35
@lisitsynit is more strict actually11:35
@lisitsynas alphas are lagrange multipliers and they are positive by definition11:35
@lisitsynnon-negative i mean11:36
vgorbatilisitsyn: yeah, I got it, thanks very much :) for better forecasting performance, I tried to get not the class label (-1 or 1) from svmclassify, but the actual value under the sign function, and I failed at it, because matlab probably does it the way you have described: it stores the alphas signed, and when I also multiplied by class_i, I got totally crap results11:37
@lisitsynvgorbati: hmm why do you use {-1,1}?11:38
vgorbatilisitsyn: what is wrong with -1 or 1?11:38
@lisitsynvgorbati: SVM is easily transformed to SVR11:38
@lisitsynwhere targets are real-valued11:38
@lisitsynvgorbati: you predict real values right?11:39
vgorbatilisitsyn: the SVMs predict only the sign of the derivative11:39
@lisitsynah11:39
vgorbatilisitsyn: so, -1 or 1 is actually quite suited for this purpose11:39
vgorbatilisitsyn: I then just multiply by it11:39
@lisitsynI see11:40
@lisitsynyes then it is ok11:40
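The decision function from the exchange above, sign(sum_i alpha_i k(x_i, x_input) - b), can be sketched in a few lines. This is only an illustration under stated assumptions: the RBF kernel, its `gamma` value, and all function names are hypothetical and are not taken from MATLAB or Shogun. The point it shows is that once the class labels are folded into the coefficients, the labels must not be multiplied in a second time.

```python
import math


def rbf_kernel(x, y, gamma=0.5):
    """Gaussian kernel; the kernel choice and gamma are assumptions for illustration."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, y)))


def fold_labels(multipliers, labels):
    """Turn non-negative Lagrange multipliers and labels in {-1, +1} into signed alphas."""
    return [m * y for m, y in zip(multipliers, labels)]


def decision_value(support_vectors, signed_alphas, bias, x, kernel=rbf_kernel):
    """Value under the sign function: sum_i alpha_i * k(x_i, x) - b."""
    return sum(a * kernel(sv, x) for sv, a in zip(support_vectors, signed_alphas)) - bias


def predict(support_vectors, signed_alphas, bias, x, kernel=rbf_kernel):
    """Class label in {-1, +1}; a value of exactly 0 is mapped to +1 by convention here."""
    return 1 if decision_value(support_vectors, signed_alphas, bias, x, kernel) >= 0 else -1


# Toy model with two support vectors, one per class.
support_vectors = [(0.0, 0.0), (2.0, 2.0)]
signed_alphas = fold_labels([1.0, 1.0], [1, -1])
bias = 0.0
label_near_first = predict(support_vectors, signed_alphas, bias, (0.1, 0.0))
```

If a library already stores signed coefficients, `fold_labels` is skipped and the stored values go straight into `decision_value`; applying the labels again flips the sign of every negative-class term.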
gsomixsonney2k, so, CircularBuffer and LineReader are being tested now. the line counter takes ~3.3 sec for data/asp/Human.dat11:40
gsomixsonney2k, the code still needs docs and cleanup. I'll send a PR in the evening.11:41
gsomixsonney2k, now I want to try to prepare for a pass-fail exam, digital signal processing. cu later.11:44
-!- gsomix [~gsomix@83.234.54.61] has quit [Quit: Leaving]11:44
-!- lambday [67157d37@gateway/web/cgi-irc/kiwiirc.com/ip.103.21.125.55] has quit [Quit: http://www.kiwiirc.com/ - A hand crafted IRC client]11:44
-!- ipondering [~iponderin@202.112.128.135] has quit [Quit: ipondering]11:46
-!- gsomix [~Miranda@83.234.54.61] has joined #shogun11:47
-!- vgorbati_ [~vgorbati@212.2.159.34] has joined #shogun11:53
-!- vgorbati [~vgorbati@212.2.159.34] has quit [Ping timeout: 246 seconds]11:55
-!- vgorbati_ is now known as vgorbati11:55
-!- vgorbati [~vgorbati@212.2.159.34] has quit [Read error: Connection reset by peer]12:43
-!- vgorbati [~vgorbati@212.2.159.34] has joined #shogun12:43
-!- lambday [67157d37@gateway/web/cgi-irc/kiwiirc.com/ip.103.21.125.55] has joined #shogun13:02
lambdaythere are some files (SGSparseVector.cpp, SGMatrix.cpp) which are missing the GPL header13:16
lambdayI'm adding complex64_t support for those.. should I add the header too?13:17
-!- vgorbati [~vgorbati@212.2.159.34] has quit [Read error: Connection reset by peer]13:21
-!- vgorbati [~vgorbati@212.2.159.34] has joined #shogun13:22
-!- vgorbati_ [~vgorbati@212.2.159.34] has joined #shogun13:35
-!- vgorbati [~vgorbati@212.2.159.34] has quit [Ping timeout: 264 seconds]13:36
-!- vgorbati_ is now known as vgorbati13:36
-!- iglesiasg [d58f3264@gateway/web/freenode/ip.213.143.50.100] has quit [Ping timeout: 250 seconds]14:04
@lisitsynlambday: yeah please add one14:54
-!- lambday [67157d37@gateway/web/cgi-irc/kiwiirc.com/ip.103.21.125.55] has quit [Quit: http://www.kiwiirc.com/ - A hand crafted IRC client]15:01
-!- vgorbati_ [~vgorbati@212.2.159.34] has joined #shogun15:58
-!- vgorbati [~vgorbati@212.2.159.34] has quit [Ping timeout: 252 seconds]15:59
-!- vgorbati_ is now known as vgorbati15:59
-!- vgorbati_ [~vgorbati@212.2.159.34] has joined #shogun16:32
-!- vgorbati [~vgorbati@212.2.159.34] has quit [Ping timeout: 245 seconds]16:32
-!- vgorbati_ is now known as vgorbati16:32
-!- votjakovr [~votjakovr@host-46-241-3-209.bbcustomer.zsttk.net] has joined #shogun16:36
-!- iglesiasg [~iglesiasg@213.143.50.76] has joined #shogun16:45
-!- mode/#shogun [+o iglesiasg] by ChanServ16:45
-!- gsomix [~Miranda@83.234.54.61] has quit [Quit: Miranda IM! Smaller, Faster, Easier. http://miranda-im.org]17:21
-!- lisitsyn [~blackburn@109-226-97-52.clients.tlt.100megabit.ru] has left #shogun []17:22
-!- iglesiasg [~iglesiasg@213.143.50.76] has quit [Ping timeout: 252 seconds]17:38
-!- iglesiasg [~iglesiasg@213.143.50.111] has joined #shogun18:05
-!- mode/#shogun [+o iglesiasg] by ChanServ18:05
-!- vgorbati [~vgorbati@212.2.159.34] has quit [Quit: vgorbati]18:20
-!- n4nd0 [~iglesiasg@213.143.50.81] has joined #shogun18:50
-!- iglesiasg [~iglesiasg@213.143.50.111] has quit [Ping timeout: 245 seconds]18:53
-!- n4nd0 is now known as iglesiasg18:54
-!- mode/#shogun [+o iglesiasg] by ChanServ18:55
-!- iglesiasg [~iglesiasg@213.143.50.81] has quit [Ping timeout: 252 seconds]19:32
-!- van51 [~van51@athedsl-320452.home.otenet.gr] has joined #shogun20:45
-!- gsomix [~gsomix@46.20.65.107] has joined #shogun20:50
gsomixgood evening20:50
van51hello20:52
-!- votjakovr [~votjakovr@host-46-241-3-209.bbcustomer.zsttk.net] has quit [Quit: Leaving]21:20
@sonney2kgsomix, hey there21:51
gsomixsonney2k, hey21:51
@sonney2kvan51, did you meet already?21:51
van51sonney2k: hey, no21:51
van51they said tomorrow21:51
van51if I want to perform some file I/O21:55
van51should I use classes like SimpleFile?21:55
@sonney2kvan51, what file i/o?21:57
@sonney2kvan51, ascii? then gsomix' thing21:58
@sonney2kvan51, and what for?21:58
@sonney2kvan51, ohh btw - can you do the combineddotfeature?21:58
@sonney2ks21:58
van51sonney2k: a step in the hashing trick is about tokenizing text documents22:00
van51sonney2k: do what about it? :)22:00
gsomixsonney2k, btw, I'm preparing PR.22:02
@sonney2kvan51, errm convert from list -> dynarray22:05
van51sonney2k: yea sure22:06
@sonney2kvan51, regarding tokenizing22:06
@sonney2kgsomix just wrote a fast line reader which you can use for tokenizing too22:06
@sonney2kvan51, so basically you can give it stop symbols and then say 'give me next'22:06
van51it's in AsciiFile, right?22:06
@sonney2kand then you can do whatever22:06
@sonney2kvan51, no will be a new file22:07
@sonney2kerr class22:07
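The "give it stop symbols and then say 'give me next'" interface described above could look roughly like the following. This is a hedged sketch only: the class name `DelimitedReader` and the method `next_token` are hypothetical, and nothing here reflects the actual LineReader in gsomix's PR. It works on an in-memory string, so it makes no assumption about where the data came from.

```python
class DelimitedReader:
    """Hypothetical 'give me next' reader over an in-memory string."""

    def __init__(self, data, stop_symbols):
        self._data = data
        self._stops = frozenset(stop_symbols)  # characters that end a token
        self._pos = 0

    def next_token(self):
        """Return the next run of non-stop characters, or None when exhausted."""
        data, stops, n = self._data, self._stops, len(self._data)
        # Skip any stop symbols in front of the next token.
        while self._pos < n and data[self._pos] in stops:
            self._pos += 1
        if self._pos >= n:
            return None
        start = self._pos
        # Advance to the next stop symbol or the end of the data.
        while self._pos < n and data[self._pos] not in stops:
            self._pos += 1
        return data[start:self._pos]


reader = DelimitedReader("first line\nsecond line\n", "\n")
first = reader.next_token()
```

Passing `"\n"` as the stop symbols gives line reading; passing whitespace and punctuation gives word tokenizing with the same class.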
van51is it already integrated in the development branch?22:07
@sonney2kvan51, gsomix just said he is preparing the PR22:08
@sonney2kvan51, but I would focus on doing it in a way that you have a string in memory22:08
@sonney2kbecause then this will work offline and online22:08
van51sonney2k: ok, I was just wondering if maybe there was a skeleton or an API already22:08
van51yea this would be just a front-end22:08
van51I think in order to compare the results of the soon to be created class22:09
@sonney2kvan51, well I would do the tokenizer by creating a char[256] array with stop tokens tagged and then just go through the string char by char, computing the hash of words22:11
@sonney2kvan51, this is something you could already do now22:12
van51sonney2k: yea I thought so too22:12
van51I was just wondering whether to use shogun classes for IO or from the standard libraries22:13
@sonney2kvan51, no I mean totally w/o IO22:15
@sonney2kvan51, just consider you get a SGString<char>22:15
@sonney2kas input and operate on that22:16
@sonney2kthen it does not matter if it is a string or sth else22:16
@sonney2kI mean input comes from a file22:16
van51sonney2k: ok got it! thanks for the suggestion22:18
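The tokenizer suggested above (a 256-entry table with stop tokens tagged, then one pass over the string, hashing each word as it goes) can be sketched as follows. The chat does not name a hash function, so 32-bit FNV-1a is used here purely as an assumption; the function names are hypothetical as well. The input is a `bytes` object, standing in for the in-memory string, so no file I/O is involved.

```python
FNV_OFFSET = 0x811C9DC5  # 32-bit FNV-1a offset basis
FNV_PRIME = 0x01000193   # 32-bit FNV prime


def make_stop_table(stop_bytes):
    """Build a 256-entry lookup table; True marks a byte that separates words."""
    table = [False] * 256
    for b in stop_bytes:
        table[b] = True
    return table


def hash_tokens(text, stop_table):
    """Walk the bytes once and return one FNV-1a hash per word, in order."""
    hashes = []
    h = FNV_OFFSET
    in_word = False
    for b in text:
        if stop_table[b]:
            if in_word:
                # A stop byte closes the current word: emit its hash and reset.
                hashes.append(h)
                h = FNV_OFFSET
                in_word = False
        else:
            # Fold the byte into the running hash of the current word.
            h = ((h ^ b) * FNV_PRIME) & 0xFFFFFFFF
            in_word = True
    if in_word:
        hashes.append(h)  # last word has no trailing stop byte
    return hashes


stops = make_stop_table(b" \t\n.,;")
word_hashes = hash_tokens(b"the cat, the hat.", stops)
```

Because the hash is updated byte by byte, no word is ever copied out of the buffer; for the hashing trick each value would then be reduced modulo the chosen feature dimension.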
van51i'll work a bit on combineddotfeatures and look into it after that22:18
@sonney2kvan51, the combineddotfeatures should be a piece of cake for you now, given your combinedfeatures experience22:25
van51sonney2k: yea I guess :p22:43
van51sonney2k: are the iterators from this class used?22:51
-!- hushell [~hushell@8-92.ptpg.oregonstate.edu] has joined #shogun23:07
van51nvm23:09
gsomixsonney2k, done. I have just updated the PR.23:31
gsomixsonney2k, I'm a little unhappy with my code, because it's ugly and over, I think. a hard review is needed. :)23:32
-!- nube [~rho@49.244.80.228] has quit [Ping timeout: 264 seconds]23:33
gsomixsonney2k, btw the PR is still WIP. maybe I can find some bugs tomorrow... and then...23:33
* gsomix z-z-z23:34
gsomixsonney2k, hope I'm not too slow. :(23:35
gsomixnite guys23:42
van51gsomix: cu23:48
-!- nube [~rho@49.244.100.165] has joined #shogun23:48
--- Log closed Mon Jun 03 00:00:29 2013
