Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Solid summary of some powerful tools, no wonder python is the new default for academic data analysis.


Lush is an excellent platform for machine learning. There are bindings to gnuplot ,opencv, lapack, gsl, an optimization library for gradient descent, a machine learning framework, a nerual network simulator.

It also has very nice matrix and vector manipulations features built in to the language and is very easy to bind to C code.


Some people really do seem to get a lot done in Lush, so I'm not discounting its utility, but the language is sort of a mess. I took Yann's class and gave up in frustration after a few homeworks. I was very happy working in Matlab and relieved to to never see a 'bloop', 'eloop', or whatever-loop again.


Lush's purpose a little dfferent than Matlabs. The abstractions are a little lower level than Matlab for instance. But then again you you can compile your functions directly to machine code. There are trade offs to everything in life.

Matlab,Ocatave,R,S are great but if you need to be closer to the metal, Lush offers a very good compromise.


Andrew Ng, in his machine learning class [1] urges people do use Matlab instead of Python, because in his experience people develop faster with Matlab than wit any other tool/language.

Personally, I am experienced with Matlab but not so much with Python, so I am not able to judge. I definitely hate the fact that Matlab is proprietary and partly closed source. Also, I think Python syntax is much more pretty while Matlab is not even designed to be a real language. But alas, it comes with very powerful functionality out of the box.

Note, that the OP mentioned in the introduction that he had no access to Matlab over his employer or university and hence dismissed it.

[1] https://www.coursera.org/ml (one of the first videos)


Some good points for comparison:

http://www.scipy.org/NumPy_for_Matlab_Users

IMHO, it's best to prototype in Octave and then build in python. I find that the Matlab/Octave syntax is too focused on linear algebra, so it's better for small prototypes (and for people coming from non-SW fields). For big projects, I prefer the 0-based arrays, more than one function per file, and all the rest of the python goodies. I estimate that 70% of my time is usually spend preparing the data (e.g. parsing xml, or some other files, etc), for which I find python more suitable.

In fact, I usually work with them side by side, testing ideas in Octave, then implementing these pieces into a large python project.

Edit: this has also been discussed here before, e.g.

http://news.ycombinator.com/item?id=363096

http://news.ycombinator.com/item?id=689183


>Also, I think Python syntax is much more pretty while Matlab is not even designed to be a real language.

If you're only coding the core of an algorithm (rather than a full-featured library with lots of plumbing), and your logic fits naturally into Matlab's native array operators, then using Matlab is a joy.


> Andrew Ng, in his machine learning class [1] urges people do use Matlab instead of Python, because in his experience people develop faster with Matlab than wit any other tool/language.

Never trust academics when it comes to programming. :)

Seriously, Matlab might be a tiny bit better for scientific programming than Python. But: if you start building an eco system around your machine learning code (distributed evaluation of models, email reporting of results, web reporting of results, online tracking of training progress, data base related things, web services for other people, proper documentation, ...) you are happy if you chose python.

Also, there is theano for python which has auto diff, transparent GPU/CPU use and symbolic optimization. It makes your life easy if you are using complicated models.


It's too bad the OP didn't find Octave, which is an opensource Matlab clone that was also used in Andrew Ng's ML course.


I am not sure that Octave preforms as fast as MatLab; for example, I think MatLab does a better job in parallelizing non-vectorized code.


If you have taken Andrew Ng's Machine class the handwriting recognition system that is mentioned in the course was implemented in Lush. I think the original code is is even included in the demos distributed with Lush.


Word. Given Cython, Jython, etc.... this list keeps going (I use Weka in Python).


Can I get your email address? Check my profile for mine please. Thank you!


Why?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: