antixk's comments
antixk | 1 year ago | on: Start presentations on the second slide
antixk | 3 years ago | on: PyTorch 2.0
[0] https://pytorch.org/tutorials/beginner/deeplabv3_on_ios.html
antixk | 3 years ago | on: A Master Perfumer's Reflections on Patchouli and Vetiver
antixk | 5 years ago | on: Deep learning model compression methods
antixk | 5 years ago | on: Microsoft Coffee
antixk | 5 years ago | on: Matrix multiplication inches closer to mythic goal
[0] https://nla-group.org/2020/07/21/numerical-behaviour-of-tens...
antixk | 5 years ago | on: An Elementary Introduction to Information Geometry [pdf]
Now, what's all this got to do with information? Information is usually represented in terms of statistical distributions, following Shannon's information theory. What the early founders of IG observed is that these statistical distributions can be represented as points on a curved space called a statistical manifold. All the familiar quantities of information theory can then be reinterpreted in geometric terms.
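As a toy illustration of "distributions as points" (my own example, not from the linked paper), the closeness of two Gaussian distributions can be measured with the KL divergence, using the standard closed-form expression for Gaussians:

```python
import math

def kl_gaussian(mu1, sigma1, mu2, sigma2):
    """KL divergence KL( N(mu1, sigma1^2) || N(mu2, sigma2^2) ), closed form."""
    return (math.log(sigma2 / sigma1)
            + (sigma1**2 + (mu1 - mu2)**2) / (2 * sigma2**2)
            - 0.5)

print(kl_gaussian(0.0, 1.0, 0.0, 1.0))  # 0.0: identical distributions
print(kl_gaussian(1.0, 1.0, 0.0, 1.0))  # 0.5: shifting the mean moves the "point"
```

Note that KL divergence is not symmetric, so it is not a true distance; one motivation for the geometric view is that the Fisher information metric gives the manifold a proper local notion of distance.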
So, why is it so exciting? Well, in deep learning people predominantly work with statistical distributions, some without even realising it. Our optimizations involve reducing the distance between statistical distributions, such as the distribution of the data and the distribution that the neural network is trying to model. It turns out that when you do this optimization in ordinary parameter space, you get the gradient descent that we all know and love. Gradient-based methods only use local approximations of the geometry: the gradient (local slope) and the Hessian (local quadratic approximation of the curvature). Optimization on the statistical manifold instead accounts for the curvature of the space of distributions itself, via the Fisher information metric, and can therefore be more efficient. This method is called Natural Gradient.
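Here is a minimal sketch of the natural gradient idea, assuming a 1-D Gaussian model with known sigma (a hypothetical example of mine, not from the paper): preconditioning the ordinary log-likelihood gradient by the inverse Fisher information turns one step into an exact jump to the maximum-likelihood estimate.

```python
# Natural gradient toy demo: estimate the mean mu of N(mu, sigma^2)
# from data, with sigma known (hypothetical example).
data = [1.2, 0.8, 1.5, 0.9, 1.1]
sigma = 1.0
mu = 0.0  # initial guess

n = len(data)
# Ordinary gradient of the log-likelihood with respect to mu:
grad = sum(x - mu for x in data) / sigma**2
# Fisher information for mu is n / sigma^2; the natural gradient
# preconditions the ordinary gradient by its inverse.
fisher = n / sigma**2
natural_grad = grad / fisher

# With step size 1, a single natural-gradient step lands exactly
# on the maximum-likelihood estimate (the sample mean).
mu = mu + 1.0 * natural_grad
print(mu)  # equals sum(data) / len(data)
```

In this toy case the Fisher matrix is a scalar; in a neural network it is a large matrix, which is why practical natural-gradient methods rely on approximations to it.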
Hope this helps.
antixk | 6 years ago | on: Show HN: Squirrel Curve Studio – A simple tool to design spline curves