(no title)
antognini | 3 months ago
However the embedding space of a typical neural network that is representing the data is not a manifold. If you use ReLU activations the kinks that the ReLU function creates break the smoothness. (Though if you exclusively used a smooth activation function like the swish function you could maintain a manifold structure.)
macleginn|3 months ago
hansvm|3 months ago
- For language, individual words might be discrete, but concepts being communicated have more nuance and fill in the gaps.
- For language, even to the extent that discreteness applies, you can treat the data as being sampled from a coarser manifold and still extract a lot of meaningful structure.
- Images of cars are more continuous than you might imagine because of hue differences induced by time of day, camera lens, shadows, etc.
- Images of cars are potentially smooth even when considering shape and color discontinuities. Manifolds don't have to be globally connected. Local differentiability is usually the thing people are looking for in practical applications.