SirOibaf | 2 years ago | on: S3 is files, but not a filesystem
SirOibaf's comments
SirOibaf | 3 years ago | on: EU Open Web Search project kicked off
SirOibaf | 3 years ago | on: Mikhail Gorbachev has died
SirOibaf | 4 years ago | on: Fisher Price Chatter Telephone with Bluetooth
SirOibaf | 4 years ago | on: Ask HN: Who is hiring? (August 2021)
Logical Clocks is the company behind Hopsworks. Hopsworks is an open-source feature store that allows teams to organize and scale their feature engineering efforts. (https://hopsworks.ai) We recently raised our Series A (5M euro) and are expanding our teams. We are looking for a mid/senior frontend developer. Our frontend stack includes TypeScript, React, and Redux.
Your responsibilities will be to lead the development of the new Hopsworks frontend, lead technical discussions and decisions, and help expand the UI/UX team.
If you want to apply or have any questions, you can reach me at fabio [at] logicalclocks [dot] com or through our website: https://www.logicalclocks.com/job-positions/senior-front-end...
SirOibaf | 4 years ago | on: This Website is hosted on an Casio fx-9750GII Calculator
SirOibaf | 4 years ago | on: The cost of cloud, a trillion dollar paradox
SirOibaf | 5 years ago | on: HopsFS: 100x Times Faster Than AWS S3
`doesn’t fully support regular file system semantics or consistency guarantees (e.g. atomic rename of directories, mutual exclusion of open exclusive, append to file requires rewriting the whole file and no hard links).`
HopsFS does provide strongly consistent metadata operations like atomic directory rename, which is essential if you are running frameworks like Apache Spark.
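To illustrate why atomic directory rename matters for frameworks like Spark, here is a minimal sketch of the commit-by-rename pattern on a local POSIX filesystem. This is not HopsFS or Spark code; `commit_output` and the `_temporary_` prefix are hypothetical names chosen for the example. On an object store without atomic rename, the final step would instead be a per-object copy that readers could observe half-finished.

```python
import os
import tempfile


def commit_output(task_files, final_dir):
    """Write all output files to a temporary directory, then publish it
    with a single directory rename (hypothetical helper, for illustration)."""
    parent = os.path.dirname(os.path.abspath(final_dir))
    # Stage the output next to the final location so the rename stays
    # within one filesystem.
    tmp_dir = tempfile.mkdtemp(prefix="_temporary_", dir=parent)
    for name, data in task_files.items():
        with open(os.path.join(tmp_dir, name), "w") as f:
            f.write(data)
    # On a POSIX filesystem this rename is atomic: readers see either
    # no final_dir at all or a complete one, never a partial one.
    os.rename(tmp_dir, final_dir)
    return final_dir
```

A job would call `commit_output({"part-0000": "..."}, "/data/out")` once all of its output is written, so downstream readers never pick up a partially written directory.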
SirOibaf | 5 years ago | on: HopsFS: 100x Times Faster Than AWS S3
It's conceptually similar to EMR in the way it works. You connect your AWS account and we'll deploy a cluster there. HopsFS will run on top of an S3 bucket in your organization. You get a fully featured Spark environment (with metrics and logging included, no need for CloudWatch), a UI with Jupyter notebooks, the Hopsworks feature store, and ML capabilities that EMR does not provide.
SirOibaf | 5 years ago | on: HopsFS: 100x Times Faster Than AWS S3
There are other papers that describe the HopsFS architecture in more detail if you are interested: https://www.usenix.org/system/files/conference/fast17/fast17...
SirOibaf | 5 years ago | on: How to build your own feature store for ML
In the Hopsworks Feature Store we group features together in feature groups. Feature groups can then be joined to create training datasets. (You can also select a subset of features from a feature group.) Training datasets are stored in an ML-framework-friendly format (e.g. TFRecords if you are using TensorFlow) and you can feed them directly to your model.
If you are interested, we have a longer blog post explaining the core concepts of the Hopsworks feature store: https://www.logicalclocks.com/blog/feature-store-the-missing...
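As a rough illustration of the idea of joining feature groups into a training dataset, here is a plain-Python sketch. It is not the Hopsworks API: `join_feature_groups`, the column names, and the sample rows are all made up for the example, and feature groups are modelled simply as lists of dict rows.

```python
def join_feature_groups(left, right, key, features=None):
    """Inner-join two feature groups (lists of dict rows) on a key column.

    If `features` is given, keep only the key plus that subset of features.
    Hypothetical helper for illustration, not a Hopsworks function.
    """
    right_by_key = {row[key]: row for row in right}
    joined = []
    for row in left:
        match = right_by_key.get(row[key])
        if match is None:
            continue  # inner join: drop rows without a partner
        merged = {**row, **match}
        if features is not None:
            merged = {k: merged[k] for k in [key, *features]}
        joined.append(merged)
    return joined


# Two made-up feature groups sharing the key "customer_id".
customers = [
    {"customer_id": 1, "age": 34},
    {"customer_id": 2, "age": 51},
]
transactions = [
    {"customer_id": 1, "avg_spend": 20.5, "n_orders": 3},
    {"customer_id": 3, "avg_spend": 7.0, "n_orders": 1},
]

# Join the groups and select a subset of features for training.
training_rows = join_feature_groups(
    customers, transactions, "customer_id", features=["age", "avg_spend"]
)
```

In a real feature store the resulting rows would then be written out in a framework-specific format such as TFRecords; that step is left out here.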
SirOibaf | 5 years ago | on: We Need DevOps for ML Data
SirOibaf | 7 years ago | on: Integrating NVMe Disks in HopsFS (HDFS)
Interesting solution presented here - keep the block size constant, but put the small files on NVMe disks.
You can read the research paper here if you are curious: https://www.hopsworks.ai/research-papers/hopsfs-s3-extending...
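A minimal sketch of the placement idea described above, assuming a simple size threshold: files below the threshold go to NVMe, while everything else is split into fixed-size blocks. The 64 KB threshold, the 128 MB block size, and the `choose_placement` function are assumptions for illustration, not values or code taken from HopsFS.

```python
SMALL_FILE_THRESHOLD = 64 * 1024   # assumed cutoff for "small" files, in bytes
BLOCK_SIZE = 128 * 1024 * 1024     # assumed constant block size, in bytes


def choose_placement(file_size):
    """Return (tier, number_of_blocks) for a file of `file_size` bytes.

    Small files are kept whole on NVMe and use no regular blocks;
    larger files are split into fixed-size blocks on block storage.
    """
    if file_size < SMALL_FILE_THRESHOLD:
        return ("nvme", 0)
    blocks = -(-file_size // BLOCK_SIZE)  # ceiling division
    return ("block_storage", blocks)
```

The point of the design is that the block size never has to shrink to accommodate small files; only the routing decision depends on file size.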