nicktorba | 4 years ago | on: Are We Really Engineers? (2021)
nicktorba's comments
nicktorba | 4 years ago | on: Are We Really Engineers? (2021)
If building software was expensive like engineering in the physical world, we wouldn't have to have this argument. It's too bad that physical engineering is so expensive, because we'd probably have a lot more people trying it out.
nicktorba | 4 years ago | on: Are We Really Engineers? (2021)
A great example from the second post in the series is how software people don't have experience with the unpredictability of traditional engineering projects: "Part of this misconception comes from us seeing the results of the engineering process, not the process itself. We don’t see the friction, the overruns, the delays that happen because someone built the wall one inch to the left, or when a critical supplier goes out of business. To assume that software is uniquely unpredictable is a special kind of arrogance."
This has me wondering why the software world is so unpredictable in the first place and why we aren't working on that? Or at least getting better at dealing with it. Also, why are we so inclined to think the physical world is so predictable? Probably because we spend so much time on our computers...
excited to finish the rest of this series
nicktorba | 4 years ago | on: On Building Glue Systems
For example, there are plenty of tools to create ML model endpoints (send data, receive prediction) but many times, those models are not customer/user facing. They will be hit by some other program/system (often on a regularly timed schedule).
One solution is to embed the model in the application that needs the predictions, but this can get hairy when it comes to updates/maintanence/etc.
Over the next few years, we will see some generalized "model invocation" components that integrate with common data sources for these use cases.
You can see an inkling of this with the seldon-core kafka integration: https://docs.seldon.io/projects/seldon-core/en/stable/stream...
nicktorba | 4 years ago | on: Ask HN: Why is Confluence Wiki Search so bad?
nicktorba | 4 years ago | on: Minus
i do like the sentiment behind it though
nicktorba | 4 years ago | on: Minus
nicktorba | 4 years ago | on: How I Experience Web Today
nicktorba | 4 years ago | on: Ask HN: Favorite Blogs by Individuals?
nicktorba | 5 years ago | on: Pony – High-Performance Safe Actor Programming
Turns out an easy google search answered my question as well... this post https://medium.com/distributed-computing-with-ray/ray-for-th... does say that ray uses the actor model, and ray is listed on The Actor Model wikipedia page as an Actor Model library.
nicktorba | 5 years ago | on: Pony – High-Performance Safe Actor Programming
If anyone from the Pony teams sees this, are actors in Pony conceptually similar to actors created with ray?
nicktorba | 5 years ago | on: Snapchat is a CRM for college kids
Snapping a new person is much lower pressure than texting them.
Pictures also make it much easier for people to be involved, because you don't need to be so descriptive with your writing. That is part of the reason snap groups tend to be so much more active on snap than text groups.
nicktorba | 5 years ago | on: If it will matter after today, don't talk about it in a chat room
How can we transform chats into a reasonable, permanent reference materials?
Maybe it would be possible to build a slack-bot that could identify which part of these threads are the most important, and even move them to a more permanent storage.
Or, slack channels could add functionality to allow users to vote for which messages are most pertinent, or should be moved to longer-term storage or a featured position for everyone.
I think there is probably value in these rapid fire IM's, but I feel the Mike's (the author's) pain. I skip over threads everyday because reading them never feels worth it.
nicktorba | 5 years ago | on: The Great Software Stagnation
It makes sense that progress cycles from a big breakthroughs to years of seeing how far we can push it.
Even so, the world is significantly different than it was in ‘96. “Stagnation” doesn’t feel like the right word.
What about advances in quantum computing? Is that not a large enough paradigm shift for the author to acknowledge?
nicktorba | 5 years ago | on: Talking out loud to yourself is a technology for thinking
I did not expect the personal thought benefits that come from speaking to yourself in long form. After starting to send recordings to friends, I quickly became more comfortable speaking for long periods and found it led to more interesting ideas and better understanding of my thoughts and ideas. Plus, it helped me better structure thoughts verbally. I've also started making a recording right after I listen to any podcast. It has made listening to them much more engaging.
I was also surprised by how energizing recording audio is. I find myself refreshed after going for a walk and recording my thoughts, or creating a recording for a friend (much more so than if I listened to a podcast on the same walk, or just walked in silence). It's also a fantastic way to make time in the car creative and productive.
Voice recording or self talking is definitely an underutilized tool for thought and creation.
nicktorba | 5 years ago | on: Netflix's Metaflow: Reproducible machine learning pipelines
Based on this thread, the comparison should include
* metaflow (model training on AWS Batch) * polyaxon (model training on kubernetes) * pachyderm (experimentation) * hopsworks (model training/serving/ and more, mostly on kubernetes) * cortex (model serving on kubernetes) * seldon-core (model serving and monitoring on kubernetes)
and likely more that I missed.
I can see why it would be so hard to put together this comparison.
Even with all these tools, there is still a lot of manual work for data scientists or DevOps engineers the data scientists pass their models off to.
It also seems there is yet to be a fully open source DevOps stack. Most companies still build custom software to glue together manual processes (like integrations between different tools for training, deploying, monitoring, etc). This could be one factor why more comparisons of these tools and stack discussions have not been more popular - they can't share them yet.
nicktorba | 5 years ago | on: Netflix's Metaflow: Reproducible machine learning pipelines
I've played with cortex before, and it is easy to use, but I am still questionable if automating kubernetes deployments through an easy code interface, without much kubernetes know-how, is safe.
In my experience, even when you have a tool automating a lot of kubernetes for you, you will still run into trouble that will be best handled if you are familiar with kubernetes. I'm not sure what debugging utilities cortex has, but I think the ultimate solution to this problem will be a tool that truly allows users to not think about the fact their deployments are running on kubernetes at all.
I'm also interested in the similarities of Cortex and Seldon-core. Of course, seldon-core does not automate infra provisioning, but based on my previous point, I think many teams are better off being more hands on with this infra.
Lastly, there is a third tool missing from the mix - monitoring. I think cortex offers some tools in this area, but I wish they would make a part two showing how the monitoring functionality they offer can integrate into a retraining pipeline within metaflow. This post shows you how to get started, but it doesn't show you how to maintain applications long term.