tkyjonathan | 6 years ago | on: We Need DevOps for ML Data
tkyjonathan's comments
tkyjonathan | 6 years ago | on: AI startups raised $6.9B in Q1 2020
If you don't have some AI in your company, you won't get investors.
tkyjonathan | 6 years ago | on: Rules for Data Modeling with DynamoDB
But forget to do normalisation and you will be paying 5 figures a month on your AWS RDS server.
"Storage is cheap as can be, while compute is at a premium."
This person fundamentally does not understand databases. Compute has almost nothing to do with the data layer - or at least, if your DB is maxing on CPU, then something is wrong like a missing index. And for storage, its not like you are just keeping old movies on your old hard disk - you are actively accessing that data.
It would be more correct to say: Disk storage is cheap, but SDRAM cache is x1000 more expensive.
The main issue with databases is IO and the more data you have to read, process and keep in cache, the slower your database becomes. Relational or non-relation still follows these rules of physics.
tkyjonathan | 6 years ago | on: Ask HN: Who wants to be hired? (April 2020)
tkyjonathan | 6 years ago | on: Ask HN: Who wants to be hired right now?
Remote: Yes
Willing to relocate: No
Technologies: Data Architect, Data Modeling, Data Performance Engineer, SQL, MySQL, Percona, MariaDB, Galera, Postgres, Kafka, ElasticSearch, RDS, Redshift, Infobright, Java, Python, R.
Resume/CV: https://www.linkedin.com/in/jonathanlevin/
Email: [email protected]
tkyjonathan | 6 years ago | on: Smart Algorithms beat Hardware Acceleration for Large-Scale Deep Learning
We need to get back to forming algorithms as well as concepts and first principles. We cannot and should not expect ML to brute force finding patterns and just sit back and relax.
Here is another prediction for you: we will not solve ray-tracing in games and movie CGI with more hardware. We will need some algorithm that gets us 80-90% of the way there in a smart way.
tkyjonathan | 6 years ago | on: Millions of Tiny Databases
/s
tkyjonathan | 6 years ago | on: The boss who put everyone on 70K
37signals would be proud.
If you decentivize the most productive people, then you might hurt the rest of the company.
tkyjonathan | 6 years ago | on: We use Kubernetes and spot instances to reduce EC2 billing up to 80%
tkyjonathan | 6 years ago | on: Domain Logic and SQL (2003)
Its also important to remember that in databases, you are more often optimising for IO usage than CPU.
tkyjonathan | 6 years ago | on: Domain Logic and SQL (2003)
tkyjonathan | 6 years ago | on: Domain Logic and SQL (2003)
Real life example for a regulatory batch job:
6 tomcat servers + 1 RDS. 30X lines of code + UTs in java. 30+ minutes time.
In SQL No tomcat servers + 1 RDS instance. 1X lines of code(SQL) + UTs (in java). 3+ minutes
Here is a good book on it: https://www.amazon.co.uk/Relational-Database-Programming-Set...
tkyjonathan | 6 years ago | on: Facebook asks for a moat of regulations it already meets
tkyjonathan | 6 years ago | on: Japan has 33k businesses at least a century old
tkyjonathan | 6 years ago | on: My productivity app for the past 12 years has been a single .txt file
tkyjonathan | 6 years ago | on: Philosophy of Architecture (2015)
tkyjonathan | 6 years ago | on: The Rise of SQL-Based Data Modeling
tkyjonathan | 6 years ago | on: The Rise of SQL-Based Data Modeling
tkyjonathan | 6 years ago | on: 'Cancel Culture' Comes to Science?
tkyjonathan | 6 years ago | on: ClickHouse cost-efficiency in action: analyzing 500B rows on an Intel NUC