jmcminis's comments

jmcminis | 2 years ago | on: Show HN: LLMs can generate valid JSON 100% of the time

Are there edge cases here due to context length?

1. I have a json schema with required fields. I complete the json, but do not include the required fields.

2. I run out of token from the model before I finish the json object because I'm in the middle of some deep, nested structure.

These seem solvable, just edge cases to control for by either reserving tokens, randomly generating required tokens until completing the json, or something more sophisticated.

jmcminis | 2 years ago | on: How to scale LLMs better with an alternative to transformers

Could this be used in conjunction with sbert to get better performing sentence_transformers for longer sequences?

jmcminis | 3 years ago | on: Show HN: "GPT Take the Wheel" – write what you want, let GPT implement it

Super fun!

jmcminis | 3 years ago | on: Ask HN: Best resources to learn how to use OpenAI for your startup?

Here are a few things that might help

General advice:

People + AI Guidebook - A toolkit for teams building human-centered AI products. https://pair.withgoogle.com/guidebook/

LLM advice:

Tips to improve prompt and answer quality. https://github.com/openai/openai-cookbook/blob/main/techniqu...

I wrote a short overview of some of the LLM Application development tools and platforms that might be helpful: https://mcminis1.github.io/jekyll/update/2023/01/23/llm-land...

jmcminis | 3 years ago | on: Ask HN: Firing an employee under a month before vest?

In general the CEO has the ability to award equity however they want. It's fairly common for exercise windows to be extended, awards to be increased as a part of a severance package, cashless exercise to be allowed, etc. You may need to get sign off from the board, but that's usually easily justifiable.

As stated above, after that, the communication to the rest of the team is key.

jmcminis | 4 years ago | on: Show HN: Webcrepe – SQL for the Internet

I think there is a trend towards using SQL to do the T part of ELT. For example, see the rise in popularity of dbt. Analysts are often limited by the data that's in the warehouse already. Instead of asking a data engineer or developer to do something so that more data gets pushed to the warehouse, I wanted to be able to pull it in myself.

Because of that, I'm starting an open source project, WebCrepe, to empower analysts to pull data directly into their databases using SQL. The idea is that we pair a database extension with a web app to enable searching the internet and pulling in structured data. It's really early right now. I have a docker-compose file you can use to spin up a postgres database and the backend. I still need to write some better documentation on how to write queries but it's basically using the advanced google search language.

I'm interested in analytics folks that have use cases I can build out and engineers interested in working on it. If there is any interest then I'll write up better docs and build more functionality.

jmcminis | 4 years ago | on: Ask HN: Need Advice, ML dev – bangalore on move to Canada

I think it would be easier to move and then find something. Toronto has a good and growing tech scene. I think you should be able to find something interesting fairly quickly with your experience.

jmcminis | 4 years ago | on: Ask HN: Who wants to collaborate?

I'm building a postgres extension that allows you to do web searches using a SQL query. The idea is to be able to pull in data from the web with some structure (which you define using custom scrapers) on demand.

Right now I have a proof of concept that's pretty simple. It's a multicorn extension that calls to a FastAPI backend. I have it all running using docker-compose.

I'm open to working with people that want to use it, or people that want to build it. I don't have any real plans to open source it or commercialize it. It's just a little side project I think is neat. I'm open to any ideas or use cases you might have.

Send me an email (in profile) or dm. Looking forward to it!

jmcminis | 5 years ago | on: Ask HN: Who is hiring? (August 2020)

elovee (https://elovee.com) | ML Engineer, Data Scientist, Full-Stack Software Engineer | North America | Remote | Full Time

We are elovee, a healthcare startup focused on developing A.I. based technology to improve day-to-day care for seniors. We're building a voice user interface for seniors living with dementia. Our mission is to solve loneliness and isolation for seniors.

Roles we're hiring for - ML Engineer/ Data Scientist. - Full Stack Engineer

Why you want to work with us - We are small. You get to help set the culture and direction - Cutting edge technology. We are pushing SoTA Speech-to-text, conversation modeling, text-to-speech models. Tuning where needed and building what we have to.

What we are looking for - Experienced engineers that can take requirements and build products. - A strong sense of ownership. - Empathetic, team oriented teammates. - Connection to our mission

Please connect at [email protected] or reach out with a DM.

jmcminis | 6 years ago | on: Cortex – An ML model deployment platform that runs in your AWS account

Thanks!

I bet you could get Cortex running on Kubeflow pretty easily since it's all K8s anyway.

jmcminis | 6 years ago | on: Cortex – An ML model deployment platform that runs in your AWS account

Compare and contrast with Seldon on Kubeflow?

jmcminis | 7 years ago | on: Ask HN: Best ways of letting someone go in an early stage startup?

Be direct, honest, and compassionate. They are good at something, but not the thing you need most now. Explain that to them, let them go, look for someone else.

Don’t forget that this person is a part of your network and always will be. They might be a good fit later. Someone they know might be a good fit.

jmcminis | 7 years ago | on: Ask HN: How old were the most talented software engineers you've met?

This is kind of an odd question. It only indicates how old they were when you were lucky enough to meet them. Presumably, that person has been gifted before and would be gifted after as well, not just when they were XX years old.

I would be interested to know the trajectory these engineers had as they aged. Were they average until they got some experience under their belt? Genius from day one, but productivity improved? How did they develop over time and what were the predictors of greatness?

jmcminis | 7 years ago | on: Ask HN: Is moving to SV/SF Bay Area still worth it?

Yes. You should go. The job market is so much better. You will probably have a choice of high paying, interesting offers when you choose to leave your postdoc.

I did a postdoc in computational chemistry/condensed matter physics at LLNL and transitioned into industry in SF five years ago. I worked at two high growth startups and was promoted a few times. I got very lucky in picking good companies and got in as an early employee (around 10 each time). I gained a world of knowlege and experience in just a few years. Now I take that experience with me wherever I go. I would do it again in a heartbeat.

The options you have from Stanford will likely be as good or better than mine were. Best of luck!

jmcminis | 7 years ago | on: Ask HN: What do I need to learn to be data analyst?

This is a really nice summary of some of the technical components required. You also need to know how to do different kinds of analysis to answer different kinds of questions. A few more things:

0. Scientific method - probably true for all domains. Not really a kind of analysis, more an approach to doing analysis.

1. Cohort analysis - used in aquisition and retention analysis.

2. Model building - used in all kinds of financial analysis.

3. A/B/... testing - determining the difference between 2 or more populations.

4. Exploratory - understanding the relationships in your data to develop intuition about it.

There are plenty of analysis techniques in use. You can learn more about these and others if you survey blogs and other literature. One that I find interesting is Tom Tunguz. He has a particular theme, but his analysis is very good. The methods and way of thought are transferrable. http://tomtunguz.com/

jmcminis | 7 years ago | on: Ask HN: What non technical skills do you think are valuable to startups?

Sales for b2b. Marketing for b2c.

No matter how good your product is, if you don’t get traction, you’re dead in the water.

jmcminis | 7 years ago | on: Bye HN: Existentially-rational, self-termination

Your email and post/comment history seem quite cogent. You still have the ability to write clearly and effectively. Please consider your options. Reach out to someone locally for help/contact. I would be surprised if there weren’t something you could work productively towards.

jmcminis | 7 years ago | on: Ask HN: Ever been the last engineer at a failing startup?

As long as you're having fun, learning new things, and not missing out on a great opportunity it makes sense to stay. That being said, depending on your job market, it doesn't hurt to look around. You don't have a hard decision to make until you have another job offer on the table.

jmcminis | 7 years ago | on: Fibonacci Hashing: The Optimization That the World Forgot

I think the search part is implementing the Fibonacci Search Technique[0]. It’s related to Golden ratio search[1].

[0] https://en.wikipedia.org/wiki/Fibonacci_search_technique

[1] https://en.wikipedia.org/wiki/Golden-section_search

jmcminis | 8 years ago | on: The Case for Learned Index Structures

So you want a LSM for inserts and the DNN for reads? Seems OK. You still have to update/retrain the DNN after an insert into a larger layer, which will be expensive. So you’d probably get high latency at the 99% (or some high number).