TheOnly92's comments

TheOnly92 | 2 years ago | on: Show HN: Bulletpapers – ArXiv AI paper summarizer, won Anthropic Hackathon

An abstract generally follows a fixed format: it describes the background of the problem, the problem the paper aims to solve, the method the paper uses, and finally a conclusion. The abstract doesn't assume much prior knowledge, and can probably still be understood 10 or 20 years from now. By contrast, you can see how the LLM-summarized version skips the background entirely and jumps straight to the problem and the method.

Now, I'm not saying there is no room for improvement. The fixed format of an academic paper, with an abstract followed by the actual paper, may well be replaced by something like what is shown here. I genuinely hope to see more experimentation with how scientific studies are communicated, but unfortunately that is not a focus in the academic world.

TheOnly92 | 2 years ago | on: Teaching Programming in the Age of ChatGPT

A simple virtual machine might be nice, but imagine the pains of trying to guide students to install something across different environments.

Godbolt is a compiler explorer; it shows the disassembly of your code, but it has nothing to visualize each step in the process.

TheOnly92 | 2 years ago | on: Teaching Programming in the Age of ChatGPT

True, in my experience the variable names and function names remain in English regardless of the prompt language. Maybe it's just the overall convention in the programming world, or maybe ChatGPT is fine-tuned to do so.

I don't think Jupyter notebooks or similar REPL interfaces will help much for my course, at least with the current syllabus. I'm aiming to teach pointers, memory management, etc.: the more fundamental parts of how to interact with computers, rather than a high-level language. I do agree that the current UI is suboptimal, though. Improvements that let students visualize memory layouts and see how their code manipulates memory would help a lot.
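To make the "visualize memory layouts" idea concrete, here is a minimal sketch of the kind of helper that lets students see how a value actually sits in memory. `hex_bytes` is a hypothetical name made up for this illustration, not part of any library or course material.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper: write the bytes of an object as hex pairs
   (for example "12 ab") into out, in the order they sit in memory.
   Returns the length of the resulting string. */
size_t hex_bytes(const void *obj, size_t size, char *out, size_t cap)
{
    static const char digits[] = "0123456789abcdef";
    assert(obj != NULL && out != NULL);
    assert(size > 0 && cap >= size * 3);   /* 2 digits + separator or NUL per byte */

    const unsigned char *p = obj;          /* any object may be read as unsigned char */
    size_t len = 0;
    for (size_t i = 0; i < size; i++) {
        if (i > 0)
            out[len++] = ' ';
        out[len++] = digits[p[i] >> 4];    /* high nibble */
        out[len++] = digits[p[i] & 0x0f];  /* low nibble */
    }
    out[len] = '\0';
    return len;
}
```

Calling it on the address of `int x = 0x12345678;` with `sizeof x` gives "78 56 34 12" on a typical little-endian machine with 4-byte ints, which is exactly the kind of thing a high-level REPL hides.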

TheOnly92 | 2 years ago | on: Teaching Programming in the Age of ChatGPT

It definitely depends on the task at hand, but when you're teaching programming you don't teach stuff with boilerplate. Using ChatGPT for reference to replace Google/Stackoverflow was definitely one of the ways I'd expected the students to utilize it, but it probably wasn't providing answers in ways a beginner/novice could understand.

I'd expect simple tasks like calculating RMSE to be well within the abilities of an LLM. To see the limitations of LLMs, you might combine several things: actually reading the vectors from a CSV file (or a custom format), calculating RMSE, then sorting the results, etc. Most students have no issues with calculating RMSE itself. They have issues with all the other stuff that leads up to it, and then with combining it with sorting and other tasks.
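As a rough sketch of the "other stuff" around the formula, the code below parses one comma-separated line of numbers and computes the error between two vectors. The function names and the one-vector-per-line format are assumptions for illustration only. It stops at the mean squared error: RMSE is `sqrt` (from `<math.h>`) of that value, and since the square root is monotonic, sorting by either gives the same order.

```c
#include <assert.h>
#include <stdlib.h>

/* Parse up to max comma-separated numbers from one line of text.
   Returns how many values were stored in out. */
size_t parse_csv_line(const char *line, double *out, size_t max)
{
    assert(line != NULL && out != NULL);
    size_t n = 0;
    while (n < max) {
        char *end;
        double v = strtod(line, &end);
        if (end == line)
            break;              /* no number at this position: stop */
        out[n++] = v;
        if (*end != ',')
            break;              /* end of line or unexpected character */
        line = end + 1;         /* skip the comma */
    }
    return n;
}

/* Mean of the squared differences between a and b (both of length n). */
double mean_squared_error(const double *a, const double *b, size_t n)
{
    assert(a != NULL && b != NULL && n > 0);
    double sum = 0.0;
    for (size_t i = 0; i < n; i++) {
        double d = a[i] - b[i];
        sum += d * d;
    }
    return sum / (double)n;
}
```

Even in this small form, the parsing has more edge cases (empty lines, too many values, stray characters) than the error formula does.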

Regarding the restrictions, most of them are just "don't use itoa/strtod/strcpy or certain other standard library functions".
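For a sense of what such a restriction asks of a student, here is a minimal sketch of hand-written replacements for two of those functions. `my_strcpy` and `my_itoa` are hypothetical names and signatures chosen for this example (they are not the actual assignment), and `my_itoa` assumes `int` is at most 32 bits wide.

```c
#include <assert.h>
#include <stddef.h>

/* Copy src into dst (capacity cap bytes), truncating if needed and
   always NUL-terminating. Returns the number of characters copied. */
size_t my_strcpy(char *dst, size_t cap, const char *src)
{
    assert(dst != NULL && src != NULL && cap > 0);
    size_t i = 0;
    while (i + 1 < cap && src[i] != '\0') {
        dst[i] = src[i];
        i++;
    }
    dst[i] = '\0';
    return i;
}

/* Write the decimal form of value into buf, which must hold at least
   12 bytes (sign, 10 digits, NUL). Returns the string length. */
size_t my_itoa(int value, char *buf)
{
    assert(buf != NULL);
    char tmp[12];
    size_t n = 0;

    /* Take the magnitude in unsigned arithmetic so INT_MIN does not overflow. */
    unsigned int u = (value < 0) ? 0u - (unsigned int)value
                                 : (unsigned int)value;
    do {                                   /* digits come out least significant first */
        tmp[n++] = (char)('0' + u % 10u);
        u /= 10u;
    } while (u != 0u);

    size_t len = 0;
    if (value < 0)
        buf[len++] = '-';
    while (n > 0)                          /* reverse into the output buffer */
        buf[len++] = tmp[--n];
    buf[len] = '\0';
    return len;
}
```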

TheOnly92 | 2 years ago | on: Teaching Programming in the Age of ChatGPT

Interesting. I suppose it makes sense that rephrasing the problem in a different way, and adding a bunch of nouns that have no relation to the problem at hand, would confuse LLMs. It will be interesting to see how LLMs adapt as more and more of these techniques develop.

In my own assignments, however, I focus less on algorithmic stuff and more on adding and mixing several things together. E.g. instead of just sorting, do group & sort, combined with a bunch of other practical stuff like reading big-endian binary files.
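A minimal sketch of what such a combined task might involve is below. The record layout (two 32-bit big-endian fields, a group id and a value) and all names are assumptions invented for illustration, not an actual assignment. The bytes are decoded by shifting, so the result does not depend on the host's byte order, and "group & sort" is done by sorting on the group first and the value second.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>

/* Decode a 32-bit big-endian value from 4 raw bytes. */
uint32_t read_be32(const unsigned char *b)
{
    assert(b != NULL);
    return ((uint32_t)b[0] << 24) | ((uint32_t)b[1] << 16) |
           ((uint32_t)b[2] << 8)  |  (uint32_t)b[3];
}

/* Hypothetical record layout: a group id followed by a value. */
struct record {
    uint32_t group;
    uint32_t value;
};

/* qsort comparator: order by group, then by value within a group. */
int cmp_record(const void *x, const void *y)
{
    const struct record *a = x, *b = y;
    if (a->group != b->group)
        return (a->group > b->group) - (a->group < b->group);
    return (a->value > b->value) - (a->value < b->value);
}

/* Decode n records of 8 bytes each from raw into out, then sort them. */
void load_and_sort(const unsigned char *raw, size_t n, struct record *out)
{
    assert(raw != NULL && out != NULL);
    for (size_t i = 0; i < n; i++) {
        out[i].group = read_be32(raw + 8 * i);
        out[i].value = read_be32(raw + 8 * i + 4);
    }
    qsort(out, n, sizeof out[0], cmp_record);
}
```

In a real program the raw bytes would come from `fread` on a file opened in `"rb"` mode. After sorting, records of the same group are adjacent, so per-group processing is a single pass over the array.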

TheOnly92 | 2 years ago | on: Teaching Programming in the Age of ChatGPT

I happen to be teaching a programming course currently, though it's not in English and the language I'm teaching is C. My current experience is that it does not seem like a majority of the students are using ChatGPT at all, even though I did encourage the use of it at the beginning of the course.

For my own course, I think several factors contributed to students not utilizing ChatGPT as much:

    - The assignments are not in English, and performance of ChatGPT in languages other than English is subpar.
    - The programming language that I'm teaching is C. I'd imagine Python/JavaScript and other more popular languages might lead to different outcomes.
    - I did specifically design the assignments so that copy/pasting the assignment to ChatGPT does not lead to a usable answer (by restricting use of certain standard library functions, making the assignment more complicated)
    - The course is not introductory, i.e. a previous course already taught the basic syntax of C and basics of programming, so I can make my assignments much more advanced

It's difficult to say whether advancements in LLMs will make my job harder, to the point where, say, copy/pasting my more complicated assignments leads to correct results. But from what I can see right now, LLMs still have trouble solving novel problems, so it's probably always possible to come up with assignments that are difficult for them to solve.

TheOnly92 | 2 years ago | on: Are emergent abilities of large language models a mirage?

I agree that perhaps the metrics are not as useful themselves, but I think you're giving too little credit to the paper where maybe some credit is due.

I think the paper is correct that there are no "emergent abilities", i.e. abilities that suddenly appear when the scale of the model is increased. And though it might not be fully accurate, the paper did make an effort to formalize this, and I think it is a good attempt at proving the point.

However, as we all recognize, there are still some weird discontinuities where at one point the model is useless and then suddenly it becomes very useful. This "discontinuity" IMHO is probably just perceptual, while the underlying metric is continuous.
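A toy calculation shows how a continuous underlying metric can still look like a sudden jump. Assume, purely for illustration, that an answer only counts if all of its tokens are right, and that each token is independently correct with probability p. Both the independence assumption and the function name are mine, not something taken from the paper.

```c
#include <assert.h>

/* Toy model: probability that all len tokens are correct, assuming each
   token is independently correct with probability p. */
double exact_match(double p, int len)
{
    assert(p >= 0.0 && p <= 1.0 && len >= 0);
    double r = 1.0;
    for (int i = 0; i < len; i++)
        r *= p;                 /* every token must be right */
    return r;
}
```

For a 20-token answer, p = 0.8 gives roughly 0.012, p = 0.9 gives roughly 0.12, and p = 0.99 gives roughly 0.82. The per-token accuracy improves smoothly, but the all-or-nothing score goes from "useless" to "very useful" over a narrow range.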

TheOnly92 | 3 years ago | on: Lost USB with 460k city residents’ personal data found

They actually announced the length of the password and the character types used (letters/numbers) at yesterday's press conference, if you can believe it...

Many people on the internet guessed what the password probably was (city name + year).

TheOnly92 | 4 years ago | on: A quick breakdown of what SWIFT is and why it matters

Well, the simple way to look at this is that when you transact with another bank, you're usually required to have an account at that bank. Just as a consumer can't magically change their balance, a bank can't magically increase its balance at other banks. To transact more, you first have to send money to that bank.

TheOnly92 | 4 years ago | on: Banks beware, Amazon and Walmart are cracking the code for finance

The problem is that in the end, you still need to move the money from the customer's bank to yours. That can happen either directly or through an intermediary. If you have a direct relationship with the customer's bank, then all is well and dandy, but as a risky business you'd probably have a hard time establishing that kind of direct relationship. The easier alternative is to find an intermediary bank that has connections to a lot of major banks. If you can actually find an intermediary bank willing to do business with you, i.e. open a correspondent account for your business, then you can pretty much accept payments. The trouble is that if other banks find out that your intermediary bank is doing business with you, they might be able to force it to cut off that relationship. I believe Bitfinex allegedly had that issue. So in the end, unless the customer is directly handing cash to you, you're pretty much at someone's mercy, regardless of which channel the payment goes through. If you want to set up a new credit card brand, better make sure the customer's bank is willing to send you the funds when it's time for settlement.