cyclecycle | 8 months ago | on: Springer Nature book on machine learning is full of made-up citations
cyclecycle's comments
cyclecycle | 1 year ago | on: I want flexible queries, not RAG
What we've found is that vector similarity is often not the final solution. It is still only a crude proxy for the true goal of 'informativeness' or 'usefulness' with relation to the user goal/query. Works okay, but we're definitely seeing a need for more rigorous LLM-postprocessing to enrich the results set.
Which, yes, the time adds up quick!
cyclecycle | 3 years ago
cyclecycle | 3 years ago | on: The Future of Work in an AI-Powered World
cyclecycle | 3 years ago | on: Ask HN: Is there an AI service to interpret a collection of articles?
cyclecycle | 3 years ago | on: Show HN: We created a tool to visualize scientific knowledge
What kind of thing would you hope to see here? A textual summary of the relationship? Or perhaps there is more that can be done with shapes and colours in this area?
cyclecycle | 3 years ago | on: Show HN: We created a tool to visualize scientific knowledge
The keywords are grouped such that keywords that occur together often are in the same group. The colour represents this grouping.
There may be more useful groupings we could give or allow the user to choose between. We would be interested to hear any ideas for that
cyclecycle | 7 years ago | on: Show HN: Simplified music notation
We face the problem that people differ in what they think the features should be though. Ideally we would have a method to deduce what's the best symbolism for maximising input/output speed to human's minds. Some kind of scientific voodoo.
Beyond me what that might be.
cyclecycle | 7 years ago | on: Show HN: Simplified music notation
cyclecycle | 7 years ago | on: Ask HN: Those making $500+/month on side projects in 2018 – Show and tell
cyclecycle | 7 years ago | on: Biomedical knowledge graph-backed service: seeking collaborators
Practically speaking: Aristo takes questions in unstructured text, and answers in unstructured text. I'm interested in providing mechanistic queries and comprehensive, highly-structured result sets.
For a question such as, "what are the biological consequences of increasing the activity of molecule A", I want tabular and filterable results (where the number of rows depends on the volume of underlying data and the degrees of separation you carry the inference to). For this reason (alongside their current limitation to elementary science) I argue that Aristo is not currently a relevant resource for researchers and students looking to query and survey biomedical relationships.
The solution I'm aiming at takes a structured query and returns structured results. E.g, a query: [entity: molecule A, direction: increase] generates a list of direct and inferred consequences. It is more like a logic-driven search engine over structured information than it is a question answering system.
I work on Veracity https://groundedai.company/veracity/ which does citation checking for academic publishers. I see stuff like this all the time in paper submissions. Publishers are inundated