top | item 43907634

(no title)

willbryk | 10 months ago

We're a startup, so most of our resources go towards use cases that our users care most about. So the search should work best for - people, companies, papers, high quality written content (e.g., blogs, news). It should work well at more than just those (try Github repo search, it's quite good :D), but those are the best supported.

Types of searches Websets doesn't currently do well at: - products (e.g., ecommerce sites) - Content that requires authentication/permissions to access - non-English content

Some of the above are on our roadmap, and let us know if there's some type of data you'd like us to support!

discuss

order

mdaniel|10 months ago

> (try Github repo search, it's quite good :D)

Since you called it out, I gave it a whirl:

https://websets.exa.ai/api/trpc/getPreview?batch=1&input=%7B...

and it did nothing to the page at all, choosing to still show the "Full-stack engineers in SF that are great at design, and have worked at an AI startup" example table

I'm open to the fact that "I'm holding it wrong" or whatever, but the response payload included things that are clearly not GitHub Repositories

          {
            "id": "https://authzforce.ow2.org",
            "entityId": "https://authzforce.ow2.org",
            "properties": {
              "type": "custom",
              "description": "AuthzForce (Community Edition) - XWiki",
              "url": "https://authzforce.ow2.org",
and its .text contains no mention of ReBAC

later on it came closer

          {
            "id": "https://github.com/authzforce",
            "entityId": "https://github.com/authzforce",
            "properties": {
              "type": "custom",
              "description": "AuthzForce Community Edition",
              "url": "https://github.com/authzforce",
but, of course, no ReBAC in its .text either

It seems it is about 30/70 on finding the things I asked for, so I don't mean to imply it's worthless, but it is yet another example of "turns out, AI does not solve all problems"

---

I make a habit out of having the dev-tools open when interacting with things where the comments have explicitly called out "we were down and we don't check our response.statusCode" and that's the only reason I am able to offer you any feedback whatsoever

colkassad|10 months ago

Geospatial data would be great. This stuff is notoriously annoying to search for. For example:

"Give me a list of free imagery service endpoints I can use in a maplibre style sheet. Include information such as name, description, service endpoint, service type, extent (global/regional)."

willbryk|10 months ago

This might be possible if you specify geospatial location as an enriched column. The visualization of it as a map though is not supported in the UI, but can be built by giving an LLM access to the Websets API