doctorpangloss 8 months ago

Yesterday my company changed a single line to switch our application’s LLM backend API from Claude to ChatGPT, because Claude started adding stuff to its answers in QA-style prompts that it wasn’t adding before, at least since August 1st.

I wish I could pay for a guarantee of performance, really a guarantee around quantization. It seems so simple, but because it can cut their costs 2-4x, LLM API providers keep quantizing and distilling without telling anyone. It’s a longer journey to become an enterprise API. Which, by the way, is a terrible business to be in.

mellosouls 8 months ago

This is interesting and appreciated but I'm not sure it's a Show HN unless the OP is representing Anthropic?

  • rubslopes 8 months ago

    All 3 submissions of OP start with "Show HN". I think they are not aware of HN protocols.

zurfer 8 months ago

Well done. We are building something similar [1] and found that generating UIs (mostly charts) on the fly works surprisingly well in most cases, but can be a bit frustrating if you know exactly what you want and just can't prompt it (as a user) to do that because of some edge case.

While this is a cool demo that shows what LLMs can do, I am a bit surprised by how polished and advanced it looks (even PDF upload) for a quickstart. Anyway, I love that it's open source so we can learn from it.

[1] https://getdot.ai

dash2 8 months ago

This seems like Anthropic showing people how to build a thin layer around Claude. Can building a thin layer around Claude be a valuable business model? If there are good profitable UIs for Claude, wouldn't Anthropic implement them itself?

  • anonzzzies 8 months ago

    I would indeed (like Altman promised for OpenAI: 'we're gonna steamroll you') implement all these cases (and many more) on their side, so their 'chat' becomes a full toolkit for building, visualising, prompting etc., and allow people to plug in their data/processes (maybe with a few partners for that part, which they can easily replace or have multiple of).

    Currently, the "Added to project" button that remains for n seconds and you have to wait for to add another file (sometimes Claude generates 4-5 files per chat) is such an annoyance that I guess they should stick to training and nothing else.

  • fragmede 8 months ago

    Opportunity cost. Anthropic's deal is in training Claude and whatever they choose to call their next model, not whatever weird little niche you're going after. I might not go after programming, but, say, a D&D character backstory generator would be a wrapper that's probably not interesting enough for them to build themselves to compete with yours. Or maybe it is, but your D&D character backstory generator also doesn't have to use Anthropic as the backend; there are others for you to choose from, so it's a bit of a standoff.

    • Larrikin 8 months ago

      But is there any reason to use them beyond a demo to investors while you actually build the business on Llama? Why build a business around a permanent subscription that's the entire core of your business?

      • fragmede 8 months ago

        Because for this hypothetical niche, Claude is better than Llama. Now, whether or not that's actually true, I don't know, but while it would be nice to sell shovels in a gold rush, not everyone has the privilege of being able to do that. In this metaphor, some people only know how to mine for gold, and pivoting to selling shovels is an entirely different skillset that the miner doesn't possess.

  • peer2pay 8 months ago

    I think the idea here is to build a thin layer but BYOD (bring your own data).

    I currently work for a company where most of our value add lies in data collection, cleaning, and running proprietary algorithms. A UI like this would be a game changer for us, and something that Anthropic couldn’t easily replicate due to all the IP in our data pipeline.

  • billsunshine 8 months ago

    This...so this. There is no value capture in building a shell around Claude.

    • Viliam1234 8 months ago

      Perhaps there is a lot of money you could get in the short term. Enough to pay the costs and generate some profit.

      Also, most people are not computer experts; if you show them that something can be done using your website, they will continue to do it using your website long after others have added the same functionality.

bl4ckneon 8 months ago

(I didn't look at the code yet, but) would a challenge of building an app like this, one that heavily depends on an LLM, be getting a deterministic response back? I guess you could code it to check whether it gave you a certain format of data, or whether it was what you expected, but if I upload something that Claude doesn't understand and it gives back something that breaks the data analysis, that seems like a tricky case to handle.

Please correct me if I am wrong. Thanks!

  • SparkyMcUnicorn 8 months ago

    Anthropic and OpenAI let you define a JSON schema to adhere to for tool calling.

    Here's the part you're looking for: https://github.com/anthropics/anthropic-quickstarts/blob/mai...
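
    Roughly what that looks like with the Python SDK (a minimal sketch, not taken from the quickstart itself; the tool name, schema, and prompt below are made-up placeholders):

      import anthropic

      client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

      # Hypothetical tool whose input_schema constrains the shape of the model's output.
      chart_tool = {
          "name": "generate_chart_data",
          "description": "Return chart-ready data extracted from the user's message.",
          "input_schema": {
              "type": "object",
              "properties": {
                  "chart_type": {"type": "string", "enum": ["line", "bar", "pie"]},
                  "labels": {"type": "array", "items": {"type": "string"}},
                  "values": {"type": "array", "items": {"type": "number"}},
              },
              "required": ["chart_type", "labels", "values"],
          },
      }

      response = client.messages.create(
          model="claude-3-5-sonnet-20240620",
          max_tokens=1024,
          tools=[chart_tool],
          tool_choice={"type": "tool", "name": "generate_chart_data"},  # force a structured tool call
          messages=[{"role": "user", "content": "Plot monthly revenue: Jan 10, Feb 12, Mar 9"}],
      )

      # The tool_use block's input should follow the schema; that's what downstream code parses.
      tool_use = next(block for block in response.content if block.type == "tool_use")
      print(tool_use.input)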

    • cj 8 months ago

      For some reason, the guarantee about the format of the response doesn't seem sufficient to prevent backwards-incompatible changes that may happen to models.

      Yes, the response might be in a standard format. But a well-formed response can still be bad/broken.

      Another way to think about it: it can "pass QA" one day and "fail QA" the next, even if the API response is identically formatted/structured.

      • SparkyMcUnicorn 8 months ago

        This is why OpenAI and Anthropic provide date-versioned models.

        gpt-4o can change, but gpt-4o-2024-05-13 will always use the 2024-05-13 snapshot.
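
        A minimal sketch of pinning (the prompt is just a placeholder; only the model IDs are real):

          from openai import OpenAI

          client = OpenAI()  # reads OPENAI_API_KEY from the environment

          # "gpt-4o" is a floating alias that can be repointed at newer snapshots;
          # the dated ID keeps every request on the same snapshot.
          resp = client.chat.completions.create(
              model="gpt-4o-2024-05-13",  # pinned snapshot instead of "gpt-4o"
              messages=[{"role": "user", "content": "Summarize Q2 revenue in one sentence."}],
          )
          print(resp.choices[0].message.content)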

        • cj 8 months ago

          I have a feeling those dates are an illusion of sorts.

          I get the feeling they frequently deploy hot patches for edge cases. I hate to call them edge cases because they are actually “real cases”: things like adjusting system prompts so that one day it might happily answer “Fill in the blank: F _ _ _ you”.

          To truly freeze a model, you would need to freeze its weights, freeze its system prompts (which no one sees), and avoid any and all actions that might impact its output. Perhaps you would even need the default temperature to be 0 so it’s a truly deterministic API, with the option to add in some temperature to the responses.

          Until then, I consider those “versions” to reference only the model weights and not the abstractions around the model.
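
          Something like this is the closest you can get today (a rough sketch; even with a dated model and temperature 0, providers don't promise bit-exact determinism, and the prompt here is just a placeholder):

            import anthropic

            client = anthropic.Anthropic()

            resp = client.messages.create(
                model="claude-3-5-sonnet-20240620",  # dated snapshot, not a floating alias
                max_tokens=256,
                temperature=0,  # greedy-ish decoding: lower run-to-run variance, still not a hard guarantee
                messages=[{"role": "user", "content": "Classify this ticket: 'App crashes on login'"}],
            )
            print(resp.content[0].text)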

        • felixvolny 8 months ago

          Tangent, but it seems like such a tough engineering challenge to keep all these models around and available in an instant.

rerdavies 8 months ago

Kind of fun. I recently used Claude to generate scripts for Gnuplot, with only slightly less convenience than this. It's kind of spooky what you can ask Claude to do, e.g. "Rotate the x-axis labels by 90 degrees; use 'Arial Black' for the title, and 'Roboto' for the rest of the fonts." Etc.

weinzierl 8 months ago

I wish they'd focus more on getting the basics solid. Currently Claude can't even render anything beyond the most basic form of a table.

For example, try to get it to turn multiple items in a table cell into a bulleted list. It just outputs a mess of literal HTML tags.

ideashower 8 months ago

Can you take these resulting interactives and export them for publishing?

troupo 8 months ago

Do they have any plans on opening up APIs to private individuals?

albert_e 8 months ago

Looks very interesting.

I am more familiar with React and am looking for a React example that achieves a similar UI. Any working examples I can take inspiration from?

  • SparkyMcUnicorn 8 months ago

    This is React.

    • albert_e 8 months ago

      My bad. I had a brainfade - registered something else on my first skim. Thanks.