Astro/Solid - Hacker News

$sherlockxu 10 hours ago

Hi everyone. I'm one of the maintainers of this project. We're both excited and humbled to see it on Hacker News!

We created this handbook to make LLM inference concepts more accessible, especially for developers building real-world LLM applications. The goal is to pull together scattered knowledge into something clear, practical, and easy to build on.

We’re continuing to improve it, so feedback is very welcome!

GitHub repo: https://github.com/bentoml/llm-inference-in-production

[-]

$armcat 8 hours ago

Amazing work on this, beautifully put together and very useful!

$subset 6 hours ago

Ooh this looks really neat! I'd love to see more content in the future on Structured outputs/Guided generation and sampling. Another great reference on inference-time algorithms for sampling is here: https://rentry.co/samplers

[-]

$larme 2 hours ago

Wow that's really thorough

$aligundogdu 9 hours ago

It's a really beautiful project, and I’d like to ask something purely out of curiosity and with the best intentions. What’s the name of the design trend you used for your website? I really loved the website too.

[-]

$Jimmc414 15 minutes ago

it appears to be using Infima, which is Docusaurus's default CSS framework plus a standard system font stack

[0] font-family: -apple-system, BlinkMacSystemFont, "Segoe UI", Roboto, Helvetica, Arial, sans-serif;

$qrios 5 hours ago

Thanks for putting this together! From now on I only need one link to point interested ones to learn.

Only one suggestion: On page "OpenAI-compatible API" it would be great to have also a simple example for the pure REST call instead of the need to import the OpenAI package.

$srameshc 3 hours ago

If I remember, BentoML was about MLOps, I remember trying it about a year back. Did the company pivot ?

[-]

$fsjayess 3 hours ago

There is a big pie in the market around LLM serving. It make sense for a serving framework to extend into the space

$holografix 6 hours ago

Very good reference thanks for collating this!

$Domainzsite 4 hours ago

[dead]

LLM Inference Handbook