3Play makes content accessible across disability and language. Our human-QA'd captioning, audio description, subtitling, and dubbing services are used by over 10,000 media, higher ed, creator, enterprise, and government customers. We’ve worked on really cool projects; recent examples include captioning the 2024 Paris Olympics and providing quality AI dubbing for high-profile YouTube creators.
I'm leading the Growth team, and we're working to build new, beautiful UI experiences on our long-loved services. Our stack is a Remix/Rails product and an Astro/Sanity marketing site, both backed by a modern design system foundation (what you might expect: TypeScript, Tailwind, ShadCN, TanStack, etc.). I'm looking for a principal-level engineer with strong taste in front-end: someone who cares about UX and styling, takes accessibility and page weight seriously, builds strong validation and testing harnesses, and keeps a dependency tree sane instead of letting it metastasize.
The other part of this role is contributing to our agents and workflows. We've invested heavily here with the right mix of enthusiasm, guardrails, and budget. You'll have heavy influence over how it evolves: what we hand to agents vs. keep in human hands, how work gets specified and parallelized, how we keep the results beautiful, and where the security and review gates live. There’s a ton of opportunity to work with the latest AI tooling and determine best practices as the industry matures. We’re a small team that has been happily at work for over 15 years, with high impact, high autonomy, and low bureaucracy. Come join us! Apply at https://www.3playmedia.com/company/jobs-post/?gh_jid=7761973.... Mention HN when you apply!
We are hiring a GPU Engineer to work on the fastest LLM inference engine on standard datacenter GPUs.
You would own low-level kernel work in CUDA/PTX or HIP/CDNA ISA, the monokernel pipeline, profiling infrastructure inside it, scaling to the frontier MoE models that run in production, and building our own agents that optimize kernels and inference autonomously.
We generate 3,000 tokens/s per request on 8x AMD MI300X and 2,100 on 8x NVIDIA H200, at batch size 1, FP16, no speculative decoding.
At batch size 1, decode is GEMV, so it is memory-bandwidth bound, and MBU is what counts.
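For readers who want to see why bandwidth is the number to watch, here is a rough back-of-the-envelope sketch. It reads MBU as the fraction of peak memory bandwidth actually achieved, and assumes each generated token streams every weight from HBM exactly once. The parameter count is hypothetical (none is stated here), the peak bandwidth figures are vendor-quoted numbers taken as assumptions, and KV-cache and activation traffic are ignored.

```python
# Rough MBU estimate for batch-size-1 decode (an illustrative sketch only).
# Assumption: every generated token streams all model weights from HBM once;
# KV-cache and activation traffic are ignored.

def mbu(tokens_per_s: float, weight_bytes: float,
        n_gpus: int, peak_bw_per_gpu: float) -> float:
    """Fraction of aggregate peak memory bandwidth used by weight reads."""
    achieved_bw = tokens_per_s * weight_bytes   # bytes/s actually streamed
    peak_bw = n_gpus * peak_bw_per_gpu          # bytes/s available in total
    return achieved_bw / peak_bw

# Hypothetical model size: not taken from the posting.
HYPOTHETICAL_PARAMS = 2e9
weight_bytes = HYPOTHETICAL_PARAMS * 2          # FP16 = 2 bytes per parameter

# Vendor-quoted peak HBM bandwidth per GPU, in bytes/s (assumed figures).
MI300X_BW = 5.3e12
H200_BW = 4.8e12

print(f"MI300X x8: {mbu(3000, weight_bytes, 8, MI300X_BW):.1%}")
print(f"H200 x8:   {mbu(2100, weight_bytes, 8, H200_BW):.1%}")
```

For an MoE model, only the weights of the experts actually activated per token would count toward weight_bytes, so the estimate changes accordingly.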
We rewrote the whole hot path ourselves, from the assembly on the chip up to the Transformer we designed around it, with the full decode running as a single persistent GPU kernel.
Try it at https://playground.kog.ai
Showing your code is part of the process.
If you are outside a Europe-compatible timezone, relocation to one is required.
Apply: https://jobs.ashbyhq.com/kog/e3950334-a2a6-43cc-a744-df6c386...
Questions? Email me at nicolas.constant@kog.ai
We are looking for a senior software engineer (Rust, React) to join our team. Aqora has just raised its seed round and we are growing our team to build the "Huggingface of Quantum Computing". Reach out via hiring at aqora.io
Rail Europe is a global travel tech company and the reference brand for European train booking. We're looking for experienced Ruby on Rails developers.
Stack: Ruby on Rails, Hotwire, AWS, PostgreSQL
Open roles here: https://www.welcometothejungle.com/en/companies-v1/rail-euro...
The Open Source, AI-native billing platform - Lago is the billing platform that gives teams full transparency, control and flexibility to manage and scale any pricing model. Trusted by PayPal, Synthesia, CoreWeave and Mistral.ai to handle their billing.
Right now our hiring is deliberate and specific. We'd especially love to hear from applicants to our open Growth Chief of Staff, Engineering Squad Lead and Account Executive positions.
All of our open roles here: https://jobs.ashbyhq.com/lago
Currently listed:
GTM:
- Technical Account Executive
- Forward Deployed Engineer
- Growth Chief of Staff
- Solutions Engineer (Pre & Post Sales)
Product/Engineering:
- Product Engineer
- Product Designer
- Engineering Squad Lead