• Shaarli
  • Tag cloud
  • Picture wall
  • Daily
  • RSS
  • Login
4251 shaares
1 / 2
Filters
21 results tagged llm

Generating Content with ChatGPT - Perl Hacks

QRCode

Back in January, I wrote a blog post about adding JSON-LD to your web pages to make it easier for Google to understand what they were about. The example I used was my ReadABooker site, which encourages people to read more Booker Prize shortlisted novels (and to do so by buying them using my Amazon

https://perlhacks.com/2025/06/generating-content-with-chatgpt/
June 20, 2025 at 2:49:13 PM EDT *
perl openapi chatgpt llm
FILLER

How did *thinking* reasoning LLM's go from a github experiment 4 months ago, to every major company offering super advanced thinking models only 4 months later, that can iterate code, internally plan code, it seems a bit fast? Was it already developed by major companies, but unreleased? : MLQuestions

QRCode

It was like a revelation when chain-of-thought AI became viral news as a GitHub project that supposedly competed with SOTA's with only 2 developers and some nifty prompting...
Did all the companies just jump on the bandwagon an weave it into GPT/ Gemini / Claude in a hurry?
Did those companies already have e.g. Gemini 2.5 PRO thinking in development 4 months ago and we didn't know?

https://old.reddit.com/r/MLQuestions/comments/1kmg8ne/how_did_thinking_reasoning_llms_go_from_a_github/
June 10, 2025 at 2:49:19 PM EDT *
reddit ai llm
FILLER

A simple search engine from scratch* | Max Bernstein

QRCode

*if you include word2vec.

https://jaketae.github.io/study/word2vec/#testing-the-model

https://bernsteinbear.com/blog/simple-search/
June 2, 2025 at 3:09:55 PM EDT *
llm search embeddings
FILLER

Why the Coolest Job in Tech Might Actually Be in a Bank

QRCode

For tech and AI talent, jobs at financial services companies are more desirable than they have ever been. Banks have been working hard to make it happen.

https://www.wsj.com/articles/why-the-coolest-job-in-tech-might-actually-be-in-a-bank-274330c5
April 5, 2025 at 10:30:26 AM EDT *
career llm ai
FILLER

Presentation Slide Templates | Beautiful.ai

QRCode

Build your next presentation in minutes with our free slide templates! No matter what you’re creating, Beautiful.ai has the template for you.

https://www.beautiful.ai/slide-templates
March 19, 2025 at 11:03:48 AM EDT *
llm presentation slideshow design
FILLER

Personal Software: The Unbundling of the Programmer?

QRCode

Why LLMs will transform development but not how you think

it's about how AI tools are enabling a new category of software that simply couldn't exist before.

When someone can describe their specific needs conversationally and receive working code in response, the economics of personal software development shift dramatically.

Think of it this way: just as spreadsheets enabled non-programmers to perform complex calculations and data analysis, AI-assisted development tools are enabling non-programmers to create personal software solutions.

https://addyo.substack.com/p/personal-software-the-unbundling
March 6, 2025 at 3:53:20 PM EST *
ai llm webdesign career mobile
FILLER

Pulse AI Blog - Why LLMs Suck at OCR

QRCode

LLM’s suck at complex OCR, and probably will for a while. LLMs are excellent for many text-generation or summarization tasks, but they falter at the precise, detail-oriented job of OCR—especially when dealing with complicated layouts, unusual fonts, or tables. These models get lazy, often not following prompt instructions across hundreds of pages, failing to parse information, and “thinking” too much.

LLMs process images through high-dimensional embeddings, essentially creating abstract representations that prioritize semantic understanding over precise character recognition

Consider a simple table cell containing "1,234.56". The LLM might understand this represents a number in the thousands, but lose critical information about:

Exact decimal placement
Whether commas or periods are used as separators
Font characteristics indicating special meaning
Alignment within the cell (right-aligned for numbers, etc.)

https://news.ycombinator.com/item?id=42966958

https://www.runpulse.com/blog/why-llms-suck-at-ocr
February 12, 2025 at 10:56:59 AM EST *
llm gemini pdf ocr
FILLER

Ingesting Millions of PDFs and why Gemini 2.0 Changes Everything

QRCode

Markdown extraction is just the first step. For documents to be effectively used in RAG pipelines, they must be split into smaller, semantically meaningful chunks.

Recent studies have shown that using large language models (LLMs) for this task can outperform other strategies in terms of retrieval accuracy. This intuitively makes sense - LLMs excel at understanding context and identifying natural boundaries in text, making them well-suited for generating semantically meaningful chunks.

The problem? Cost. Until now, LLM-based chunking has been prohibitively expensive. With Gemini Flash 2.0, however, the game changes again - it's pricing makes it feasible to use it to chunk documents at scale.

https://news.ycombinator.com/item?id=42952605

(disclaimer I am CEO of llamaindex, which includes LlamaParse)
Nice article! We're actively benchmarking Gemini 2.0 right now and if the results are as good as implied by this article, heck we'll adapt and improve upon it. Our goal (and in fact the reason our parser works so well) is to always use and stay on top of the latest SOTA models and tech :) - we blend LLM/VLM tech with best-in-class heuristic techniques.

Some quick notes: 1. I'm glad that LlamaParse is mentioned in the article, but it's not mentioned in the performance benchmarks. I'm pretty confident that our most accurate modes are at the top of the table benchmark - our stuff is pretty good.

  1. There's a long tail of issues beyond just tables - this includes fonts, headers/footers, ability to recognize charts/images/form fields, and as other posters said, the ability to have fine-grained bounding boxes on the source elements. We've optimized our parser to tackle all of these modes, and we need proper benchmarks for that.

  2. DIY'ing your own pipeline to run a VLM at scale to parse docs is surprisingly challenging. You need to orchestrate a robust system that can screenshot a bunch of pages at the right resolution (which can be quite slow), tune the prompts, and make sure you're obeying rate limits + can retry on failure.

https://www.sergey.fyi/articles/gemini-flash-2
February 12, 2025 at 10:51:05 AM EST *
llm pdf google gemini ocr
FILLER

Which AI to Use Now: An Updated Opinionated Guide

QRCode

Picking your general-purpose AI

Also:

https://www.oneusefulthing.org/p/doing-stuff-with-ai-opinionated-midyear

https://www.oneusefulthing.org/p/which-ai-to-use-now-an-updated-opinionated
February 3, 2025 at 3:02:24 PM EST *
ai llm
FILLER

Collaborators needed for bring full OpenAI support to Perl

QRCode

Thus, that module was deprecated in favor of Nelson's OpenAPI::Client::OpenAI module. Throw the 13K+ lines OpenAPI spec for OpenAI at it and it just works. Further, the module is pretty much a single Perl class rather than a bunch of hand-crafted code.

CPAN authors know it can be hard to keep modules up-to-date (mea culpa, mea culpa!) and this module is no exception. I need this module so I offered to collaborate and created a PR to update it to version 2.0.0 of the OpenAI spec. It now passes all the tests (for those wondering, you need an OpenAI key and it costs $0.04 USD to run the test suite).

In trying to build a Whisper pipeline for that, I found that I couldn't. There was a PR for Whisper support for the older module, but for the newer one, I can't figure out how to get it to issue a request with multipart/form-data support. I've noted the issue in the PR.

https://www.reddit.com/r/perl/comments/1cwa4zd/collaborators_needed_for_bring_full_openai/?rdt=43096
May 20, 2024 at 2:34:47 PM EDT *
llm openapi perl
FILLER

Photoshop for text — Steph Ango

QRCode

In the near future, transforming text over an entire document will become as commonplace as filtering images.

Up until now, text editors have been focused on input. The next evolution of text editors will make it easy to alter, summarize and lengthen text. You’ll be able to do this for entire documents, not just individual sentences or paragraphs. The filters will be instantaneous and as good as if you wrote the text yourself. You will also be able to do this with local files, on your device, without relying on remote servers.

In a “A camera for ideas”, I coined the term synthography to describe synthetic images created with generative models.

https://stephango.com/photoshop-for-text
January 13, 2024 at 8:03:16 PM EST *
blog obsidian llm chatgpt writing
FILLER

Text generator plugin for Obsidian to generate text content using GPT-3 (OpenAI) | GitHub - nhaouari/obsidian-textgenerator-plugin

QRCode

Text generator is a handy plugin for Obsidian that helps you generate text content using GPT-3 (OpenAI). - GitHub - nhaouari/obsidian-textgenerator-plugin: Text generator is a handy plugin for Obsidian that helps you generate text content using GPT-3 (OpenAI).

Use Text Generator to generate ideas, attractive titles, summaries, outlines, and whole paragraphs based on your knowledge database.

https://github.com/nhaouari/obsidian-textgenerator-plugin
January 13, 2024 at 8:01:49 PM EST *
chatgpt llm obsidian notetaking
FILLER

14islands | The art of prompting: An introduction to Midjourney

QRCode

A great deal of my learnings and inspiration comes from the great content from Yubin Ma at AiTuts, where you can learn more about prompting and view a myriad of examples.

https://www.14islands.com/journal/the-art-of-prompting
January 13, 2024 at 11:16:15 AM EST *
midjourney llm ai
FILLER

GitHub - abi/screenshot-to-code: Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

QRCode

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue) - GitHub - abi/screenshot-to-code: Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

https://github.com/abi/screenshot-to-code
January 13, 2024 at 11:14:31 AM EST *
llm programming chatgpt webdesign
FILLER

Ask HN: Tutorial on LLM / already grasp neural nets | Hacker News

QRCode

I've watched the 4 videos from 3blue1brown on neural nets. The web and youtube are awash with mediocre videos on Large Language Models. I'm looking for a good one.

This is part of a longer series but is maybe the single best video I know of on the topic:
https://youtu.be/kCc8FmEb1nY?si=zmBleKwlpV06O3Mw

I thought this video from Steven Wolfram was also quite good:
https://www.youtube.com/live/flXrLGPY3SU?si=SrP1EJFMPJqVCFPL

https://news.ycombinator.com/item?id=38438202
January 13, 2024 at 11:12:32 AM EST *
llm chatgpt ai
FILLER

GitHub - varunshenoy/opendream: An extensible, easy-to-use, and portable diffusion web UI 👨‍🎨

QRCode

An extensible, easy-to-use, and portable diffusion web UI 👨‍🎨 - GitHub - varunshenoy/opendream: An extensible, easy-to-use, and portable diffusion web UI 👨‍🎨

https://github.com/varunshenoy/opendream
September 14, 2023 at 9:51:38 AM EDT *
llm stablediffusion
FILLER

LLM: A CLI utility and Python library for interacting with Large Language Models

QRCode

A CLI utility and Python library for interacting with Large Language Models, both via remote APIs and models that can be installed and run on your own machine.

https://llm.datasette.io/en/stable/
September 14, 2023 at 9:37:08 AM EDT *
llm python
FILLER

What are embeddings?

QRCode

A deep-dive into machine learning embeddings.

https://vickiboykis.com/what_are_embeddings/
August 11, 2023 at 2:42:02 PM EDT *
ai llm machinelearning
FILLER

How to Use AI to Do Stuff: An Opinionated Guide

QRCode

Covering the state of play as of Summer, 2023

https://www.oneusefulthing.org/p/how-to-use-ai-to-do-stuff-an-opinionated
August 11, 2023 at 2:24:24 PM EDT *
ai llm
FILLER

Patterns for Building LLM-based Systems & Products

QRCode

Evals, RAG, fine-tuning, caching, guardrails, defensive UX, and collecting user feedback.

https://eugeneyan.com/writing/llm-patterns/
August 3, 2023 at 8:12:19 AM EDT *
llm chatgpt
FILLER
1 / 2
Shaarli · The personal, minimalist, super fast, database-free, bookmarking service by the Shaarli community · Documentation
Fold Fold all Expand Expand all Are you sure you want to delete this link? Are you sure you want to delete this tag? The personal, minimalist, super fast, database-free, bookmarking service by the Shaarli community