Generating Content with ChatGPT - Perl Hacks
Back in January, I wrote a blog post about adding JSON-LD to your web pages to make it easier for Google to understand what they were about. The example I used was my ReadABooker site, which encourages people to read more Booker Prize shortlisted novels (and to do so by buying them using my Amazon…
How did *thinking* reasoning LLMs go from a GitHub experiment 4 months ago to every major company offering super-advanced thinking models that can iterate on code and plan it internally only 4 months later? It seems a bit fast. Was it already developed by major companies, but unreleased? : MLQuestions
It was like a revelation when chain-of-thought AI became viral news as a GitHub project that supposedly competed with SOTA models with only 2 developers and some nifty prompting...
Did all the companies just jump on the bandwagon and weave it into GPT / Gemini / Claude in a hurry?
Did those companies already have e.g. Gemini 2.5 Pro thinking in development 4 months ago and we just didn't know?
A simple search engine from scratch* | Max Bernstein
*if you include word2vec.
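The post builds retrieval on top of word2vec-style vectors. As a rough illustration of the core ranking idea only (my sketch; the post's own code differs), embed a document or query as the mean of its word vectors and score by cosine similarity:

```perl
use strict;
use warnings;
use List::Util qw(sum);

# Assumes a %vec table mapping words to same-length numeric vectors,
# e.g. loaded from a word2vec file (not shown here).
sub mean_vector {
    my @vecs = @_ or return;
    my $dim = @{ $vecs[0] };
    return [ map { my $i = $_; sum(map $_->[$i], @vecs) / @vecs } 0 .. $dim - 1 ];
}

sub cosine {
    my ($x, $y) = @_;
    my ($dot, $nx, $ny) = (0, 0, 0);
    for my $i (0 .. $#$x) {
        $dot += $x->[$i] * $y->[$i];
        $nx  += $x->[$i] ** 2;
        $ny  += $y->[$i] ** 2;
    }
    my $norm = sqrt($nx) * sqrt($ny);
    return $norm ? $dot / $norm : 0;
}

# Embed a text as the mean of its known word vectors.
sub embed {
    my ($text, $vec) = @_;
    my @known = grep { defined } map { $vec->{ lc $_ } } split /\W+/, $text;
    return @known ? mean_vector(@known) : undef;
}
```

Ranking is then just sorting documents by cosine($query_vec, $doc_vec), descending.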
Why the Coolest Job in Tech Might Actually Be in a Bank
For tech and AI talent, jobs at financial services companies are more desirable than they have ever been. Banks have been working hard to make it happen.
Presentation Slide Templates | Beautiful.ai
Build your next presentation in minutes with our free slide templates! No matter what you’re creating, Beautiful.ai has the template for you.
Personal Software: The Unbundling of the Programmer?
Why LLMs will transform development but not how you think
It's about how AI tools are enabling a new category of software that simply couldn't exist before.
When someone can describe their specific needs conversationally and receive working code in response, the economics of personal software development shift dramatically.
Think of it this way: just as spreadsheets enabled non-programmers to perform complex calculations and data analysis, AI-assisted development tools are enabling non-programmers to create personal software solutions.
Pulse AI Blog - Why LLMs Suck at OCR
LLMs suck at complex OCR, and probably will for a while. LLMs are excellent for many text-generation or summarization tasks, but they falter at the precise, detail-oriented job of OCR—especially when dealing with complicated layouts, unusual fonts, or tables. These models get lazy, often not following prompt instructions across hundreds of pages, failing to parse information, and “thinking” too much.
LLMs process images through high-dimensional embeddings, essentially creating abstract representations that prioritize semantic understanding over precise character recognition.
Consider a simple table cell containing "1,234.56". The LLM might understand this represents a number in the thousands, but lose critical information about (see the sketch after this list):
- Exact decimal placement
- Whether commas or periods are used as separators
- Font characteristics indicating special meaning
- Alignment within the cell (right-aligned for numbers, etc.)
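To make that concrete, here's a toy sketch (mine, not from the Pulse post): score a plausible misread of that cell where the model preserves the "meaning" but swaps the separators into the European convention. The character error rate is far from zero even though the two strings look semantically interchangeable.

```perl
use strict;
use warnings;
use List::Util qw(min);

# Classic dynamic-programming Levenshtein distance between two strings.
sub levenshtein {
    my ($x, $y) = @_;
    my @x = split //, $x;
    my @y = split //, $y;
    my @prev = (0 .. scalar @y);
    for my $i (1 .. @x) {
        my @cur = ($i);
        for my $j (1 .. @y) {
            my $cost = $x[ $i - 1 ] eq $y[ $j - 1 ] ? 0 : 1;
            push @cur, min($prev[$j] + 1, $cur[ $j - 1 ] + 1, $prev[ $j - 1 ] + $cost);
        }
        @prev = @cur;
    }
    return $prev[-1];
}

my $truth = '1,234.56';    # what the cell actually contains
my $guess = '1.234,56';    # same "number", separators swapped (EU style)

# Character error rate: edit distance normalised by reference length.
my $cer = levenshtein($truth, $guess) / length $truth;
printf "CER: %.2f\n", $cer;    # 0.25 - a quarter of the characters are wrong
```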
Ingesting Millions of PDFs and why Gemini 2.0 Changes Everything
Markdown extraction is just the first step. For documents to be effectively used in RAG pipelines, they must be split into smaller, semantically meaningful chunks.
Recent studies have shown that using large language models (LLMs) for this task can outperform other strategies in terms of retrieval accuracy. This intuitively makes sense - LLMs excel at understanding context and identifying natural boundaries in text, making them well-suited for generating semantically meaningful chunks.
The problem? Cost. Until now, LLM-based chunking has been prohibitively expensive. With Gemini Flash 2.0, however, the game changes again: its pricing makes it feasible to chunk documents at scale.
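For a sense of what "LLM-based chunking" means in practice, here's a minimal sketch (mine, not from the article). It asks Gemini Flash to mark semantic boundaries with an explicit separator token; the endpoint shape and model name follow Google's public REST docs at the time of writing, so treat them as assumptions:

```perl
use strict;
use warnings;
use Mojo::UserAgent;

my $key   = $ENV{GEMINI_API_KEY} or die "Set GEMINI_API_KEY\n";
my $model = 'gemini-2.0-flash';
my $url   = "https://generativelanguage.googleapis.com/v1beta/"
          . "models/$model:generateContent?key=$key";

my $markdown = do { local $/; <> };    # a page of Markdown on STDIN
my $prompt   = <<"PROMPT";
Split the following Markdown into semantically coherent chunks of
roughly 250-500 words. Do not rewrite anything; copy the text verbatim
and insert a line containing only <<<CHUNK>>> between chunks.

$markdown
PROMPT

# One generateContent call per page; the response text comes back with
# our separator token at each semantic boundary.
my $tx = Mojo::UserAgent->new->post($url => json =>
    { contents => [ { parts => [ { text => $prompt } ] } ] });
die $tx->result->message unless $tx->result->is_success;

my $text   = $tx->result->json->{candidates}[0]{content}{parts}[0]{text};
my @chunks = split /^<<<CHUNK>>>$/m, $text;
printf "%d chunks\n", scalar @chunks;
```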
https://news.ycombinator.com/item?id=42952605
(Disclaimer: I am CEO of LlamaIndex, which includes LlamaParse.)
Nice article! We're actively benchmarking Gemini 2.0 right now and if the results are as good as implied by this article, heck we'll adapt and improve upon it. Our goal (and in fact the reason our parser works so well) is to always use and stay on top of the latest SOTA models and tech :) - we blend LLM/VLM tech with best-in-class heuristic techniques.
Some quick notes: 1. I'm glad that LlamaParse is mentioned in the article, but it's not mentioned in the performance benchmarks. I'm pretty confident that our most accurate modes are at the top of the benchmark table - our stuff is pretty good.
2. There's a long tail of issues beyond just tables - this includes fonts, headers/footers, the ability to recognize charts/images/form fields and, as other posters said, the ability to have fine-grained bounding boxes on the source elements. We've optimized our parser to tackle all of these modes, and we need proper benchmarks for that.
3. DIY'ing your own pipeline to run a VLM at scale to parse docs is surprisingly challenging. You need to orchestrate a robust system that can screenshot a bunch of pages at the right resolution (which can be quite slow), tune the prompts, and make sure you're obeying rate limits and can retry on failure.
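That last point (rate limits and retries) is generic plumbing. A minimal sketch of the usual approach, exponential backoff with jitter, not tied to any particular vendor (the names here are my own):

```perl
use strict;
use warnings;

# Retry a request with exponential backoff plus jitter. $request is a
# code ref returning (HTTP status, body).
sub with_retries {
    my ($request, %opt) = @_;
    my $max  = $opt{max_tries}  // 5;
    my $base = $opt{base_delay} // 1;    # seconds
    for my $try (1 .. $max) {
        my ($status, $body) = $request->();
        return $body if $status == 200;
        # 429 (rate limited) and 5xx are worth retrying; anything else
        # deserves its own handling in a real system.
        die "Giving up after $try tries (status $status)\n" if $try == $max;
        sleep int($base * 2 ** ($try - 1) + rand() + 0.5);
    }
}

# Usage: my $json = with_retries(sub { ... screenshot page + VLM call ... });
```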
Which AI to Use Now: An Updated Opinionated Guide
Picking your general-purpose AI
Also:
https://www.oneusefulthing.org/p/doing-stuff-with-ai-opinionated-midyear
Collaborators needed to bring full OpenAI support to Perl
Thus, that module was deprecated in favor of Nelson's OpenAPI::Client::OpenAI module. Throw the 13K+ line OpenAPI spec for OpenAI at it and it just works. Further, the module is pretty much a single Perl class rather than a bunch of hand-crafted code.
CPAN authors know it can be hard to keep modules up-to-date (mea culpa, mea culpa!) and this module is no exception. I need this module, so I offered to collaborate and created a PR to update it to version 2.0.0 of the OpenAI spec. It now passes all the tests (for those wondering, you need an OpenAI key and it costs $0.04 USD to run the test suite).
In trying to build a Whisper pipeline for that, I found that I couldn't. There was a PR for Whisper support for the older module, but for the newer one, I can't figure out how to get it to issue a request with multipart/form-data support. I've noted the issue in the PR.
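For anyone hitting the same wall, one possible workaround while the PR is sorted out (my sketch, not part of OpenAPI::Client::OpenAI) is to drop down to Mojo::UserAgent, whose form generator switches to multipart/form-data whenever a field value is a hash ref with a file key:

```perl
use strict;
use warnings;
use Mojo::UserAgent;

# Call the Whisper transcription endpoint directly. Mojo::UserAgent's
# "form" generator sends multipart/form-data automatically when a
# field value is a hash ref containing a "file" key.
my $key = $ENV{OPENAI_API_KEY} or die "Set OPENAI_API_KEY\n";
my $tx  = Mojo::UserAgent->new->post(
    'https://api.openai.com/v1/audio/transcriptions'
        => { Authorization => "Bearer $key" }
        => form => {
            model => 'whisper-1',
            file  => { file => 'meeting.mp3' },    # path to local audio
        },
);
die $tx->result->message unless $tx->result->is_success;
print $tx->result->json->{text}, "\n";
```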
Photoshop for text — Steph Ango
In the near future, transforming text over an entire document will become as commonplace as filtering images.
Up until now, text editors have been focused on input. The next evolution of text editors will make it easy to alter, summarize and lengthen text. You’ll be able to do this for entire documents, not just individual sentences or paragraphs. The filters will be instantaneous and as good as if you wrote the text yourself. You will also be able to do this with local files, on your device, without relying on remote servers.
In “A camera for ideas”, I coined the term synthography to describe synthetic images created with generative models.
Text generator plugin for Obsidian to generate text content using GPT-3 (OpenAI) | GitHub - nhaouari/obsidian-textgenerator-plugin
Text generator is a handy plugin for Obsidian that helps you generate text content using GPT-3 (OpenAI).
Use Text Generator to generate ideas, attractive titles, summaries, outlines, and whole paragraphs based on your knowledge database.
14islands | The art of prompting: An introduction to Midjourney
A great deal of my learning and inspiration comes from the great content by Yubin Ma at AiTuts, where you can learn more about prompting and view a myriad of examples.
GitHub - abi/screenshot-to-code: Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue).
Ask HN: Tutorial on LLM / already grasp neural nets | Hacker News
I've watched the 4 videos from 3blue1brown on neural nets. The web and YouTube are awash with mediocre videos on Large Language Models. I'm looking for a good one.
This is part of a longer series but is maybe the single best video I know of on the topic:
https://youtu.be/kCc8FmEb1nY?si=zmBleKwlpV06O3Mw
I thought this video from Stephen Wolfram was also quite good:
https://www.youtube.com/live/flXrLGPY3SU?si=SrP1EJFMPJqVCFPL
GitHub - varunshenoy/opendream: An extensible, easy-to-use, and portable diffusion web UI 👨‍🎨
An extensible, easy-to-use, and portable diffusion web UI 👨‍🎨
LLM: A CLI utility and Python library for interacting with Large Language Models
A CLI utility and Python library for interacting with Large Language Models, both via remote APIs and models that can be installed and run on your own machine.
What are embeddings?
A deep-dive into machine learning embeddings.
How to Use AI to Do Stuff: An Opinionated Guide
Covering the state of play as of Summer 2023.
Patterns for Building LLM-based Systems & Products
Evals, RAG, fine-tuning, caching, guardrails, defensive UX, and collecting user feedback.