• Shaarli
  • Tag cloud
  • Picture wall
  • Daily
  • RSS
  • Login
4261 shaares
7 / 214
Filters

Fuzzy Name Matching in Postgres | Crunchy Data Blog

QRCode

The page "Falsehoods Programmers Believe About Names" covers some of the ways names are hard to deal with in programming. This post will ignore most of those complexities, and deal with the problem of matching up loose user input to a database of names.

https://www.crunchydata.com/blog/fuzzy-name-matching-in-postgresql
February 13, 2025 at 2:18:33 PM EST *
search postgresql sql
FILLER

Interactions by Nitish Khagwal

QRCode

Handcrafted interactions focused on utility & beauty.

https://khagwal.com/interactions/
February 13, 2025 at 8:05:30 AM EST *
webdesign inspiration ux
FILLER

Pulse AI Blog - Why LLMs Suck at OCR

QRCode

LLM’s suck at complex OCR, and probably will for a while. LLMs are excellent for many text-generation or summarization tasks, but they falter at the precise, detail-oriented job of OCR—especially when dealing with complicated layouts, unusual fonts, or tables. These models get lazy, often not following prompt instructions across hundreds of pages, failing to parse information, and “thinking” too much.

LLMs process images through high-dimensional embeddings, essentially creating abstract representations that prioritize semantic understanding over precise character recognition

Consider a simple table cell containing "1,234.56". The LLM might understand this represents a number in the thousands, but lose critical information about:

Exact decimal placement
Whether commas or periods are used as separators
Font characteristics indicating special meaning
Alignment within the cell (right-aligned for numbers, etc.)

https://news.ycombinator.com/item?id=42966958

https://www.runpulse.com/blog/why-llms-suck-at-ocr
February 12, 2025 at 10:56:59 AM EST *
llm gemini pdf ocr
FILLER

Ingesting Millions of PDFs and why Gemini 2.0 Changes Everything

QRCode

Markdown extraction is just the first step. For documents to be effectively used in RAG pipelines, they must be split into smaller, semantically meaningful chunks.

Recent studies have shown that using large language models (LLMs) for this task can outperform other strategies in terms of retrieval accuracy. This intuitively makes sense - LLMs excel at understanding context and identifying natural boundaries in text, making them well-suited for generating semantically meaningful chunks.

The problem? Cost. Until now, LLM-based chunking has been prohibitively expensive. With Gemini Flash 2.0, however, the game changes again - it's pricing makes it feasible to use it to chunk documents at scale.

https://news.ycombinator.com/item?id=42952605

(disclaimer I am CEO of llamaindex, which includes LlamaParse)
Nice article! We're actively benchmarking Gemini 2.0 right now and if the results are as good as implied by this article, heck we'll adapt and improve upon it. Our goal (and in fact the reason our parser works so well) is to always use and stay on top of the latest SOTA models and tech :) - we blend LLM/VLM tech with best-in-class heuristic techniques.

Some quick notes: 1. I'm glad that LlamaParse is mentioned in the article, but it's not mentioned in the performance benchmarks. I'm pretty confident that our most accurate modes are at the top of the table benchmark - our stuff is pretty good.

  1. There's a long tail of issues beyond just tables - this includes fonts, headers/footers, ability to recognize charts/images/form fields, and as other posters said, the ability to have fine-grained bounding boxes on the source elements. We've optimized our parser to tackle all of these modes, and we need proper benchmarks for that.

  2. DIY'ing your own pipeline to run a VLM at scale to parse docs is surprisingly challenging. You need to orchestrate a robust system that can screenshot a bunch of pages at the right resolution (which can be quite slow), tune the prompts, and make sure you're obeying rate limits + can retry on failure.

https://www.sergey.fyi/articles/gemini-flash-2
February 12, 2025 at 10:51:05 AM EST *
llm pdf google gemini ocr
FILLER

Cabbage with eggs is better than pizza! Easy, quick and very delicious recipe!

QRCode

Cabbage with eggs is better than pizza! Easy, quick and very delicious recipe! Delicious, easy, quick and very delicious breakfast, lunch, dinner or quick snack recipe. You won't believe how tasty it is – even more delicious than pizza, or okay definitely healthier 😅 It's so easy to make, and you'll enjoy every step of the cooking process. Try this easy and very delicious dish and let me know in a comments "How do you like it?". 🤩🥰😋

INGREDIENTS:
1/2 small cabbage (250g / 9 oz)
2 eggs
Season with salt
1 tbsp Olive oil
1 tbsp pizza sauce
mozzarella
basil

https://www.youtube.com/watch?v=7BBKC67cXL0
February 11, 2025 at 10:22:18 AM EST *
youtube recipe cabbage
FILLER

Cabbage with eggs tastes better than meat! Easy, quick and very delicious dinner recipe!

QRCode

Cabbage with eggs tastes better than meat! Easy, quick and very delicious dinner recipe. Delicious and quick recipe for cabbage with onions! You won't believe how tasty it is – even more delicious than meat! I can't believe I didn't know about this recipe before, but now it's become one of my favorites. It's a simple recipe that only requires cabbage, onions, carrot, eggs and spices. If you have these ingredients at home, you can easily whip up this delectable cabbage pie in a pan. The taste of the cabbage in this pie is absolutely incredible, and your whole family will devour it. Whether you're looking for a quick lunch, dinner, or snack, this recipe is perfect. It's so easy to make, and you'll enjoy every step of the cooking process. So go ahead, grab some cabbage and onions, and let's cook this delightful dish together! 🤩🥳🥰😋

INGREDIENTS:
2 tbsp Olive oil
1 medium Onion
1 Carrot
2 lb / 900g white cabbage
5-6 Eggs
1 tbsp All purpose Flour (optional)
Manchego cheese (optional)
Fresh parsley (optional)

Season with Salt and black pepper
1/2 tsp Oregano
1/2 tsp Sweet Paprika
1/2 tsp Garlic powder
1/2 tsp Onion powder
1/2 tsp Dry Parsley

https://www.youtube.com/watch?v=AShFeKnEOGw
February 11, 2025 at 10:20:52 AM EST *
cabbage recipe youtube
FILLER

Clojure by Example

QRCode
https://kimh.github.io/clojure-by-example/#recur
February 11, 2025 at 10:00:20 AM EST *
clojure
FILLER

Learn Eleventy

QRCode

The best way to learn Eleventy is to build something with it.

This course was originally authored by Andy Bell and hosted at learneleventyfromscratch.com. This fork is now maintained by uncenter.

https://learn-eleventy.pages.dev/
February 11, 2025 at 9:12:09 AM EST *
eleventy blog
FILLER

Timer Replacement on Bosch SHU3032UC Dishwasher - Appliance Repair Forum - Free Service from Appliance Parts Pros

QRCode

I am looking for help replacing the timer on my 9 year old dishwasher. The timer stops when the timer knob is pointing down (almost 6 o'clock) in the wash cycle. If the knob is advanced...

https://forum.appliancepartspros.com/posts/t124088-timer-replacement-on-bosch-shu3032uc-dishwasher
February 10, 2025 at 6:44:09 PM EST *
dishwasher
FILLER

Problems solved by OpenType | Roel Nieskens | CSS Day 2024

QRCode

About Roel: https://pixelambacht.nl

Depending on whether the information is technical or not, you might want different font features.

29:20 - "Never grab a Monotype font"

font-variant-numeric: tabular-nums;

@font-face {
size-adjust: 110%;
}

https://wakamaifondue.com/
(what can my font do?)

https://pixelambacht.nl/

https://www.youtube.com/watch?v=TreBK-EyACQ
February 7, 2025 at 9:59:24 AM EST *
youtube css webdesign typography fonts
FILLER

Pattern (Java SE 9 & JDK 9 )

QRCode

Java Regular Expressions

https://docs.oracle.com/javase/9/docs/api/java/util/regex/Pattern.html
February 6, 2025 at 9:07:10 AM EST *
java regex clojure
FILLER

IronCalc - Open-Source Spreadsheet Engine

QRCode

IronCalc is an open-source spreadsheet engine and ecosystem. Try it now and help us build the future of spreadsheets!

https://www.ironcalc.com/
February 5, 2025 at 11:58:10 AM EST *
excel
FILLER

Welcome to Eleventy Notes

QRCode

A simple, lightweight, and flexible note-taking template for Eleventy.

Welcome to Eleventy Notes, an opinionated template for publishing your personal notes, documentation, or even your blog! Built with Eleventy, the powerful static site generator, Eleventy Notes is packed with features you'll love.

https://eleventy-notes.sandroroth.com/
February 4, 2025 at 9:04:35 AM EST *
eleventy blog wikipedia
FILLER

Shopmade screw blocks - FineWoodworking

QRCode

When I made these blocks to attach the solid-wood top to a dining table (“English-Inspired Trestle Table,” FWW #310)) I was aiming for the blocks’ shape to be in sympathy with the design of the corbels that act to stiffen the connection of the foot and the brace to the trestle’s post. It’s an example of echoing the overall design even in the smallest of ways.

** Could be used for poker racks??

https://www.finewoodworking.com/2024/04/03/shop-made-screw-blocks
February 3, 2025 at 3:31:46 PM EST *
poker tables woodworking
FILLER

Colette Bresilla's 'Se Refléter Dans' is a lively, lovely and layered survey of the artist's work - Cambridge Day

QRCode

Se Refléter Dans (“Reflected”) is a lively and lovely survey of 30 years of work by artist Colette Bresilla pairing multimedia sculptures with paintings large and small.

https://www.cambridgeday.com/2024/06/23/colette-bresillas-se-refleter-dans-is-a-lively-lovely-and-layered-survey-of-the-artists-work/
February 3, 2025 at 3:08:27 PM EST *
art inspiration
FILLER

Which AI to Use Now: An Updated Opinionated Guide

QRCode

Picking your general-purpose AI

Also:

https://www.oneusefulthing.org/p/doing-stuff-with-ai-opinionated-midyear

https://www.oneusefulthing.org/p/which-ai-to-use-now-an-updated-opinionated
February 3, 2025 at 3:02:24 PM EST *
ai llm
FILLER

Paul Sellers | How to make a Shooting Board

QRCode

In this video Paul Sellers shows how to make a simple shooting board from pine. This is an important piece of equipment for woodworking in the hand tool shop as it allows you to make sure the end of a board is either exactly square or exactly forty-five degrees. It also allows you to plane the end of a board straight across without 'break-out' on the far side.

At 1:47 the middle measurement should be 12 1/2" or 320mm. Sorry for the confusion.

https://www.youtube.com/watch?v=-Ypbvcxb-8M
February 3, 2025 at 2:59:13 PM EST *
woodworking planes
FILLER

MR11 vs MR16 - difference between MR11 and MR16 bulbs

QRCode

The main difference between MR11 and MR16 are as follows:

  1. Size The MR-16 bulbs diameter is 2 inch (50mm)
    The MR-11 bulbs diameter is 1.375 inch (35mm)

  2. Base the most common base for MR16 bulbs is GU5.3 or GX5.3 base - 2 pin base with 5.3mm between pins the most common base for MR11 bulbs is GU4 - 2 pin base with 4mm between base

https://www.bulbamerica.com/pages/mr11-vs-mr16-difference-between-mr11-and-mr16-bulbs
February 3, 2025 at 2:46:16 PM EST *
lighting bulbs
FILLER

Loomings – Christopher Volpe

QRCode
https://www.christophervolpe.com/loomings/
February 3, 2025 at 2:44:59 PM EST *
drawing art inspiration
FILLER

For Retirees in Their 60s, the Move That Adds Years to a Nest Egg

QRCode

Roth IRA Conversions

Taxable income tends to increase for many retirees at 73, when those born between 1950 and 1959 are required to take withdrawls from traditional retirement accounts.

https://www.wsj.com/personal-finance/taxes/for-retirees-in-their-60s-the-move-that-adds-years-to-a-nest-egg-199e467e
February 3, 2025 at 2:43:40 PM EST *
retirement finance
FILLER
7 / 214
Shaarli · The personal, minimalist, super fast, database-free, bookmarking service by the Shaarli community · Documentation
Fold Fold all Expand Expand all Are you sure you want to delete this link? Are you sure you want to delete this tag? The personal, minimalist, super fast, database-free, bookmarking service by the Shaarli community