I was thinking of using it with DuckDB as well, but it seems it would be of limited benefit. Parquet objects are in the MBs, so they would be streamed directly from S3. With raw Parquet objects, it might help with S3 listing if you have a lot of them (shave a couple of seconds off the query). If you are already on DuckLake, DuckDB will use that to get the list of relevant objects anyway.
Maybe the OP is thinking of reading/writing DuckDB's native-format database files. Those require full filesystem semantics for writing. Unfortunately, even NFS or SMB are not sufficiently FS-like for DuckDB.
Parquet files are static and write-once (you only ever add new files), so DuckDB has no problems with those living on S3.
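For anyone curious, a minimal sketch of that pattern in Python (bucket, prefix, and column names are made up; assumes the httpfs extension and that S3 credentials are already configured, e.g. via a DuckDB secret or AWS environment variables):

    import duckdb

    # In-memory database; a native .duckdb file would need a real local filesystem.
    con = duckdb.connect()
    con.execute("INSTALL httpfs")
    con.execute("LOAD httpfs")

    # DuckDB streams the immutable Parquet objects straight from S3 using
    # HTTP range requests; it never needs to rewrite them in place.
    rows = con.execute("""
        SELECT user_id, count(*) AS events
        FROM read_parquet('s3://my-bucket/events/*.parquet')  -- placeholder path
        GROUP BY user_id
        ORDER BY events DESC
        LIMIT 10
    """).fetchall()
    print(rows)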
Pre-compaction, the recent data can be in small files, and the delete markers will also be in small files. This would bring down fetch times, while DuckLake may already have many of the larger blocks in memory or disk cache.
Reading block headers for filtering means lots of small range reads; this could speed that up by 10x.
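To make the header-read point concrete, here's a sketch using DuckDB's parquet_metadata() function, which fetches only an object's footer and row-group statistics (small byte-range requests, no data pages); the path is a placeholder:

    import duckdb

    con = duckdb.connect()
    con.execute("INSTALL httpfs")
    con.execute("LOAD httpfs")

    # Only the Parquet footer is fetched here -- the same min/max stats
    # DuckDB uses for filter pushdown, and the access pattern that gains
    # the most from lower per-request latency.
    meta = con.execute("""
        SELECT row_group_id, column_id, stats_min, stats_max
        FROM parquet_metadata('s3://my-bucket/events/part-0.parquet')
    """).fetchall()
    for row in meta[:5]:
        print(row)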
For files up to 100 kB in size, this should be effectively the same price as S3 for writes (I didn't check reads as closely, but writes/PUTs are always much more expensive than reads/GETs).
It would be really useful pre-compaction, and for dealing with the small-files issue without latency penalties.
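Back-of-envelope check of the PUT-vs-GET gap, using assumed S3 Standard us-east-1 list prices (treat the constants as placeholders; check the current pricing page before relying on them):

    # Assumed prices; may be out of date.
    PUT_PER_1K = 0.005   # USD per 1,000 PUT/POST/LIST requests
    GET_PER_1K = 0.0004  # USD per 1,000 GET requests

    n_objects = 1_000_000  # e.g. pre-compaction ~100 kB objects

    put_cost = n_objects / 1000 * PUT_PER_1K
    get_cost = n_objects / 1000 * GET_PER_1K
    print(f"PUTs: ${put_cost:.2f}, GETs: ${get_cost:.2f}, "
          f"ratio: {put_cost / get_cost:.1f}x")
    # -> PUTs: $5.00, GETs: $0.40, ratio: 12.5x (writes dominate request costs)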
When you say just "Cortex" it's ambiguous, as there are Cortex Search, Agents, Analyst, and Code.
Cortex Code is available via web and CLI. The web version is good. I've used the CLI and it's fine too, though I prefer the visuals of the web version when looking at data outputs. For writing code it is similar to Codex or Claude Code, though I gather it is more data-focused than the other options and has great hooks into your Snowflake tables. You could do similar things with Snowpark and, say, Claude Code. I find Snowflake's focus on personas more functional than purely technical, so Cortex Code fits well with that. If you want to do your own thing, you can use your own IDE and code agent, and then the Cortex Code CLI is just one option alongside Codex, Cursor, or Claude Code.
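If you go the own-IDE route, the Snowpark side is a few lines. A hedged sketch (connection parameters and the table/columns are placeholders, not anything Cortex provides):

    from snowflake.snowpark import Session
    from snowflake.snowpark.functions import col, sum as sum_

    # Placeholder credentials; use key-pair auth or SSO in practice.
    session = Session.builder.configs({
        "account": "my_account",
        "user": "my_user",
        "password": "***",
        "warehouse": "ANALYTICS_WH",
        "database": "SALES",
        "schema": "PUBLIC",
    }).create()

    # The same kind of table access a code agent would script against.
    top_regions = (
        session.table("ORDERS")                       # placeholder table
        .filter(col("ORDER_DATE") >= "2024-01-01")
        .group_by("REGION")
        .agg(sum_("AMOUNT").alias("TOTAL"))
        .sort(col("TOTAL").desc())
        .limit(5)
    )
    top_regions.show()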
We've (https://www.definite.app/) replaced quite a few Metabase accounts now, and we have a built-in lakehouse using DuckDB + DuckLake, so I feel comfortable calling us a "DuckDB-based Metabase alternative".
When I see the title here, I think "BI with an embedded database", which is what we're building at Definite. A lot of people want dashboards / AI analysis without buying Snowflake, Fivetran, and a BI tool and stitching them all together.
If this had happened before 4 PM Eastern, I would have been screwed on my main early-stage project. I guess it's time to move up the timeline on a real backend with failover.
yep, that's what Definite is for: https://www.definite.app/
All the data infra (datalake + ELT/ETL + dashboards) you need in 5 minutes.