Ben Chuanlong Du's Blog

It is never too late to learn.

Unified SQL Syntax

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

Trino

Trino is a distributed SQL query engine for big data. It is formerly known as PrestoSQL.

ZetaSQL

ZetaSQL is a customized SQL dialect, along with parser and analyzer, that Google …

SQL Database Client-server Protocols

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

  1. Apache Arrow Flight is the future protocol for querying Databases! It use columnar data and leverages Apache Arrow to avoid unnecessary copy of data, which makes it able to query large …

Cloud Object Storage

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

Amazon S3

Google Cloud Storage

Use Tauri to Build a Desktop Application

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

Tauri is a framework for building tiny, blazing fast binaries for all major desktop platforms. Developers can integrate any front-end framework that compiles to HTML, JS and CSS for building their …

Use TableSample in SQL

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

The limit clause (or the method DataFrame.limit if you are using Spark) is a better alternative if randomness is not critical.

PostgreSQL

SELECT id from table TABLESAMPLE BERNOULLI(10) WHERE …