Hands on the Python Library pdfplumber

Jul 15, 2021

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

Stirling-PDF is a robust, locally hosted, web-based PDF manipulation tool that runs in Docker.

Directly Initialize a Hashmap in Java

Dec 10, 2022

In Java 9+, Map.of initializes an immutable map (not a HashMap) with up to 10 key-value pairs.

Extract Identifier Names from a SQL Query Using Sqlfluff

Apr 18, 2021

Get Size of Tables on HDFS

Nov 10, 2020

The HDFS Way

You can use the command hdfs dfs -du /path/to/table or hdfs dfs -count -q -v -h /path/to/table to get the size of an HDFS path (for example, the directory backing a table). However, this works only if you have direct access to HDFS on the cluster. If a Spark cluster exposes only JDBC/ODBC APIs, this method does not work.
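The command above can be wrapped in a small script. The following is a minimal sketch, assuming the hdfs CLI is on the PATH and that hdfs dfs -du (without -h) prints one line per child with the size in bytes in the first column and the path in the last column. The helper names parse_du_output and hdfs_du are hypothetical, and paths containing whitespace are not handled.

```python
import subprocess


def parse_du_output(text):
    """Map each path to its size in bytes, given the text printed by `hdfs dfs -du`.

    Assumes the first column is the size in bytes and the last column is the path.
    """
    sizes = {}
    for line in text.splitlines():
        fields = line.split()
        if len(fields) < 2:
            continue  # skip blank or malformed lines
        sizes[fields[-1]] = int(fields[0])
    return sizes


def hdfs_du(path):
    """Run `hdfs dfs -du <path>` and return a {child_path: size_in_bytes} dict."""
    result = subprocess.run(
        ["hdfs", "dfs", "-du", path],
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if the command fails
    )
    return parse_du_output(result.stdout)
```

Summing the values of the returned dict gives the total size of the table directory (before replication).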

Read/Write Files/Tables in Spark

Sep 27, 2020

References

DataFrameReader APIs

DataFrameWriter APIs

https://spark.apache.org/docs/latest/sql-programming-guide.html#data-sources

Comments

It is suggested that you specify a schema when reading text files. If you do not specify one, it is good practice to check the column types afterwards, as they are inferred.
Do NOT read data from and write data to the same path in Spark! Because Spark evaluates lazily, the path will likely be cleared before the data is actually read, which throws IO exceptions. Worse, the existing data on HDFS is removed in the process and may not be recoverable.
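Both comments can be combined in one helper. The following is a minimal PySpark-style sketch, assuming a CSV input with a header row. The column names in SCHEMA, the helper names same_location and rewrite_csv_as_parquet, and the choice of Parquet as the output format are all hypothetical. The path check is purely lexical, so it does not catch two different spellings of the same HDFS location (for example, with and without the hdfs:// prefix).

```python
import posixpath

# Hypothetical schema as a DDL string; replace with the real columns.
SCHEMA = "id INT, name STRING, amount DOUBLE"


def same_location(src, dst):
    """Lexically compare two paths, ignoring trailing slashes and '.' segments."""
    return posixpath.normpath(src) == posixpath.normpath(dst)


def rewrite_csv_as_parquet(spark, src, dst):
    """Read CSV from `src` with an explicit schema and write Parquet to `dst`.

    `spark` is an existing SparkSession. Refuses to run if `dst` is the same
    path as `src`, since an overwrite could clear the input before it is read.
    """
    if same_location(src, dst):
        raise ValueError("Refusing to read from and write to the same path: " + src)
    # An explicit schema avoids relying on inferred column types.
    df = spark.read.schema(SCHEMA).option("header", True).csv(src)
    df.write.mode("overwrite").parquet(dst)
    return df
```

When the result has to replace the input, one option is to write to a temporary path first and swap the directories only after the write has succeeded.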

Convert Math Formula and Table To LaTeX

Nov 13, 2012

R

xtable{xtable}
- Good for converting tables to LaTeX code.
latex{Hmisc}
- Converts R objects (not just tables) to LaTeX code.

Excel

MATLAB

Mathematica

Type the formula into Mathematica.
Select the formula.
Right-click the selection, and then select "Copy as" -> "LaTeX".
You can also convert formulas to other …

Ben Chuanlong Du's Blog

It is never too late to learn.
