Window Functions in Spark
Window with orderBy
It is tricky!!!
If you provide an ORDER BY clause, the default frame is RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW:
https://stackoverflow.com/questions/52273186/pyspark-spark-window-function-first-last-issue
Avoid using last; use first with a descending order by instead. This gives fewer surprises. Do NOT use order by if it is not necessary. It introduces unnecessary ...
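The frame behavior described above can be reproduced without a Spark cluster. The following is a minimal sketch that uses Python's built-in sqlite3 module as a stand-in, on the assumption that the bundled SQLite is version 3.25 or newer (window function support) and applies the same standard SQL default frame when ORDER BY is present. The table `t` and its columns `grp`, `ts`, and `val` are hypothetical names chosen for illustration. It is not Spark code, but it shows why last over an ordered window tends to return the current row's value, while first with a descending order returns the value of the final row.

```python
import sqlite3

# In-memory database with a tiny, made-up table (names are illustrative only).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (grp TEXT, ts INTEGER, val TEXT)")
conn.executemany(
    "INSERT INTO t VALUES (?, ?, ?)",
    [("a", 1, "x"), ("a", 2, "y"), ("a", 3, "z")],
)

# With ORDER BY and no explicit frame, the frame ends at the current row,
# so last_value() only "sees" rows up to the current one.
# first_value() over a descending order sees the largest ts first.
rows = conn.execute(
    """
    SELECT ts,
           last_value(val)  OVER (PARTITION BY grp ORDER BY ts)      AS last_default,
           first_value(val) OVER (PARTITION BY grp ORDER BY ts DESC) AS first_desc
    FROM t
    ORDER BY ts
    """
).fetchall()

for row in rows:
    print(row)
```

In this sketch `last_default` changes from row to row, whereas `first_desc` is the same for every row in the partition, which is usually what is wanted when asking for the "last" value of a group.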
Install the Latest Version of Python in Ubuntu
Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!
There are a few ways to install a newer version of Python on Ubuntu. If you want to completely overwrite the system version of Python, it is suggested that you install …
Tips on FastApi
Special Numbers in Python
Python Profiler for JupyterLab Notebooks
%time
Measure the execution time of the code ONCE.
%timeit
Measure the execution time of the code (accurately) by running it MULTIPLE TIMES and taking the average.
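Outside a notebook, the difference between a single timing and a repeated timing can be approximated with the standard timeit module. This is only a rough sketch of the idea, not what the magics do internally in full (for example, %time also reports CPU time); the function `work` and the repeat counts are arbitrary choices for illustration.

```python
import timeit


def work():
    # Hypothetical workload used only for illustration.
    return sum(i * i for i in range(10_000))


# Roughly the idea behind %time: a single run, so the number is noisy.
single = timeit.timeit(work, number=1)

# Roughly the idea behind %timeit: many runs, then average per call.
loops = 100
runs = timeit.repeat(work, number=loops, repeat=5)
per_call = [t / loops for t in runs]
mean = sum(per_call) / len(per_call)

print(f"single run: {single:.6f} s")
print(f"mean of {len(runs)} x {loops} runs: {mean:.6f} s per call")
```

The single measurement can vary noticeably between invocations, while the averaged figure is typically more stable.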
%prun | %%prun
-D: output the profiling results into a file so that you can use other tools (e.g., snakeviz) to visualize them.
Notice that %prun and %%prun are based on cProfile.
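Since the magics are based on cProfile, a similar stats file can be produced in a plain script. The sketch below is an assumption about how one might do this by hand rather than a description of what %prun does internally; the function `work` and the file name `work.prof` are hypothetical.

```python
import cProfile
import os
import pstats
import tempfile


def work():
    # Hypothetical workload used only for illustration.
    return sorted(str(i) for i in range(50_000))


# Profile a single call to work().
profiler = cProfile.Profile()
profiler.enable()
work()
profiler.disable()

# Dump the raw stats to a file, similar in spirit to the -D option.
stats_path = os.path.join(tempfile.mkdtemp(), "work.prof")
profiler.dump_stats(stats_path)

# Load the file back and print the five most expensive entries.
stats = pstats.Stats(stats_path)
stats.sort_stats("cumulative").print_stats(5)
```

A file written this way can then be opened with a visualizer such as snakeviz, for example by passing the path to the snakeviz command.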