Publications

Statistics, LLMs, data science, AI, and anything related.

No Translation Needed: Forecasting Quality from Fertility and Metadata
The Token Tax: Systematic Bias in Multilingual Tokenization
Linear Regression Explained - An explanation of linear regression concepts.
Image Generation via Conditional Variational Autoencoders - How to use a CVAE to generate images.
Image Generation via Diffusion - How to use diffusion to generate images.
The Cyber Nightmare: Personal Information on the World-Wide (Spider) Web - How to keep your personal information safe.