Issue 2021-W46

Published on

Subscribe to new issues of the bulletin via the RSS feed or via email.

This week has been about statistical accuracy vs privacy protection, an open-source alternative to Notion, a parsing toolkit for Python, the International Components for Unicode, a book on Knowledge Graphs and a service to convert web pages into ePub.

# txtpaper

A service to convert web pages into PDF, ePub, and Kindle (mobi) files.

# ICU: International Components for Unicode

A set of C/C++ and Java libraries providing Unicode and Globalization support for software applications.

ICU4X is a more recent approach learning from ICU4C and ICU4J to solve internationalisation (i18n).

# An Economic Analysis of Privacy Protection and Statistical Accuracy as Social Choices (pdf)

A paper introducing an economic model attempting to address the dichotomy between statistical accuracy and privacy protection. In a gist:

The fundamental challenge posed in servicing this dual mandate is that as more statistics are published with more accuracy, more privacy is lost.

# AppFlowy

An open-source alternative to Notion with full control of your data and customizations.

# Knowledge Graphs

A book providing a thorough introduction to knowledge graphs as well as trying to get consensus on a definition for the term:

The definition of a “knowledge graph” remains contentious […]. Herein we adopt an inclusive definition, where we view a knowledge graph as a graph of data intended to accumulate and convey knowledge of the real world, whose nodes represent entities of interest and whose edges represent relations between these entities.

# Lark

A parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.