This week has been about statistical accuracy vs privacy protection, an open-source alternative to Notion, a parsing toolkit for Python, the International Components for Unicode, a book on Knowledge Graphs and a service to convert web pages into ePub.
A service to convert web pages into PDF, ePub, and Kindle (mobi) files.
A set of C/C++ and Java libraries providing Unicode and Globalization support for software applications.
ICU4X is a more recent approach learning from ICU4C and ICU4J to solve internationalisation (i18n).
A paper introducing an economic model attempting to address the dichotomy between statistical accuracy and privacy protection. In a gist:
The fundamental challenge posed in servicing this dual mandate is that as more statistics are published with more accuracy, more privacy is lost.
An open-source alternative to Notion with full control of your data and customizations.
A book providing a thorough introduction to knowledge graphs as well as trying to get consensus on a definition for the term:
The definition of a “knowledge graph” remains contentious […]. Herein we adopt an inclusive definition, where we view a knowledge graph as a graph of data intended to accumulate and convey knowledge of the real world, whose nodes represent entities of interest and whose edges represent relations between these entities.
A parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.