@arjen
Another Dutch BERT model: BERTje
github.com/wietsedv/bertje

Say hi to RobBERT, a Dutch language model based on RoBERTa with some tasks specific to Dutch.

Slides for all lectures at the #Siks #IR course:

informagus.nl/events/siks-ir-2

There you can also find the information for this afternoon's group experiment.

An AI Wizard of Words
by @wftl
A look at using OpenAI's Generative Pretrained Transformer 2 (GPT-2) to generate text.
linuxjournal.com/content/ai-wi
#python #AI #HowTo

@hiemstra
Mainly metadata, the title, the abstract, etc.

A database of Computer Science arXiv data collected by Matthew Kenney

now has an option for a single column interface! Go to "Preferences" -> "Appearance" and untick "Enable advanced web interface".

This will be the default interface for new users.

**Fair is Better than Sensational:
Man is to Doctor as Woman is to Doctor**

arxiv.org/pdf/1905.09866.pdf

A recent paper by researchers at the University of Groningen shows that analogies like "a man is to computer programmer as woman is to homemaker" are not the most accurate way to show the bias that exists in the data.
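A rough sketch of how such analogy queries are usually computed, and of one reading of the paper's title: common analogy code refuses to return any of the input words, so "doctor" can never be the answer to "man is to doctor as woman is to ?". The five 3-d vectors below are hypothetical toy values invented for this illustration, not real embeddings, and the `analogy` helper is an assumed name rather than any library's API.

```python
import numpy as np

# Hypothetical toy vectors, invented for illustration only.
VECTORS = {
    "man": np.array([0.1, 0.0, 0.95]),
    "woman": np.array([-0.1, 0.0, 0.95]),
    "doctor": np.array([0.05, 1.0, 0.3]),
    "nurse": np.array([-0.3, 0.7, 0.5]),
    "homemaker": np.array([-0.3, 0.0, 0.8]),
}


def cosine(u, v):
    """Cosine similarity between two vectors."""
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))


def analogy(a, b, c, exclude_inputs):
    """Answer 'a is to b as c is to ?' by ranking words on
    cosine similarity to the offset vector b - a + c."""
    target = VECTORS[b] - VECTORS[a] + VECTORS[c]
    candidates = [
        w for w in VECTORS
        if not (exclude_inputs and w in (a, b, c))
    ]
    return max(candidates, key=lambda w: cosine(VECTORS[w], target))


# Unconstrained: every word, including the inputs, may be returned.
print(analogy("man", "doctor", "woman", exclude_inputs=False))
# Constrained: the three input words are removed from the candidates.
print(analogy("man", "doctor", "woman", exclude_inputs=True))
```

With these toy values the unconstrained query returns "doctor" and the constrained one returns "nurse". This only illustrates the mechanism; it says nothing about what real embeddings contain.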

[ICYMI on twitter] The Deep Learning track at #TREC2019 will feature two tasks—a DOCUMENT RANKING task and a PASSAGE RANKING task—both with large training datasets.

Approximate dataset stats: 367K training queries + 3.2M documents for the document ranking task; and 500K training queries + 8.8M passages for the passage ranking task.

Official guidelines: microsoft.github.io/TREC-2019-
Coordinators: Nick Craswell, Bhaskar Mitra, Emine Yilmaz, Daniel Campos.

@Erik @Heining @hiemstra
Distribution of search frequencies for "UTwente" in the Netherlands.
It seems that UT has no presence in the west of the Netherlands :)

I did another experiment with different alternatives for each university:
1 - Full names of each university in English in the whole world
2 - Full names of each university in Dutch in the Netherlands
3 - Abbreviated names of each university in the Netherlands

As Anne suggested, in the Netherlands the term UTwente is used more often in search engines. Globally, however, "UTwente" is searched relatively less than UVA and, to some extent, TU Delft.

I used "trendyy", an interesting R package for querying Google Trends, to obtain and visualize the search popularity of Dutch universities over the past few years.
The results were not exactly what I expected! 🤔

This flawed decision has to be overturned.

Science is not politics.

IEEE, a major science publisher, bans Huawei scientists from reviewing papers | Science | AAAS
sciencemag.org/news/2019/05/ie

Scientists worldwide should work together.

Note that we HAVE serious doubts about so many things in CS. CISCO and Intel "bugs" could be by design, or be widely known in Secret Services, who perhaps co-fund Google, etc etc. NIST and NSA story on encryption. Research co-funded by DARPA.

Full Stack Deep Learning bootcamp 2019 lectures and other resources:
fullstackdeeplearning.com/marc

Visualization of the world of search engines:

searchenginemap.com/

We support $$\LaTeX$$ formulas: Use $$ and $$ for inline LaTeX formulas, and $ and $ for display mode.