News broadcastings are well-known from everyone. For example, CNN, financial times and Bloomberg, etc. If you can make your own news broadcasting, it is awesome and amazing. But is it possible? One of the obstacles is how we can collect articles and information from all over the world in real-time basis. Of course I do not have my own network of news correspondents all over the globe. Then, what should we do about that?
Last week I found the blog about “GDELT 2.0“. The GDELT Project, which monitors events driving global society, creating a free, open platform for computing in the entire world, was founded and led by Kalev H. Leetaru. The GDELT Project’s full name stands for the Global Database of Events, Language, and Tone (GDELT). Now this project is going to a new stage of “GDELT 2.0”. Compare with “GDELT 1.0”, “GDELT 2.0” has a great deal of progress as follows
1. “GDELT 2.0” can cover documents and information written in 65 languages
There is a lot of linguistic communication to be written and spoken all over the world. If we try to cover all over the Earth, we need to understand languages other than English. For example, an apple is called “Ringo” in Japanese. If computers cannot read what “Ringo”means, it is impossible to collect the information about apple in Japan because few of the articles are translated from Japanese to English. There is no need to worry about them. GDELT 2.0” can do that by using real time machine translation. This function is called “GDELT Translingual“. It means that global news that GDELT monitors in 65 languages, representing 98.4% of its daily non-English monitoring volume, is transformed in real time into English. It is amazing because the media of the non-Western world can be included in our coverage. There are no language barriers to worry about.
2. “GDELT 2.0” can be updated in near-real time basis
A blog of “GDELT 2.0″ says ” In essence, within 15 minutes of GDELT monitoring a news report breaking anywhere the world, it has translated it, processed it to identify all events, counts, quotes, people, organizations, locations, themes, emotions, relevant imagery, video, and embedded social media posts, placed it into global context, and made all of this available via a live open metadata firehouse enabling open research on the planet itself.” These data use to be updated once a day. Now it is updated within 15 minutes. I think it is critically important when we try to create our own news-broadcasting.
3. “GDELT 2.0” can exercise content analysis for each article in near-real time basis
“GDELT 2.0” can also judge whether the articles are positive or negative. The blog says “GDELT 2.0” can quantify the extraordinary array of latent emotional and thematic signals subconsciously encoded in the world’s media each day. 18 content analysis systems totaling more than 2,230 dimensions are now run on each news article seen by GDELT each day and all of these scores are available. It is called “the Global Content Analysis Measures (GCAM)”.
In short, information all over the world can be updated with real-time machine translation and content analysis. It is definitely amazing. With this database of “GDELT 2.0”, we might create our own news broadcasting! Could you try it now?
If you are interested in “GDELT 2.0”, it is a nice video for an introduction.