Images are universal. Images are language independent. The university professor in Chicago, the bank employee in Beijing, the hairdresser in Rome – all recognize the jubilation of the American sailor kissing a woman in Times Square and the horror of the naked Vietnamese girl running for her life. No explanation or translation is needed. Understanding an article in the Beijing Daily or the Corriere della Sera, however, might prove insurmountable for the American professor – unless she has mastered Chinese and Italian.
Unlike a picture, a text that is not written in a language we are familiar with remains an incomprehensible set of signs; a mystery, until we find a translator – either a person or a tool – to help us out.
Undoubtedly, the universal character of pictures is one of the reasons AI research chose to tackle image recognition first. The leverage effect is tremendous: once your algorithm recognizes baby faces, you can sell it to any mom in the world – provided she has an internet connection. By now, most systems achieve very decent results. Most AI researchers have lost their awe of Natural Language Processing, thinking that if their neural networks revolutionized machine vision, they might as well lead to a breakthrough in text understanding.
But despite huge research efforts, the deep learning models that perform well at labelling your Facebook gallery still struggle to understand the meaning of text. The reality is: they are not built to cope with the infinite richness of semantics.
They are like monstrous icebergs hiding the secrets of their genesis below sea level – millions of carefully annotated data, thousands of hours of parameter tuning, pages and pages of statistics.
When these systems recognize similar texts, it’s not intelligence. It’s not even magic. It’s just pure luck – the best proof being that they cannot reproduce their good scores with different frameworks or different use cases. They need to repeat the whole procedure: selecting and annotating huge data sets and going through a tedious process of trial and error, moving a code line here, adding a filter there. Interestingly, the deep learning gurus themselves are beginning to outline the limits of their racehorse and to talk more and more about what their technology cannot do.
So, understanding text is a hassle. But what if texts could be converted into images? Unique images in which each pixel bears a distinct meaning and pixels with similar meanings lie close to each other? Could the meaning of text become universal too?
Let’s have a look at these text images produced with Cortical.io’s Retina API (I used the sandbox API):
Jaguar versus Porsche: the left image shows the semantic fingerprint of the term “jaguar”, the right image the semantic fingerprint of the term “Porsche”, and the image in the middle displays the overlap of the two. Your eyes immediately spot a large cluster of dots in the bottom right corner of each image. This cluster represents the main context the two terms share: cars. It is not only obvious to your brain, it is also obvious to a computer system, because each pixel in the image corresponds to a pair of vector coordinates that are easily computable.
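The comparison described above can be sketched in a few lines of code. A semantic fingerprint is essentially a sparse binary vector – a set of active pixel positions on a fixed grid – so the overlap and a similarity score reduce to simple set operations. The fingerprints below are invented toy examples for illustration, not real Retina API output:

```python
# A semantic fingerprint modelled as the set of active pixel indices on a
# fixed grid. The fingerprints below are made-up toy examples, not real
# Retina API output.

def overlap(fp_a: set[int], fp_b: set[int]) -> set[int]:
    """Pixels active in both fingerprints -- the shared contexts."""
    return fp_a & fp_b

def jaccard_similarity(fp_a: set[int], fp_b: set[int]) -> float:
    """Overlap normalized by the union: 0.0 (disjoint) to 1.0 (identical)."""
    return len(fp_a & fp_b) / len(fp_a | fp_b)

# Toy fingerprints: "jaguar" and "porsche" share a 'cars' pixel cluster.
cars_cluster = {16001, 16002, 16003, 16130, 16131}
jaguar = cars_cluster | {200, 201, 330}      # plus 'animal' pixels
porsche = cars_cluster | {5000, 5001, 5130}  # plus 'engineering' pixels

print(sorted(overlap(jaguar, porsche)))               # the shared 'cars' cluster
print(round(jaccard_similarity(jaguar, porsche), 2))  # similarity score
```

The point is that the visual impression – “these two images share a cluster” – maps directly onto a cheap, deterministic computation over pixel positions.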
Jaguar, the animal, versus South American wildlife: now look at the left image representing a description of jaguar, the animal (from Wikipedia), and a short text about South American wildlife (from the BBC). No need to explain that the two texts are strongly related, even though the term “jaguar” does not appear in the BBC text. Imagine the implications for a news filter, for example: you describe your interests in a short text that is converted into a semantic fingerprint by the system. Each piece of news from your RSS or Twitter feed is converted into a semantic fingerprint too and compared with the image describing your interests. The system forwards only the news items most similar to your interests, independently of the keywords used and in any language:
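Such a filter is straightforward once every text has a fingerprint. A minimal sketch, assuming the fingerprints (sets of active pixel indices) have already been computed by a service such as the Retina API – the data and the threshold below are invented for illustration:

```python
# Sketch of a fingerprint-based news filter. Fingerprints are sets of
# active pixel indices; in practice they would come from a fingerprinting
# service. The feed data and threshold here are invented toy values.

def jaccard(fp_a: set[int], fp_b: set[int]) -> float:
    return len(fp_a & fp_b) / len(fp_a | fp_b)

def filter_news(interests: set[int], items: dict[str, set[int]],
                threshold: float = 0.3) -> list[str]:
    """Keep only items whose fingerprint is similar enough to the interest
    profile, best matches first. No keywords are involved at any point."""
    scored = [(jaccard(interests, fp), title) for title, fp in items.items()]
    return [title for score, title in sorted(scored, reverse=True)
            if score >= threshold]

# Toy data: the user cares about big cats and wildlife.
interests = {10, 11, 12, 13, 50, 51}
feed = {
    "Jaguars spotted in Brazilian reserve": {10, 11, 12, 50, 52},
    "New Porsche model unveiled": {900, 901, 902},
    "South American wildlife under threat": {11, 12, 13, 51, 60},
}
print(filter_news(interests, feed))  # the two wildlife stories pass; Porsche does not
```

Because the comparison happens in fingerprint space, a story can match without sharing a single word with the interest profile – which is exactly what the jaguar/BBC example demonstrates.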
Jaguar, the animal, in different languages: probably the most striking aspect of the semantic fingerprints is that they demonstrate the stability of semantic spaces across languages. Look at the images for “jaguar the animal” from the English, French and German Wikipedia. They all show the same cluster in the top right corner, the cluster associated with the wildlife meaning. The last image represents the semantic fingerprint of a French text describing a travel offer to a nature reserve in Brazil where you can spot jaguars. The cluster is still there.
There is no magic in these images. They can be analyzed down to single pixels to capture any nuance of semantics. They can be reproduced with different texts, in different contexts, in different languages.
These images show that, yes, one day, text might become universal too.
Want to try it out? Compare the similarity of English texts with our Similarity Explorer demo.