We provide the complete methodology on how the project was conducted: how data was collected, manipulated, interpreted, and finally visualized. The project was developed as an experimental exploration on how to deal with data that describe company acquisitions, and how much tacit information we can derive from its analysis and visualization.
What are we looking for?: Diving into a gray area
GAFAM stands for the five largest, most dominant, and most prestigious companies in the information technology industry of the United States as of 2022: Alphabet (Google), Amazon, Apple, Meta (Facebook) and Microsoft.
GAFAM operates in a gray area in which it is difficult to state whether they are competing or not, or whether there are monopoly issues without us realizing it. From 1975, Microsoft has acquired more than 357 companies. This project looks at the relationships between the companies acquired by GAFAM, shifting from the already seen money-approximation toward looking closely at each company definition. There is no complete and transparent information available on how much money the GAFAM companies invest and spend. We cannot know with certainty the motivations for why they buy other companies and start-ups. But we can observe what they buy and how each company describes its business and area of interest. In the attempt to grab a picture of their interests, hopes, and beliefs, our approach recognizes and accepts that the information we have is incomplete: we are diving into a gray area.
Data collection
The project uses Crunchbase as a source: a platform for finding business information on private and public companies. The information on profiles for individuals, organizations, or institutions are sourced in four ways: the venture program, machine learning, an in-house data team, and the Crunchbase community. Members of the public can submit information to the Crunchbase database. These submissions are subject to registration, social validation, and are often reviewed by a moderator before being accepted for publication.
The following image shows some of the information describing each company that we use in the project.
The information was collected for the 1,210 companies that have been acquired by GAFAM over the years, ranging from Microsoft's acquisition of Forethought (which became PowerPoint) in 1987 to Apple's purchase of an aerospace transportation company that develops electric aircrafts, Joby Aviation, in the first days of April 2022. Companies listed in the dataset were assigned tags describing their business and work. For example, Joby Aviation is tagged "Aerospace”, “Air Transportation”, “Electric Vehicle”, and “Transportation". Also, relevant data was collected for tracing back the acquisition pipeline that the GAFAM companies have made over the years. For example, Tavolo is a company that was acquired in 2000 by OurHouse, which was later acquired by Amazon the following year.
Company | Parent company | Type | Date of acquisition | Operating status | Location | Description |
---|---|---|---|---|---|---|
Forethought | Microsoft | acquisition | 30/07/1987 | Active | Sunnyvale, California, United States | Forethought, Inc. is a computer software company that developed PowerPoint, a slide-based presentation program. |
OurHouse | Amazon | acquisition | 01/12/2001 | Active | Decatur, Georgia, United States | OurHouse is an online retailer that provides its clients with hardware products and advice from its official online platform. |
Tavolo | OurHouse | acquisition | 12/12/2000 | Active | Minneapolis, Minnesota, United States | Tavolo is an online retailer and destination for consumers who cook, dine and entertain. |
Joby Aviation | Apple | acquisition | 01/04/2022 | Active | Santa Cruz, California, United States | Joby Aviation is an aerospace transportation company developing electric aircrafts. |
In this project we do not use geographic locations of companies nor monetary values related to the purchase and sale of the companies nor to their profit or investment since our focus is on the description of what is purchased.
Data manipulation and visualization
In order to see the relationships that exist among the companies acquired by the GAFAM companies, we first try to see the general landscape of where these companies are located according to their businesses. To do so, we used the tags that define them, attempting to group the companies according similarities in how they are defined. However, given the lack of uniformity of the different 390 tags that define all the companies, it was necessary to organize them into categories that would allow us to group them together.
To do so, we built a hierarchical schema grouping tags into categories, from the finest to the broadest.
We've embedded content from Observable here. As Observable may collect personal data and track your viewing behaviour, we'll only load this content after you consent to their use of cookies and similar technologies as described in their privacy policy.
For example, Handipoints is "a website with a chore chart component and a virtual world containing a range of online games and activities for children" acquired by Slide in 2009 and then by Google in 2010. Its tags are Casual games, Gaming and Virtual games, resulting in Gaming as a broad tag from our hierarchical schema.
Landscape of acquisitions
Once we organize the tags, we elaborate a map that positions each company according to the similarity of the combinations of tags. We use a model called UMAP, which results in cluster coordinates that allow us to see the landscape of these companies.
The next step was to read the companies' clusters and understand their business and interests. We observed that in many cases, the tags describing the companies were not sufficient for understanding the company business. For example, Neven Vision was acquired by Google in 2006. It is described with the tag Software as a “developer of mobile recognition engines for mobile phone and consumer electronics industries, and more”. After further research, we found that Neven Vision was a biometric and photo recognition company. It has patents on technology ranging from photo analysis to face recognition in video files to several patents for facial capture for avatar animation. Sounds like a fascinating partnership. The company is heavily focused on mobile phones and also offers a product to deliver coupons to mobile devices. Thus, the tag "software" is changed to "biometrics".
After researching each company, we noticed that it is difficult to classify them based on the technologies they develop, since in many cases there is no information that accounts for them, and on the other hand, many of these technologies are used in different sectors. For example, biometric technology such as speech recognition can be used in sectors as diverse as health (Dictaphone) or security. In this way, this project classifies each company with a technology and a sector in which it is possible to recognize them. This project proposes an overview of 15 sectors and 10 types of technologies. Color is used to encode the sector and technology throughout the project:
- Advertising is marketing communication that uses an overtly sponsored, non-personal message to promote or sell a product, service, or idea. i.e: AdMob 'is a mobile advertising network that helps app developers monetize and promote their mobile and tablet apps with ads. It provides solutions for discovery, branding, and monetization on the mobile Web. Its mobile advertising and monetization solutions include mobile sites; Mobile Analytics solution, which helps mobile site owners to gain insight into their traffic and site usage to make data-driven decisions about their mobile business; and Mobile Metrics Report, which offers insights into trends in the mobile ecosystem.'
- Companies in this sector offer products (software, platforms, apps, services, etc) for end users to create, edit and manipulate different materials, generally audio and visual, ranging from photographs, diagrams, video, music and web design. I.e: drawElements 'is a company of computer graphics experts. Our products and services help businesses to test and improve the quality of computer graphics in their products.'
- 'The activity of electronically buying or selling of products on online services or over the Internet'. (Wikipedia) I.e: AbeBooks 'is an online marketplace for books, listing over 110 million new, used, rare, and out of print books from 13,500 sellers. AbeBooks offers a range of services including support, HomeBase inventory management software, and five international sites to reach global markets. The company often fills a niche for Amazon in finding hard-to-find or out-of-print books. AbeBooks is an Amazon subsidiary.'
- Companies in this sector sale products for facilitating learning and learning management mainly through online applications. I.e: PowerSchool 'is a provider of cloud-based software for K-12 education in North America. Its mission is to power the education ecosystem with unified technology that helps educators and students realize their full potential, in their way. PowerSchool connects students, teachers, administrators, and parents, with the shared goal of improving student outcomes. [...]'
- The energy sector embrace industries involved in the production and sale of energy. In this dataset we observed companies such as Makani Power that works on wind energy to produce energy, and MyEnergy which offers a 'personalized guide to energy savings', both acquired by Google.
- Companies in this sector relates to entertainment by means of content and ways of delivering it. By content we mean marketplaces, distribution that specifically sell music, films, podcasts, comics, animation among others. I.e: imoji 'is an app that turns selfies and other pictures into stickers for texting. The app then offers you a set of simple tools to zoom, crop, and finally cut out your image with a pair of virtual scissors so it appears like a real emoji or sticker. imoji has been acquired by Giphy in 2017.' We consider streaming products that deliver multimedia content by to be consumed in a continuous manner from a source, with little or no intermediate storage in network elements. I.e: MOG is 'a next-generation music media company, offers on-demand streaming music services through multi-platform access to a library of songs.'
- There is only one company within this sector: Wild Oats Marketplace which 'is a producer of Natural and organic foods distributed through partnerships in the United States.' It was acquired in 2007 by Whole Foods Market ('a supermarket chain owned [...] that exclusively sells natural and organic products') and later by Amazon in 2017.
- The video game industry is the industry involved in the development, marketing, and monetization of video games (Wikipedia). Online gaming describes any video game that offers online interactions with other players. I.e: Activision Blizzard 'is the world’s most successful interactive entertainment company. The California-based business specializes in producing cutting-edge, best-selling games including Call of Duty, Skylanders, Overwatch, Candy Crush and more.'
- The healthcare industry (also called the medical industry or health economy) is an aggregation and integration of sectors within the economic system that provides goods and services to treat patients with curative, preventive, rehabilitative, and palliative care. It includes the generation and commercialization of goods and services lending themselves to maintaining and re-establishing health (Wikipedia). In this sector it is common to find technologies associated with artificial intelligence (such as speech recognition for voice assistants) as well as software for data management. I.e: Health Navigator 'sets the standard for digital health clinical content quality, across multiple channels and the continuum of care, improving the patient and provider experience, from the presenting complaint to the final diagnosis and treatment'.
- In this sector we combine two Information Technology (IT) and Information technology security. Diverse technologies are related to this sector: Artificial intelligence, Big Data & Analytics, Biometrics, Cloud Computing, Hardware, Location-based, technologies, Office Software, Search Engine, Software in general. Information technology 'is the use of computers to create, process, store, retrieve, and exchange all kinds of electronic data and information' (Wikipedia). I.e: Elastifile 'is redefining the way data is stored and managed in the cloud and spanning hybrid clouds.' Information technology security is the overarching term used to describe the collective strategies, methods, solutions and tools used to protect the confidentiality, integrity and availability of the organization’s data and digital assets. It seeks to prevent, detect and remediate a variety of cyber threats and cyberattacks. It will include protection for all hardware systems, software applications and endpoints, as well as the network itself and its various components, such as physical or cloud-based data centers. (from Crowdstrike.com) I.e: Alacris 'provides advanced identification and access security software and certificate management products. Alacris looks to help clients manage trusted digital identities across a wide variety of security technologies, platforms, architectures and devices. Alacris was acquired by Microsoft in September, 2005.'
- Two companies are define in this sector: Google for Nonprofits, which 'offers organizations access to highly discounted or free Google services' and the Google Digital News Initiative which 'is a partnership between Google and news publishers in Europe to support high quality journalism through technology and innovation.'
- Robotics 'involves design, construction, operation, and use of robots. The goal of robotics is to design machines that can help, assist' (Wikipedia) or supplant humans. We find robots for various uses such as autonomous mobility, pallet management in distribution warehouses and in the agricultural field such as Strio.AI 'a company that focuses on artificial intelligence and agriculture through robotics.'
- Social media are interactive digital channels that facilitate the creation and sharing of information, ideas, interests, and other forms of expression through virtual communities and networks (Wikipedia). We also include to this sector all companies that offer messaging, texting, blogging. Here you will find chatbox as well. I.e: Hot Potato 'connects friends and fans around live events. We're building tools for real-time social collaboration amongst groups who share an interest in any event or experience as it's happening. Whether checking in, recording your perspective, or just enjoying others contributions, it's more fun to participate in events with Hot Potato'.
- Consumer electronics or home electronics are electronic (analog or digital) equipment intended for everyday use, typically in private homes (Wikipedia). I.e: 'Nokia - Devices & Services Business develops and manufactures mobile phones.'
GAFAM Petri dishes
After viewing all the companies in the same landscape, the "petri dishes" model aggregates them according to each GAFAM company that purchased them. We represent the interests of each GAFAM company by grouping the companies by sector and using color to describe the technology. The petri dishes show the accumulation of all companies that were purchased by a GAFAM company over time. Companies that were acquired and then sold are not included in the petri dish, and companies acquired through the acquisition of another company are included only when the latter was acquired.
We've embedded content from Observable here. As Observable may collect personal data and track your viewing behaviour, we'll only load this content after you consent to their use of cookies and similar technologies as described in their privacy policy.
The visualization uses a force-based layout that clusters single nodes based on a categorical scale (in this case, the sector each company belongs to), which are then colored by their technological affinity. To better identify groups, all companies belonging to the same sector are encircled using a Convex Hull.
Timelines of expansion
The timeline visualizes the chain of company purchases in relation to each GAFAM company to scrutinize the relationships of interests over time. Acquisitions and sales can be a clue to get closer to understanding the motivations and interests of GAFAM companies. For example, Powerschool is a company founded in 1997 that started buying other start-ups and companies in relation to its interest in the education sector. In 2001 Apple buys Powerschool and continues to buy other companies in the sector. Finally, Apple sells Powerschool to Onex in 2022. This chain of acquisitions and sales is not linear and deserves to get a closer look as to when they happened and which sector and technologies are involved.
The visual model combines a hierarchical structure, usually visualized using dendrograms, or tree-like structures, and a time-based visualization, by placing nodes in the year of their acquisition by GAFAM. The result is a time-based hierarchical network structure that showcases the intricacies of the relationships between GAFAM and acquired companies.
The visual model combines a hierarchical structure, usually visualized using dendrograms, or tree-like structures, and a time-based visualization, by placing nodes in the year of their acquisition by a GAFAM company. The result is a time-based hierarchical network structure that showcases the intricacies of the relationships between GAFAM and acquired companies. Companies that were sold by one GAFAM company to another, however, are visualized but only as sold companies.
We've embedded content from Observable here. As Observable may collect personal data and track your viewing behaviour, we'll only load this content after you consent to their use of cookies and similar technologies as described in their privacy policy.
A company's complete dendrogram shows all acquisitions from its initial founding. Red connections show companies that have been acquired and closed after some time, while green ones show companies that were later sold. Dashed connections show how an acquisition included previous acquisitions by that company. To build this hierarchical structure, we filter the complete dataset by the ultimate acquisition, and stratify it using d3.js to obtain a data structure that can be used to build a dendrogram.
We've embedded content from Observable here. As Observable may collect personal data and track your viewing behaviour, we'll only load this content after you consent to their use of cookies and similar technologies as described in their privacy policy.
To conclude, the methodology used in this project acknowledges the partiality of the data and the analysis performed. This is due to the overall amount of data collected, the source of data and the timeframe in which it was first compiled. Indeed, many acquired companies, especially acquisitions that happened many years ago, are missing information from the companies themselves - on the other hand, many small companies that were recently acquired don't have a lot of information available due to their small size. With this in mind, we provide the entire dataset used to produce the visualizations and writings in this website.
We've embedded content from Google here. As Google may collect personal data and track your viewing behaviour, we'll only load this content after you consent to their use of cookies and similar technologies as described in their privacy policy.