Data is the currency of the digital world. How your organization stores, organizes, analyzes, and uses the data within its confines will largely determine how successful it is. Enterprises deal with large quantities of data, typically at petabyte scale, and they look to glean maximum value from all this data.

Cloud computing has been a game changer in this respect. What would be cost-prohibitive with traditional servers is now much more accessible with the economical and powerful solutions offered by cloud computing vendors.

Case in point: data warehouse solutions hosted completely in the cloud. Thanks to cloud-based data lakes, what would have been impossible a few years ago is now made possible by the plummeting costs of data storage disks and by more powerful compute instances. This post explains how to use data warehouses in the cloud, and compares popular options on major public cloud platforms.

A data warehouse is a centralized data store that’s used by multiple applications within your organization. If you’re looking to analyze small quantities of data that are a couple of GB in size, a data warehouse is too complex for your needs. A data warehouse makes sense only once you’ve scaled to a few hundred GB of data. At that point, you can’t function at the same speed and agility you used to, and you need a data warehouse. The first thing to know about a data warehouse is that it is architected differently from small-scale database infrastructure. Rather than having databases that are restricted to individual hardware servers, a data warehouse is made of multiple servers that work together as a single unit.

Alibaba Cloud’s MaxCompute is a large-scale data analysis platform that is purpose-built for running big data workloads. Coming from the house of Alibaba, it was built out of necessity, when Alibaba needed a way to manage its ever-growing data that Oracle servers could no longer handle. It is battle-tested internally at Alibaba, where it has run a cluster of 10,000 server nodes. On a daily basis, 14,000 developers at Alibaba run three million jobs on it, and it stores 99% of all of Alibaba’s data. It is the first database service to scale to 100 TB of data at 7,000 BigBench queries per minute (BBQpm). MaxCompute makes data migration simple with a variety of options. You can use Alibaba Cloud’s own tools like the MaxCompute client or DataWorks, or popular external tools like Flume, Logstash, or Fluentd. The uploaded data is stored in an SQL database, and can easily be scaled up to petabytes in size. The most recent version of MaxCompute supports SQL 2.0, and interestingly allows for querying of unstructured data like images and video content. Despite the large quantities of data, some of it unstructured, MaxCompute is especially well-suited for real-time analysis. And the best part is that it is extremely easy to use and maintain.