Channel: Blog | Dell

↧

Application Workloads and Object Innovation

July 26, 2013, 5:01 am

≫ Next: Introducing MitCH: A Channel Superpower

≪ Previous: It’s A Great Time for Collaboration — EMC Syncplicity a Leader in File Sync and Share Platforms

	This series of blog posts has focused on the evolution of high-tech infrastructure in response to constantly-evolving application workloads. In my last post I described how unstructured and metadata-rich application workloads drove the rise of Network-Attached Storage (NAS). The diagram below allowed me to highlight differences between block and file system architecture. Unstructured content benefits from metadata association. NAS systems provided the binding between the two. The approach used by many vendors involved the interspersal of content and metadata within a disk array infrastructure. Block-based systems of that era, on the other hand, viewed all blocks as "content", and had no fundamental awareness of application metadata. The overlay below highlights this difference. The NAS approach of tight interspersal of content and metadata became a hurdle for a new class of application workloads. To quote my EMC colleague Stephen Manley, these new applications wanted to do "even cooler" things with their metadata. For example, applications wanted to: attach increasingly larger amounts of metadata to content. create formal ontologies for metadata (e.g. XML rules for metadata structure). search through metadata at high speed. enforce policies on content via metadata keywords (e.g. retention periods). The increased importance that these new workloads placed on metadata drove the industry to treat metadata as a first-class citizen. The "interspersal" technique used by most NAS devices did not lend itself to the new workloads. As a result, the industry evolved (yet again) in response to these new applications and facilitated the rise of object-based storage systems. Object-based systems allow applications to "attach" rich metadata to content and bind them together via an object-identifier. Under the covers, object-based storage systems were not constrained to intersperse the metadata and the content. They could be stored as separate entities, which "freed" the metadata to be used in more diverse and beneficial ways. In fact, the content itself was "freed" from the linkage to a specific directory, which facilitated new levels of sharing and collaboration for content. The implementation of object-based storage systems also gave vendors the opportunity to address additional shortcomings that NAS-based systems were experiencing at the time, including file size maximums and file count limits. The first object-based implementation was termed content-addressable storage, or CAS. Wikipedia provides the definition of CAS below: a mechanism for storing information that can be retrieved based on its content, not its storage location. The diagram below highlights CAS function and operation in the context of one of the first CAS implementations (known as Centera): Instead of using the traditional file-based access methods (e.g. file open, read, write, and close), the Centera approach allowed an application to write a random stream of data, associate it with relevant metadata, and store it as a package to the Centera storage system. In return the Centera system would return a unique identifier to the application. This approach caused a fundamental shift in application architectures, which enabled: A permanent binding between file content and an unlimited amount of metadata associated with the file content. The removal of responsibility for "where" the application placed data. The application no longer had to specify a logical directory location for each file. Object counts could scale into the billions, well beyond the limit of many file system capacities at the time. The metadata contained keywords to implement policies (such as how long to retain a document and disallow deletion). A third access pillar was added to the data center as a result of new application workloads. Many customers deployed all three access methods: block, file, and object. Capacity-based, object workloads are graphically depicted in the lower-half of our workload framework. Some object-based workloads required high service levels (e.g. hospital applications) while some did not (e.g. YouTube). As a result of all three types of application access methods (block, file, and object), data and meta-data continued to grow unabated within customer data centers. This gave rise to a new problem: the growth of new forms of metadata related to the data center operation itself. I'll cover "The Rise of Metadata Part 2" in my next post. Steve http://stevetodd.typepad.com Twitter: @SteveTodd EMC Fellow
Update your feed preferences

↧

Trending Articles

Practice Sheet of Right form of verbs for HSC Students

September 22, 2019, 11:40 pm

Bureau of Internal Revenue: Regional Offices (Directory)

January 9, 2014, 11:06 pm

Four Air Leitchville Pty Ltd v Hurlad Pty Ltd (No 3) [2024] FCA 238

March 13, 2024, 12:00 am

$22.6m payout to workers fired under UNC govts

July 19, 2018, 9:30 pm

Vocational Training Instructor (Carpenter) at States of Jersey

November 1, 2021, 5:00 am

The 10 Tennessee Cities With The Largest Black Population For 2021

December 21, 2020, 10:12 am

'My best friend looked possessed, then he stabbed me', teenager tells court

February 5, 2013, 1:00 am

Karimnagar District Tahsildars Phone Numbers-Mobile Numbers Telangana-State

February 18, 2016, 8:01 pm

JACOB FORREST OGDEN Arrested by Clackamas County Sheriff's Office on Dec 30,...

December 30, 2019, 12:00 am

(get) Tej Dosa Letter 81 - How To Make An Extra $200-$500/Week (In 2025)

July 1, 2025, 2:15 pm

Black Angus Grilled Artichokes

July 16, 2016, 4:37 pm

Windows Update / Microsoft Update の接続先 URL について

February 27, 2017, 12:32 am

SAHARA FLASH LIVE IN WERAGOLLA 2018-04-20

November 26, 2021, 5:21 am

Adolescence A Stage of Growth and Change Class 7 Extra Questions and Answers...

June 2, 2025, 2:24 am

HP P2000 Storage Error Controller A Unknown Issue Resolution Request

December 22, 2024, 5:53 pm

Moondru Mudichu 04-10-2017 – Polimer tv Serial

October 4, 2017, 9:14 am

FortiLink mode supported over a layer-3 network

July 22, 2019, 12:55 pm

ZARIA CUMMINGS

February 20, 2017, 7:44 pm

Serial child killer David Threinen’s reign of terror

June 15, 2015, 6:04 pm

Philly Mobster Ronnie Turchi Took Last Ride In October ’99, Turned Up Trunk...

October 20, 2019, 3:40 am

© 2025 //www.rssing.com