HTRUNK

Designer

Designer offers an intuitive collection of data processing, ingestion, and egress components that mask the complexity of Hadoop data application development.

What Designer Does

It enables application developers to build and test data processing applications using a prebuilt set of components, and to import data into and export data from Hadoop. The Designer provides the following features for quick and easy application development.

Data Processing Components

The data processing components help users to implement complex transformations.

No matter where and how the data is stored on Hadoop, the components can use data from any Hadoop ecosystem source, such as Hive, HDFS, HBase, or Parquet, with a simple drag-and-drop approach. This makes the Designer well suited to extract-transform-load (ETL) data pipelines and iterative data processing.

The Designer provides components that convert unstructured data sources, such as Word documents, PDFs, ODFs, RTFs, and log files, into a structured form that can be used for further processing.
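
As a rough illustration of what converting unstructured data into a structure means, the sketch below turns raw log lines into records with named fields. It is not the Designer's implementation: the log layout, the pattern, and the function names parse_log_line and parse_log are all assumptions made for this example.

```python
import re
from typing import Optional

# Hypothetical log layout assumed for this sketch:
#   "<date> <time> <LEVEL> <message>"
LOG_PATTERN = re.compile(
    r"^(?P<date>\d{4}-\d{2}-\d{2}) (?P<time>\d{2}:\d{2}:\d{2}) "
    r"(?P<level>[A-Z]+) (?P<message>.*)$"
)


def parse_log_line(line: str) -> Optional[dict]:
    """Return the line as a dict of named fields, or None if it does not match."""
    match = LOG_PATTERN.match(line.strip())
    return match.groupdict() if match else None


def parse_log(lines):
    """Keep only the lines that fit the assumed layout, as structured records."""
    records = []
    for line in lines:
        record = parse_log_line(line)
        if record is not None:
            records.append(record)
    return records
```

Once each line is a record with date, time, level, and message fields, later steps can filter or aggregate on those fields instead of searching free text.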

Data Transfer Components

The data transfer components move a subset or all of the data from external systems and enterprise data warehouses (EDWs) into Hadoop to optimize the cost-effectiveness of combined data storage and processing.

Managing

The Designer provides a quick and easy way to create and implement business logic and to define the data flow. It also provides an easy way to manage private and shared application development repositories for powerful, conflict-free application development. Designer includes a version control system to efficiently track project history over time and to collaborate easily with a co-located team or a scattered community of developers.

Job Execution

The Designer enables quick, one-click job execution on a smaller data set as part of development or application testing. This enables the team to test and validate before the code is deployed on a fully distributed environment. The quick execution runs the job locally, simulating the distributed execution.

Metadata

Hadoop’s “schema on read” architecture makes a Hadoop cluster well suited to storing heterogeneous data, both structured and unstructured, but this adds a lot of complexity. Designer presents a relational view of the data that can be used across the hTRUNK™ components.
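
To make the “schema on read” idea concrete, the sketch below stores raw text untouched and applies column names and types only when the data is read. It illustrates the general concept and not how Designer builds its relational view: the sample data, the SCHEMA list, and the function read_with_schema are all hypothetical.

```python
import csv
import io

# Raw data is stored as-is, with no schema attached (sample values are made up).
RAW_DATA = "1,alice,34.50\n2,bob,12.00\n"

# The schema is supplied by the reader: (column name, type converter) pairs.
SCHEMA = [("id", int), ("name", str), ("amount", float)]


def read_with_schema(raw_text, schema):
    """Apply column names and types to raw delimited text at read time."""
    rows = []
    for fields in csv.reader(io.StringIO(raw_text)):
        row = {name: cast(value) for (name, cast), value in zip(schema, fields)}
        rows.append(row)
    return rows
```

The same stored text can be read with a different schema, for example one that treats every column as a string, without rewriting the data. That flexibility is also the source of the complexity that a shared relational view is meant to hide.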