HdfsDataSegmentPusher (io.druid:druid 0.12.0 API)

java.lang.Object
- io.druid.storage.hdfs.HdfsDataSegmentPusher

All Implemented Interfaces:: DataSegmentPusher

public class HdfsDataSegmentPusher
extends Object
implements DataSegmentPusher

Field Summary
- Fields inherited from interface io.druid.segment.loading.DataSegmentPusher
  JOINER

Constructor Summary

Constructors
Constructor and Description
`HdfsDataSegmentPusher(HdfsDataSegmentPusherConfig config, org.apache.hadoop.conf.Configuration hadoopConfig, com.fasterxml.jackson.databind.ObjectMapper jsonMapper)`

Method Summary

All Methods Instance Methods Concrete Methods Deprecated Methods
Modifier and Type	Method and Description
`String`	`getPathForHadoop()`
`String`	`getPathForHadoop(String dataSource)` Deprecated.
`String`	`getStorageDir(DataSegment segment)` Due to https://issues.apache.org/jira/browse/HDFS-13 ":" are not allowed in path names.
`String`	`makeIndexPathName(DataSegment dataSegment, String indexName)`
`Map<String,Object>`	`makeLoadSpec(URI finalIndexZipFilePath)`
`DataSegment`	`push(File inDir, DataSegment segment, boolean replaceExisting)` Pushes index files and segment descriptor to deep storage.

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Methods inherited from interface io.druid.segment.loading.DataSegmentPusher
getAllowedPropertyPrefixesForHadoop, getDefaultStorageDir

- Constructor Detail
  - HdfsDataSegmentPusher
```
@Inject
public HdfsDataSegmentPusher(HdfsDataSegmentPusherConfig config,
                                     org.apache.hadoop.conf.Configuration hadoopConfig,
                                     com.fasterxml.jackson.databind.ObjectMapper jsonMapper)
                              throws IOException
```
    Throws:
    
    IOException
- Method Detail
  - getPathForHadoop
```
@Deprecated
public String getPathForHadoop(String dataSource)
```
    Deprecated.
    
    Specified by:
    
    getPathForHadoop in interface DataSegmentPusher
  - getPathForHadoop
```
public String getPathForHadoop()
```
    Specified by:
    
    getPathForHadoop in interface DataSegmentPusher
  - push
```
public DataSegment push(File inDir,
                        DataSegment segment,
                        boolean replaceExisting)
                 throws IOException
```
    Description copied from interface: DataSegmentPusher
    
    Pushes index files and segment descriptor to deep storage.
    
    Specified by:
    
    push in interface DataSegmentPusher
    
    Parameters:
    
    inDir - directory containing index files
    
    segment - segment descriptor
    
    replaceExisting - overwrites existing objects if true, else leaves existing objects unchanged on conflict. The behavior of the indexer determines whether this should be true or false. For example, since Tranquility does not guarantee that replica tasks will generate indexes with the same data, the first segment pushed should be favored since otherwise multiple historicals may load segments with the same identifier but different contents which is a bad situation. On the other hand, indexers that maintain exactly-once semantics by storing checkpoint data can lose or repeat data if it fails to write a segment because it already exists and overwriting is not permitted. This situation can occur if a task fails after pushing to deep storage but before writing to the metadata storage, see: https://github.com/druid-io/druid/issues/5161. If replaceExisting is true, existing objects MUST be overwritten, since failure to do so will break exactly-once semantics. If replaceExisting is false, existing objects SHOULD be prioritized but it is acceptable if they are overwritten (deep storages may be eventually consistent or otherwise unable to support transactional writes).
    
    Returns:
    
    segment descriptor
    
    Throws:
    
    IOException
  - makeLoadSpec
```
public Map<String,Object> makeLoadSpec(URI finalIndexZipFilePath)
```
    Specified by:
    
    makeLoadSpec in interface DataSegmentPusher
  - getStorageDir
```
public String getStorageDir(DataSegment segment)
```
    Due to https://issues.apache.org/jira/browse/HDFS-13 ":" are not allowed in path names. So we format paths differently for HDFS.
    
    Specified by:
    
    getStorageDir in interface DataSegmentPusher
  - makeIndexPathName
```
public String makeIndexPathName(DataSegment dataSegment,
                                String indexName)
```
    Specified by:
    
    makeIndexPathName in interface DataSegmentPusher

Class HdfsDataSegmentPusher

Field Summary

Fields inherited from interface io.druid.segment.loading.DataSegmentPusher

Constructor Summary

Method Summary

Methods inherited from class java.lang.Object

Methods inherited from interface io.druid.segment.loading.DataSegmentPusher

Constructor Detail

HdfsDataSegmentPusher

Method Detail

getPathForHadoop

getPathForHadoop

push

makeLoadSpec

getStorageDir

makeIndexPathName