Package org.apache.storm.hdfs.bolt
Class AbstractHdfsBolt
java.lang.Object
org.apache.storm.topology.base.BaseComponent
org.apache.storm.topology.base.BaseRichBolt
org.apache.storm.hdfs.bolt.AbstractHdfsBolt
- All Implemented Interfaces:
- Serializable, IBolt, IComponent, IRichBolt
- Direct Known Subclasses:
- AvroGenericRecordBolt, HdfsBolt, SequenceFileBolt
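Field Summary
Fields
Modifier and Type, Field:
- protected OutputCollector collector
- protected String configKey
- protected FileNameFormat fileNameFormat
- protected Integer fileRetryCount
- protected org.apache.hadoop.fs.FileSystem fs
- protected String fsUrl
- protected org.apache.hadoop.conf.Configuration hdfsConfig
- protected Integer maxOpenFiles
- protected long offset
- protected Partitioner partitioner
- protected List<RotationAction> rotationActions
- protected Map<String, Integer> rotationCounterMap
- protected FileRotationPolicy rotationPolicy
- protected Timer rotationTimer
- protected SyncPolicy syncPolicy
- protected Integer tickTupleInterval
- protected Object writeLock
- protected Map<String, Writer> writers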
- 
Constructor Summary
Constructors
- AbstractHdfsBolt()
- 
Method Summary
Modifier and Type, Method, Description:
- void cleanup(): Called when an IBolt is going to be shut down.
- void declareOutputFields(OutputFieldsDeclarer outputFieldsDeclarer): Declare the output schema for all the streams of this topology.
- protected abstract void doPrepare(Map<String, Object> conf, TopologyContext topologyContext, OutputCollector collector)
- final void execute(Tuple tuple): Process a single tuple of input.
- protected org.apache.hadoop.fs.Path getBasePathForNextFile(Tuple tuple)
- Map<String, Object> getComponentConfiguration(): Declare configuration specific to this component.
- protected abstract String getWriterKey(Tuple tuple)
- protected abstract Writer makeNewWriter(org.apache.hadoop.fs.Path path, Tuple tuple)
- final void prepare(Map<String, Object> conf, TopologyContext topologyContext, OutputCollector collector): Marked as final to prevent override.
- protected void rotateOutputFile(Writer writer)
- 
Field Details
- writers
- rotationCounterMap
- rotationActions
- collector
- fs
  protected transient org.apache.hadoop.fs.FileSystem fs
- syncPolicy
- rotationPolicy
- fileNameFormat
- fsUrl
- configKey
- writeLock
- rotationTimer
- offset
  protected long offset
- fileRetryCount
- tickTupleInterval
- maxOpenFiles
- partitioner
- hdfsConfig
  protected transient org.apache.hadoop.conf.Configuration hdfsConfig
 
- 
- 
Constructor Details
AbstractHdfsBolt
public AbstractHdfsBolt()
 
- 
- 
Method Details
rotateOutputFile
protected void rotateOutputFile(Writer writer) throws IOException
- Throws:
- IOException
 
- 
prepare
public final void prepare(Map<String, Object> conf, TopologyContext topologyContext, OutputCollector collector)
Marked as final to prevent override. Subclasses should implement the doPrepare() method.
- Parameters:
- conf - The Storm configuration for this bolt. This is the configuration provided to the topology merged in with cluster configuration on this machine.
- topologyContext - This object can be used to get information about this task's place within the topology, including the task id and component id of this task, input and output information, etc.
- collector - The collector is used to emit tuples from this bolt. Tuples can be emitted at any time, including the prepare and cleanup methods. The collector is thread-safe and should be saved as an instance variable of this bolt object.
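The relationship between the final prepare() and the abstract doPrepare() is a template-method arrangement. The following is a minimal sketch of that shape only, using plain Java stand-ins rather than Storm types: TemplateBolt, UrlBolt, and the "example.fs.url" key are hypothetical names, and rethrowing the IOException as a RuntimeException is an assumption, not something this page states.

```java
import java.io.IOException;
import java.util.HashMap;
import java.util.Map;

class PrepareTemplateSketch {
    // Hypothetical stand-in for a bolt base class; this is NOT the Storm API.
    abstract static class TemplateBolt {
        private boolean prepared = false;

        // final: subclasses cannot replace the setup sequence, only extend it.
        public final void prepare(Map<String, Object> conf) {
            try {
                doPrepare(conf);
            } catch (IOException e) {
                // Assumption: a checked setup failure is rethrown unchecked.
                throw new RuntimeException("Error preparing bolt: " + e.getMessage(), e);
            }
            prepared = true;
        }

        public boolean isPrepared() {
            return prepared;
        }

        // Extension point, mirroring the documented doPrepare(...) throws IOException.
        protected abstract void doPrepare(Map<String, Object> conf) throws IOException;
    }

    // Hypothetical subclass that reads one made-up configuration key.
    static class UrlBolt extends TemplateBolt {
        String seenUrl;

        @Override
        protected void doPrepare(Map<String, Object> conf) throws IOException {
            Object url = conf.get("example.fs.url");
            if (url == null) {
                throw new IOException("missing example.fs.url");
            }
            seenUrl = url.toString();
        }
    }

    public static void main(String[] args) {
        Map<String, Object> conf = new HashMap<>();
        conf.put("example.fs.url", "hdfs://example:8020");
        UrlBolt bolt = new UrlBolt();
        bolt.prepare(conf);
        System.out.println("prepared=" + bolt.isPrepared() + " url=" + bolt.seenUrl);
    }
}
```

Keeping the entry point final means every subclass goes through the same setup path, while subclass-specific work stays in the one overridable method.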
 
- 
execute
public final void execute(Tuple tuple)
Description copied from interface: IBolt
Process a single tuple of input. The Tuple object contains metadata on it about which component/stream/task it came from. The values of the Tuple can be accessed using Tuple#getValue. The IBolt does not have to process the Tuple immediately. It is perfectly fine to hang onto a tuple and process it later (for instance, to do an aggregation or join).
Tuples should be emitted using the OutputCollector provided through the prepare method. It is required that all input tuples are acked or failed at some point using the OutputCollector. Otherwise, Storm will be unable to determine when tuples coming off the spouts have been completed. For the common case of acking an input tuple at the end of the execute method, see IBasicBolt which automates this.
- Parameters:
- tuple - The input tuple to be processed.
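The requirement that every input tuple is eventually acked or failed can be illustrated with a small sketch. It does not use Storm classes: RecordingCollector, write(), and the String "tuples" are hypothetical stand-ins, and the rule that a failed write leads to fail() is an assumption chosen for illustration.

```java
import java.util.ArrayList;
import java.util.List;

class AckOrFailSketch {
    // Hypothetical stand-in for a collector; it only records outcomes.
    static class RecordingCollector {
        final List<String> acked = new ArrayList<>();
        final List<String> failed = new ArrayList<>();

        void ack(String tuple) {
            acked.add(tuple);
        }

        void fail(String tuple) {
            failed.add(tuple);
        }
    }

    // Hypothetical sink: rejects any tuple whose payload starts with "bad".
    static void write(String tuple) {
        if (tuple.startsWith("bad")) {
            throw new IllegalStateException("cannot write " + tuple);
        }
    }

    // Every path through this method ends in exactly one ack or one fail.
    static void execute(String tuple, RecordingCollector collector) {
        try {
            write(tuple);
            collector.ack(tuple);
        } catch (RuntimeException e) {
            collector.fail(tuple);
        }
    }

    public static void main(String[] args) {
        RecordingCollector collector = new RecordingCollector();
        execute("ok-1", collector);
        execute("bad-1", collector);
        System.out.println("acked=" + collector.acked + " failed=" + collector.failed);
    }
}
```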
 
- 
getComponentConfiguration
Description copied from interface: IComponent
Declare configuration specific to this component. Only a subset of the "topology.*" configs can be overridden. The component configuration can be further overridden when constructing the topology using TopologyBuilder.
- Specified by:
- getComponentConfiguration in interface IComponent
- Overrides:
- getComponentConfiguration in class BaseComponent
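As an illustration of a component-level override, the sketch below builds a configuration map that sets a tick tuple frequency. It is an assumption that this bolt uses its tickTupleInterval field this way; the page does not say so. The helper withTickInterval is a hypothetical name, and the key string is the value of Storm's Config.TOPOLOGY_TICK_TUPLE_FREQ_SECS.

```java
import java.util.HashMap;
import java.util.Map;

class ComponentConfigSketch {
    // String value of Storm's Config.TOPOLOGY_TICK_TUPLE_FREQ_SECS.
    static final String TICK_FREQ_KEY = "topology.tick.tuple.freq.secs";

    // Hypothetical helper: copies the base config and adds a tick interval
    // only when one is set and positive.
    static Map<String, Object> withTickInterval(Map<String, Object> base, Integer tickTupleInterval) {
        Map<String, Object> conf = (base == null) ? new HashMap<>() : new HashMap<>(base);
        if (tickTupleInterval != null && tickTupleInterval > 0) {
            conf.put(TICK_FREQ_KEY, tickTupleInterval);
        }
        return conf;
    }

    public static void main(String[] args) {
        Map<String, Object> conf = withTickInterval(null, 15);
        System.out.println(conf);
    }
}
```

Copying the base map rather than mutating it keeps the inherited configuration untouched, which matters when the same base map is shared between components.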
 
- 
declareOutputFields
Description copied from interface: IComponent
Declare the output schema for all the streams of this topology.
- Parameters:
- outputFieldsDeclarer - this is used to declare output stream ids, output fields, and whether or not each output stream is a direct stream
 
- 
cleanup
public void cleanup()
Description copied from interface: IBolt
Called when an IBolt is going to be shut down. Storm will make a best-effort attempt to call this if the worker shutdown is orderly. The Config.SUPERVISOR_WORKER_SHUTDOWN_SLEEP_SECS setting controls how long orderly shutdown is allowed to take. There is no guarantee that cleanup will be called if shutdown is not orderly, or if the shutdown exceeds the time limit.
The one context where cleanup is guaranteed to be called is when a topology is killed when running Storm in local mode.
- Specified by:
- cleanup in interface IBolt
- Overrides:
- cleanup in class BaseRichBolt
 
- 
getBasePathForNextFile
- 
doPrepare
protected abstract void doPrepare(Map<String, Object> conf, TopologyContext topologyContext, OutputCollector collector) throws IOException
- Throws:
- IOException
 
- 
getWriterKey
- 
makeNewWriter
protected abstract Writer makeNewWriter(org.apache.hadoop.fs.Path path, Tuple tuple) throws IOException
- Throws:
- IOException
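This page does not describe how getWriterKey, makeNewWriter, and the writers and maxOpenFiles fields interact. One plausible arrangement, sketched below purely as an assumption, is a bounded cache: a key selects an existing writer, a new writer is created on a miss, and the least recently used writer is closed once the cap is exceeded. FakeWriter, WriterCache, getOrCreate, and the path layout are all hypothetical.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

class WriterCacheSketch {
    // Hypothetical writer: remembers its path and whether it was closed.
    static class FakeWriter {
        final String path;
        boolean closed = false;

        FakeWriter(String path) {
            this.path = path;
        }

        void close() {
            closed = true;
        }
    }

    // Access-ordered map that closes and drops the least recently used
    // writer once more than maxOpenFiles writers are open.
    static class WriterCache extends LinkedHashMap<String, FakeWriter> {
        private final int maxOpenFiles;
        final List<String> evicted = new ArrayList<>();

        WriterCache(int maxOpenFiles) {
            super(16, 0.75f, true); // true = iterate in access order
            this.maxOpenFiles = maxOpenFiles;
        }

        @Override
        protected boolean removeEldestEntry(Map.Entry<String, FakeWriter> eldest) {
            if (size() > maxOpenFiles) {
                eldest.getValue().close();
                evicted.add(eldest.getKey());
                return true;
            }
            return false;
        }
    }

    // Look up the writer for a key; create one on a miss (made-up path layout).
    static FakeWriter getOrCreate(WriterCache cache, String writerKey) {
        FakeWriter writer = cache.get(writerKey);
        if (writer == null) {
            writer = new FakeWriter("/base/" + writerKey + "/part-0");
            cache.put(writerKey, writer);
        }
        return writer;
    }

    public static void main(String[] args) {
        WriterCache cache = new WriterCache(2);
        getOrCreate(cache, "a");
        getOrCreate(cache, "b");
        getOrCreate(cache, "a"); // touching "a" makes "b" the eldest entry
        getOrCreate(cache, "c"); // exceeds the cap, so "b" is closed and dropped
        System.out.println("open=" + cache.keySet() + " evicted=" + cache.evicted);
    }
}
```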
 
 
-