Data Mover Management

The centralized dashboard lets users monitor task progress with real-time updates on data status. Users can define an unlimited number of simultaneous tasks to move, archive, and protect data across various storage tiers.

Task management features provide control and visibility.

To start, create a task and choose a workflow:

  1. Active Archive

  2. Offline Archive

  3. Data Exchange

  4. Maintenance

Each workflow has its own settings:

Task Filtering

Each workflow task allows for the configuration of various filtering options. There are two available modes for filter configuration:

  • Basic

  • Advanced

Basic Mode

This mode lets you define expressions directly in the Console, based on the following filter types:

Type of Filter
Operator
Description

Creation Date

Older Than, Earlier Than, Before, After

Filters the selection by file creation date. Only files created before or after a specific date are selected.

File Name

Matches, Doesn't Match

Filters the selection by file name.

Examples: '.' stands for any character and '*' for zero or more occurrences.

Regular expressions are also supported.

File extension

Equal, Doesn't Match

Filters the selection by file extension.

The extension name must be specified. Example: pdf

File Path

Matches, Doesn't Match

Filters the selection by folder path.

Regular expressions are also supported.

Last access date

Older Than, Earlier Than, Before, After

Same as Creation Date, but based on when the file was last accessed.

Last modification date

Older Than, Earlier Than, Before, After

Filters the selection by last modification date. Only files modified before or after a specific date are selected.

Size

More than, Less than

Filters the selection by file size.
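To make the File Name and File extension rows more concrete, here is a minimal Python sketch of how such rules could be evaluated against a path. It is only an illustration: the function names are hypothetical, and both the unanchored regular-expression search and the case-insensitive extension comparison are assumptions, not documented Console behavior.

```python
import re
from pathlib import PurePosixPath


def name_matches(path: str, pattern: str) -> bool:
    """'File Name / Matches': search the last path component for a regex."""
    return re.search(pattern, PurePosixPath(path).name) is not None


def extension_equals(path: str, extension: str) -> bool:
    """'File extension / Equal': compare the suffix, given without the dot."""
    suffix = PurePosixPath(path).suffix.lstrip(".")
    # Case-insensitive comparison is an assumption made for this sketch.
    return suffix.lower() == extension.lower()


# '.' stands for any character and '*' repeats it, so 'report_.*'
# accepts any file name that contains 'report_'.
print(name_matches("/data/report_2023.pdf", r"report_.*"))
print(extension_equals("/data/report_2023.pdf", "pdf"))
```

The "Doesn't Match" operators would simply negate these results.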

Advanced Mode

This mode provides an online scripting editor for defining advanced filters that cannot be expressed in Basic mode.

The scripting language is Lua, a widely used language with documentation available online.

The scripting language supports:

  • Logical operators such as 'and' and 'or'

  • Defined functions

Example of a Lua script

Standard filtering methods are available:

Filtering by the size of the file

Example of Lua code

Result

The workflow only processes files larger than 234000 KB.

file

The file being processed

">"

Comparison symbol

234000

File size threshold (in KB)
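As a rough illustration of the comparison these three parameters describe, the following Python sketch applies a comparison symbol to a file size. It is not the Lua filter function itself: the function name is hypothetical, and support for "<" alongside ">" is an assumption based on the More than / Less than operators of Basic mode.

```python
def passes_size_filter(size_kb: float, operator: str, threshold_kb: float) -> bool:
    """Return True when the file size satisfies the comparison."""
    if operator == ">":
        return size_kb > threshold_kb
    if operator == "<":
        return size_kb < threshold_kb
    raise ValueError(f"unsupported comparison symbol: {operator!r}")


# A 500000 KB file checked against the 234000 KB threshold of the example.
print(passes_size_filter(500_000, ">", 234000))
```

Note that the comparison is strict in this sketch, so a file of exactly 234000 KB is not selected.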

Filtering by the name of the file

Example of Lua code

Result

The workflow only processes files located in a folder whose name contains "archive".

file

The file being processed

"\\b(\\w*archive\\w*)\\b"

Regular expression
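To see what this regular expression accepts, it can be tried outside the Console. The Python sketch below uses the same pattern; the helper name and the sample paths are hypothetical, and Python's `re` engine is assumed to behave like the engine used by the product for this pattern.

```python
import re

# Same pattern as above; a Python raw string needs only single backslashes.
ARCHIVE_WORD = re.compile(r"\b(\w*archive\w*)\b")


def find_archive_word(path: str):
    """Return the first word containing 'archive', or None if there is none."""
    match = ARCHIVE_WORD.search(path)
    return match.group(1) if match else None


print(find_archive_word("/projects/old_archive_2020/scan.tif"))
print(find_archive_word("/projects/live/scan.tif"))
```

The pattern is case-sensitive, so a folder named "Archive" would not be selected unless the expression is adapted.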

Filtering by the creation date

Example of Lua code

Result

The workflow only processes files with a creation date older than 20 days.

file

The file being processed

"C"

Relative to the Creation Date

">"

">" and "<" are available (older than, earlier than)

20

Number of years, months, days, hours, minutes, or seconds

"D"

"Y", "M", "D", "H", "M", "S" are available for (Year, Month, Day, Hour, Minute, Second)
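The relative-date parameters above can be read as "compare the age of the file with an amount of time". The Python sketch below models that reading under stated assumptions: the function name is hypothetical, ">" is taken to mean "age greater than the amount" and "<" the opposite, and only the fixed-length units "D", "H" and "S" are handled (years and months vary in length, and "M" is listed for both Month and Minute).

```python
from datetime import datetime, timedelta

# Only fixed-length units are modelled in this sketch.
_UNIT_TO_TIMEDELTA_ARG = {"D": "days", "H": "hours", "S": "seconds"}


def passes_age_filter(timestamp: datetime, operator: str, amount: int,
                      unit: str, now: datetime) -> bool:
    """Compare the age of a timestamp (creation, access, ...) with a limit."""
    if unit not in _UNIT_TO_TIMEDELTA_ARG:
        raise ValueError(f"unit not handled in this sketch: {unit!r}")
    limit = timedelta(**{_UNIT_TO_TIMEDELTA_ARG[unit]: amount})
    age = now - timestamp
    if operator == ">":
        return age > limit   # older than the limit
    if operator == "<":
        return age < limit   # more recent than the limit
    raise ValueError(f"unsupported comparison symbol: {operator!r}")


now = datetime(2024, 1, 31)
created = datetime(2024, 1, 1)  # 30 days before 'now'
print(passes_age_filter(created, ">", 20, "D", now))
```

Passing `now` explicitly keeps the check reproducible; a real filter would use the current time.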

Filtering by the modification date

Example of Lua code

Result

The workflow only processes files with a last access date older than 20 days.

file

The file being processed

"C"

Relative to the Last Access Date

">"

">" and "<" are available (older than, earlier than)

20

Number of years, months, days, hours, minutes, or seconds

"D"

"Y", "M", "D", "H", "M", "S" are available for (Year, Month, Day, Hour, Minute, Second)

Some additional information that can be helpful when using this feature:

Data Protection / Offline Archive

Definition

A Data Protection or Offline Archiving task secures selected data on a dedicated storage tier. Access to the data is managed by the administrator in charge of the workflow. The workflow only allows the data to be read or retrieved online.

Depending on the medium chosen, data can be physically moved off-site. In that case, human intervention is required to bring the data back online.

Users have the option of running an offline workflow that will copy or move the data to a secondary storage pool.

Why use this workflow?

  1. Protect stored content that is online (online means "modifiable" by users or applications). The workflow allows this data to be copied according to defined criteria.

  2. Archive data on disconnected media, with the possibility of bringing it back online or keeping it off-site.

Traceability

Complete traceability is kept.

Users and administrators keep an overview of their data movements, even when the data has been secured offline.

With a tape library, data can even be moved out of the library (off-site). All contents are stored in an open format, using common standards.

Available Criteria

Source
Destination
Type of action Mover

Primary Storage :

  • NAS

  • Object Storage

Secondary Storage :

  • Tape Pools

  • Cloud Pools

  • NAS Pools

  1. Copy

  2. Move

  3. Erase

Container

Secondary Storage :

  • Tape Pools

  • Cloud Pools

  • NAS Pools

  1. Copy

  2. Move

  3. Erase

Restore Source

Destination

  1. Move

Tape Pools

  • Container (cache)

  • Primary Storage (NAS or Object Storage)

  1. Copy

  2. Move

Cloud Pools

  • Container (cache)

  • Primary Storage (NAS or Object Storage)

  • Cloud Pool

  1. Copy

  2. Move

NAS Pools

  • Container (cache)

  • Primary Storage (NAS or Object Storage)

  • NAS Pool

  1. Copy

  2. Move

Task

  • Container (cache)

  • Primary Storage (NAS or Object Storage)

  1. Copy

  2. Move

From Reference File

  • Container (cache)

  • Primary Storage (NAS or Object Storage)

  1. Copy

  2. Move

Specific Option

Option
Description

Remove also selected folders

This option lets you delete the original folder structure after a Move operation.

  • It is only available in an Offline Archive – Move workflow from primary storage to tape storage.

  • By default, folders are kept in place, but their contents are moved (leaving them empty).

  • If you enable “Remove also selected folders”, both the files and their original folders will be removed from primary storage.
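The difference between the default behavior and this option can be sketched on a local directory tree. The Python example below is only an analogy for the effect on primary storage, not the product's implementation: `move_tree` and its `remove_folders` parameter are hypothetical names, and removing the selected root folder itself is an assumption.

```python
import shutil
from pathlib import Path


def move_tree(source: Path, destination: Path, remove_folders: bool = False) -> None:
    """Move every file under source to destination, keeping relative paths."""
    files = [p for p in source.rglob("*") if p.is_file()]
    for path in files:
        target = destination / path.relative_to(source)
        target.parent.mkdir(parents=True, exist_ok=True)
        shutil.move(str(path), str(target))
    if remove_folders:
        # Analogue of enabling the option: the emptied folders are deleted too.
        shutil.rmtree(source)
    # Otherwise the source folders stay in place, now empty.
```

Called with the default `remove_folders=False`, the source folders remain as empty directories; with `remove_folders=True`, they are deleted along with their files.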

Data Archiving / Active Archive

Before using this workflow, make sure to configure the container and cache. Learn more in the Container configuration section.

Definition

Data archiving moves a collection of data from an expensive primary storage tier to a secondary storage tier. The data may or may not be removed from primary storage, and it must remain accessible from time to time. Data archiving is suited to data that does not require intensive reading and writing but can still be modified or read occasionally.

Why use this workflow?

Data is stored on less expensive storage media, such as object storage, tape storage or even low-cost disks. An active archive includes functionality to seamlessly move data between different storage tiers (disk - tape - cloud) so that applications or users can access it directly.

This workflow moves files from where they are currently stored to the destination.

  • If files are in the Cache, the files are directly copied into the destination.

  • If files are in another “Active Archive” Secondary Storage, the files are recovered and copied into the cache and then copied into the final destination.
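The two cases above amount to a small routing decision. The Python sketch below only restates that logic; the function name and the location labels ("cache", "secondary") are hypothetical and do not correspond to product identifiers.

```python
def plan_movement(current_location: str) -> list:
    """Return the ordered copy steps for a file, based on where it is stored."""
    if current_location == "cache":
        # Already in the cache: a single copy to the destination.
        return ["cache -> destination"]
    if current_location == "secondary":
        # Stored on another Active Archive secondary storage:
        # recover it into the cache first, then copy it onwards.
        return ["secondary -> cache", "cache -> destination"]
    raise ValueError(f"unknown location: {current_location!r}")


print(plan_movement("secondary"))
```

In both cases the last step starts from the cache, which matches the statement that files always pass through the caching disk.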

The file directory structure does not change: each file stays in the same folder. Nodeum uses a Virtual File System and an internal stubbing mechanism. This means that the files always pass through the caching disk.

Data can be copied directly into the Container (SMB/NFS/S3); the defined workflow then executes the task movement linked to this container.

Task Monitoring

To aid in monitoring task execution, the Console offers a comprehensive monitoring feature. This feature provides a global overview of all task activities.

Three different sections are available:

Upcoming task

This section shows the scheduled task(s). For each task, you can see detailed information, including the date and time when the task will be executed.

You can skip an execution by clicking the cross icon on the right.

If a task was not executed properly, a message is displayed saying "task execution missed for one or multiple occurence".

Running task

This section provides the list of the task(s) which are currently running.

For each of these tasks, you can:

  1. See the details and the status.

  2. Pause the task execution.

  3. Resume a paused task.

  4. Stop the task execution.

History

In this section you can find all the tasks that have been completed.

List of tasks history

Task Report

For each running or finished task, you can open a global view of the task.

To see the full information, click the arrow beside the task:

List of Nodeum's tasks

Then review the task information:

Screenshot of a task execution

Definitions of task information:

Information
Definition

Timing

  1. Date and time at which the task started

  2. Elapsed time for the task

  3. Estimated remaining time to process the task

Type

  1. Offline Archive

  2. Active Archive

  3. Data Exchange

Move or Copy to a destination:

Network Shared Folder named Container

NAS

Tape

Processed size

Size of data processed

Processed items

Number of data items processed

To check which items were processed, click the Processed items tab.

Logs

Details of the task log

Task Log

Below is a table listing messages found in the logs, mostly arranged in the order in which they appear.

Message
Alternative value
Definition

Task is starting

n/a

The task starts.

Parallel Processing mode is Enabled

Parallel Processing mode is Disabled

In Enabled mode, all processes run concurrently, unlike in Disabled mode where processes run sequentially. Disabled mode is useful

Initialization Process is starting

n/a

The process in charge of the inventory starts.

License is valid

License is not valid ('description of the error'), task cannot be executed

Checks that the Nodeum license is valid.

Task Job Limits : 100 files/job - 2 parallel job(s) - 200 parallel DBFile/job - 1000000000 B/job

# parallel job(s) # parallel DBFile/job # B/job

Defines the number of files to be processed per batch. In this example, the task includes 2 jobs, each containing at most 200 files or at most 1 GB of data. When one of these limits is reached, another batch is defined.

Incremental Mode is Configured. The process will select only new content

Full Mode is Configured.

In Incremental mode, the process selects only new content. In Full mode, it selects all files to be processed.

Processing Order is Configured. (Ascending)

Processing Order is Configured. (Descending)

Controls the order in which files are processed. Ascending order gives the best performance.

Check integrity of data transfers: MD5

Check integrity of data transfers: CRC32
Check integrity of data transfers: XXHAS64

The type of integrity check is displayed.

Pool [pool name]: Tape [tape barcode]: VALID

n/a

This verifies if the associated pool contains a valid tape.

Pool [pool name]: 1 valid tape(s)

n/a

This summarizes the number of valid tapes in the associated pool.

Main Process is starting

n/a

The copy process starts.

Finalization Process is starting

n/a

The process responsible for managing catalog changes starts.

Tape [tape barcode]: is Mounted on the drive [drive s/n]

n/a

Indicates a request to mount a tape in a tape drive.

Tape [tape barcode] loaded on the drive [drive s/n] is Dismounted

n/a

Indicates a request to dismount a tape from a tape drive.

LTFS File System of the Tape [tape barcode] loaded on the drive [drive s/n] is Mounted

n/a

The LTFS partition is mounted for the specified tape in the specified tape drive.

LTFS File System of the Tape [tape barcode] loaded on the drive [drive s/n] is Umounted

n/a

The LTFS partition is unmounted for the specified tape in the specified tape drive.

Unable to Mount LTFS File System of the Tape [tape barcode] loaded on the drive [drive s/n]

n/a

The tape cannot mount its LTFS file system properly.

Unable to Copy a File. (Read-Only LTFS FileSystem, Try to mount it again)

n/a

Indicates that the tape is mounted with a read-only LTFS file system, so the file cannot be copied. The task will mount another tape. It is highly recommended to check the tape manually.

Total of items/size parsed: xxxx(yyyTB)

n/a

Lists the result of the Initialization process: the number of files and the total size to be processed by the task.

Total of items/size already up to date on the destination: xxxx(yyyTB)

n/a

This is the result of the verification of the number of files and the size already available at the destination.

Total of items/size to process on this schedule: xxxx(yyyTB)

n/a

This is the summary of what will be processed during this task execution.

Initialization Process is finished

n/a

This confirms that the Initialization process is finished and that the file inventory is available.

All Files were processed on this Rule

n/a

This confirms that all files have been processed.

Main Process is finished

n/a

The Main process is finished.

Retry made on volume [tape barcode]: xx

n/a

Shows the number of mount retries for the specified tape barcode.

Finalization Process is finished

n/a

The Finalization process is finished.

Task is finished with Success

n/a

The task has finished successfully.
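The Task Job Limits message listed in the table above can be read programmatically when reviewing logs. The Python sketch below parses the sample message and then groups files into batches. It rests on assumptions: the class and function names are hypothetical, the `files/job` and `B/job` values are treated as the per-job caps, and a batch is closed before a limit would be exceeded, which may differ from the product's exact rule.

```python
import re
from dataclasses import dataclass


@dataclass
class JobLimits:
    files_per_job: int
    parallel_jobs: int
    parallel_dbfiles_per_job: int
    bytes_per_job: int


_LIMITS_RE = re.compile(
    r"Task Job Limits\s*:\s*(\d+) files/job - (\d+) parallel job\(s\) - "
    r"(\d+) parallel DBFile/job - (\d+) B/job"
)


def parse_job_limits(line: str) -> JobLimits:
    """Extract the four numeric limits from a 'Task Job Limits' log message."""
    match = _LIMITS_RE.search(line)
    if match is None:
        raise ValueError("not a Task Job Limits message")
    return JobLimits(*(int(value) for value in match.groups()))


def split_into_jobs(files, limits: JobLimits):
    """Group (name, size_in_bytes) pairs into batches that respect the limits."""
    jobs, current, current_bytes = [], [], 0
    for name, size in files:
        too_many = len(current) >= limits.files_per_job
        too_big = current_bytes + size > limits.bytes_per_job
        if current and (too_many or too_big):
            jobs.append(current)
            current, current_bytes = [], 0
        current.append(name)
        current_bytes += size
    if current:
        jobs.append(current)
    return jobs


sample = ("Task Job Limits : 100 files/job - 2 parallel job(s) - "
          "200 parallel DBFile/job - 1000000000 B/job")
print(parse_job_limits(sample))
```

A file larger than the byte limit still gets a batch of its own in this sketch, so no file is ever dropped.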
