Named Queues Proposal

Table of Contents Purpose API Changes Commandline Changes Schema Change Target Release Comments

Purpose

Add Resque-style named queues while retaining DJ-style priority. Named queues provide a convenient system for grouping tasks to be worked by separate pools of workers, which may be scaled and controlled individually.

Named queues should be:

arbitrarily named and created on-the-fly (just put a job in a queue and that queue exists)
ignored by default (Workers should pull from all queues unless told not to for backwards compatibility)
efficient (Backends must properly index so that queues are efficient even when large)
ignorant of the priority and delaying system. Priority and run_at should continue to function as before when jobs are segmented by queue.

API Changes

Old ways still work:

  object.delay.method
  Delayed::Job.enqueue job

Assigning to a queue:

  object.delay(:queue => 'tracking').method
  Delayed::Job.enqueue job, :queue => 'tracking'

Commandline Changes

If no queue option is specified, workers will work all queues. More than one queue may be specified to a worker. Both singular and plural queue option names are provided for convenience and are interchangeable.

rake:

  QUEUE=tracking rake jobs:run
  QUEUES=one,two rake jobs:run

daemon:

  delayed_job --queue=tracking start
  delayed_job --queues=tracking,two start

Schema Change

New column: queue (string, nullable) Change index: add queue to existing priority,run_at index

TODO:

determine if queue field should come at the start or end of the index
determine if it's more efficient to make queue not-null and use a default queue name

Target Release

Since there is both a schema change for the AR backend and implementation changes for all backends to coordinate, this feature should be targeted for the next minor-version release of DJ (2.2). The upgrade process from 2.0 and 2.1 should be documented and a generator provided for the AR schema migration.

Comments

John H: What is the goal you're trying to accomplish here? I'm guessing to separate long-running and short-running jobs so that long-running (or error-prone) classes of jobs don't bottleneck short-running jobs that you might want to be more responsive. What about being able to specify a min-priority on workers? Would that let you accomplish what you're after? To illustrate:

create 2 workers
set one workers min priority to 50
set short-running/responsive jobs to have higher priority than slow or long-running jobs
now, one worker will never pick up long-running tasks, and you'll always have one working on just high-priority jobs. You could scale this and manage the priority of jobs to ensure you always have workers waiting for high-priority tasks, and then just have one digging through your low-priority backlog.

Giangprosite_job

Provide feedback

Saved searches

Use saved searches to filter your results more quickly