Class: Fiber

Relationships & Source Files
Namespace Children
Classes: `Pool`, `Scheduler`
Inherits:	Object
Defined in:	cont.c, cont.c

Overview

Fibers are primitives for implementing light weight cooperative concurrency in ::Ruby. Basically they are a means of creating code blocks that can be paused and resumed, much like threads. The main difference is that they are never preempted and that the scheduling must be done by the programmer and not the VM.

As opposed to other stackless light weight concurrency models, each fiber comes with a stack. This enables the fiber to be paused from deeply nested function calls within the fiber block. See the ruby(1) manpage to configure the size of the fiber stack(s).

When a fiber is created it will not run automatically. Rather it must be explicitly asked to run using the #resume method. The code running inside the fiber can give up control by calling .yield in which case it yields control back to caller (the caller of the #resume).

Upon yielding or termination the Fiber returns the value of the last executed expression

For instance:

fiber = Fiber.new do
  Fiber.yield 1
  2
end

puts fiber.resume
puts fiber.resume
puts fiber.resume

produces

1
2
FiberError: dead fiber called

The #resume method accepts an arbitrary number of parameters, if it is the first call to #resume then they will be passed as block arguments. Otherwise they will be the return value of the call to .yield

Example:

fiber = Fiber.new do |first|
  second = Fiber.yield first + 2
end

puts fiber.resume 10
puts fiber.resume 1_000_000
puts fiber.resume "The fiber will be dead before I can cause trouble"

produces

12
1000000
FiberError: dead fiber called

Non-blocking Fibers

The concept of non-blocking fiber was introduced in ::Ruby 3.0. A non-blocking fiber, when reaching an operation that would normally block the fiber (like sleep, or wait for another process or I/O) will yield control to other fibers and allow the scheduler to handle blocking and waking up (resuming) this fiber when it can proceed.

For a Fiber to behave as non-blocking, it need to be created in .new with blocking: false (which is the default), and .scheduler should be set with .set_scheduler. If .scheduler is not set in the current thread, blocking and non-blocking fibers’ behavior is identical.

::Ruby doesn’t provide a scheduler class: it is expected to be implemented by the user and correspond to Scheduler.

There is also .schedule method, which is expected to immediately perform the given block in a non-blocking manner. Its actual implementation is up to the scheduler.

Class Attribute Summary

.blocking {|fiber| ... } ⇒ result readonly

Forces the fiber to be blocking for the duration of the block.
.blocking? ⇒ Boolean readonly

Returns false if the current fiber is non-blocking.

Class Method Summary

.[](key) ⇒ value

Returns the value of the fiber storage variable identified by key.
.[]=(key, value)

Assign value to the fiber storage variable identified by key.
.current ⇒ Fiber

Returns the current fiber.
.current_scheduler ⇒ Object^?

Returns the Fiber scheduler, that was last set for the current thread with .set_scheduler if and only if the current fiber is non-blocking.
.new(blocking: false, storage: true) {|*args| ... } ⇒ Fiber constructor

Creates new Fiber.
.schedule {|*args| ... } ⇒ Fiber

The method is expected to immediately run the provided block of code in a separate non-blocking fiber.
.scheduler ⇒ Object^?

Returns the Fiber scheduler, that was last set for the current thread with .set_scheduler.
.set_scheduler(scheduler) ⇒ scheduler

Sets the Fiber scheduler for the current thread.
.yield(args, ...) ⇒ Object

Yields control back to the context that resumed the fiber, passing along any arguments that were passed to it.

Instance Attribute Summary

#alive? ⇒ Boolean readonly

Returns true if the fiber can still be resumed (or transferred to).
#blocking? ⇒ Boolean readonly

Returns true if fiber is blocking and false otherwise.
#storage ⇒ hash (dup) rw

Returns a copy of the storage hash for the fiber.
#storage=(hash) rw

Sets the storage hash for the fiber.

Instance Method Summary

#backtrace ⇒ Array

Returns the current execution stack of the fiber.
#backtrace_locations ⇒ Array

Like #backtrace, but returns each line of the execution stack as a ::Thread::Backtrace::Location.
#inspect

Alias for #to_s.
#kill ⇒ nil

Terminates the fiber by raising an uncatchable exception.
#raise ⇒ Object

Raises an exception in the fiber at the point at which the last .yield was called.
#resume(args, ...) ⇒ Object

Resumes the fiber from the point at which the last .yield was called, or starts running it if it is the first call to #resume.
#to_s (also: #inspect)
#transfer(args, ...) ⇒ Object

Transfer control to another fiber, resuming it from where it last stopped or starting it if it was not resumed before.

Constructor Details

.new(blocking: false, storage: true) {|*args| ... } ⇒ `Fiber`

Creates new Fiber. Initially, the fiber is not running and can be resumed with #resume. Arguments to the first #resume call will be passed to the block:

f = Fiber.new do |initial|
   current = initial
   loop do
     puts "current: #{current.inspect}"
     current = Fiber.yield
   end
end
f.resume(100)     # prints: current: 100
f.resume(1, 2, 3) # prints: current: [1, 2, 3]
f.resume          # prints: current: nil
# ... and so on ...

If blocking: false is passed to new, and current thread has a .scheduler defined, the Fiber becomes non-blocking (see “Non-blocking Fibers” section in class docs).

If the #storage is unspecified, the default is to inherit a copy of the storage from the current fiber. This is the same as specifying storage: true.

Fiber[:x] = 1
Fiber.new do
  Fiber[:x] # => 1
  Fiber[:x] = 2
end.resume
Fiber[:x] # => 1

If the given #storage is nil, this function will lazy initialize the internal storage, which starts as an empty hash.

Fiber[:x] = "Hello World"
Fiber.new(storage: nil) do
  Fiber[:x] # nil
end

Otherwise, the given #storage is used as the new fiber’s storage, and it must be an instance of ::Hash.

Explicitly using storage: true is currently experimental and may change in the future.

[ GitHub ]

# File 'cont.c', line 2353


static VALUE
rb_fiber_initialize(int argc, VALUE* argv, VALUE self)
{
    return rb_fiber_initialize_kw(argc, argv, self, rb_keyword_given_p());
}

Class Attribute Details

.blocking {|fiber| ... } ⇒ `result` (readonly)

Forces the fiber to be blocking for the duration of the block. Returns the result of the block.

See the “Non-blocking fibers” section in class docs for details.

[ GitHub ]

# File 'cont.c', line 2834


VALUE
rb_fiber_blocking(VALUE class)
{
    VALUE fiber_value = rb_fiber_current();
    rb_fiber_t *fiber = fiber_ptr(fiber_value);

    // If we are already blocking, this is essentially a no-op:
    if (fiber->blocking) {
        return rb_yield(fiber_value);
    }
    else {
        return rb_ensure(fiber_blocking_yield, fiber_value, fiber_blocking_ensure, fiber_value);
    }
}

.blocking? ⇒ `Boolean` (readonly)

Returns false if the current fiber is non-blocking. Fiber is non-blocking if it was created via passing blocking: false to .new, or via .schedule.

If the current Fiber is blocking, the method returns 1. Future developments may allow for situations where larger integers could be returned.

Note that, even if the method returns false, Fiber behaves differently only if .scheduler is set in the current thread.

See the “Non-blocking fibers” section in class docs for details.

[ GitHub ]

# File 'cont.c', line 2867


static VALUE
rb_fiber_s_blocking_p(VALUE klass)
{
    rb_thread_t *thread = GET_THREAD();
    unsigned blocking = thread->blocking;

    if (blocking == 0)
        return Qfalse;

    return INT2NUM(blocking);
}

Class Method Details

.[](key) ⇒ `value`

Returns the value of the fiber storage variable identified by key.

The key must be a symbol, and the value is set by Fiber#[]= or Fiber#store.

.[]=(key, value)

Assign value to the fiber storage variable identified by key. The variable is created if it doesn’t exist.

key must be a ::Symbol, otherwise a ::TypeError is raised.

.current ⇒ `Fiber`

Returns the current fiber. If you are not running in the context of a fiber this method will return the root fiber.

[ GitHub ]

# File 'cont.c', line 3306


static VALUE
rb_fiber_s_current(VALUE klass)
{
    return rb_fiber_current();
}

.current_scheduler ⇒ Object^?

Returns the Fiber scheduler, that was last set for the current thread with .set_scheduler if and only if the current fiber is non-blocking.

[ GitHub ]

# File 'cont.c', line 2459


static VALUE
rb_fiber_current_scheduler(VALUE klass)
{
    return rb_fiber_scheduler_current();
}

.schedule {|*args| ... } ⇒ `Fiber`

The method is expected to immediately run the provided block of code in a separate non-blocking fiber.

puts "Go to sleep!"

Fiber.set_scheduler(MyScheduler.new)

Fiber.schedule do
  puts "Going to sleep"
  sleep(1)
  puts "I slept well"
end

puts "Wakey-wakey, sleepyhead"

Assuming MyScheduler is properly implemented, this program will produce:

Go to sleep!
Going to sleep
Wakey-wakey, sleepyhead
#...1 sec pause here...
I slept well

…e.g. on the first blocking operation inside the Fiber (sleep(1)), the control is yielded to the outside code (main fiber), and at the end of that execution, the scheduler takes care of properly resuming all the blocked fibers.

Note that the behavior described above is how the method is expected to behave, actual behavior is up to the current scheduler’s implementation of Scheduler#fiber method. ::Ruby doesn’t enforce this method to behave in any particular way.

If the scheduler is not set, the method raises RuntimeError (No scheduler is available!).

[ GitHub ]

# File 'cont.c', line 2429


static VALUE
rb_fiber_s_schedule(int argc, VALUE *argv, VALUE obj)
{
    return rb_fiber_s_schedule_kw(argc, argv, rb_keyword_given_p());
}

.scheduler ⇒ Object^?

Returns the Fiber scheduler, that was last set for the current thread with .set_scheduler. Returns nil if no scheduler is set (which is the default), and non-blocking fibers’ behavior is the same as blocking. (see “Non-blocking fibers” section in class docs for details about the scheduler concept).

[ GitHub ]

# File 'cont.c', line 2445


static VALUE
rb_fiber_s_scheduler(VALUE klass)
{
    return rb_fiber_scheduler_get();
}

.set_scheduler(scheduler) ⇒ scheduler

Sets the Fiber scheduler for the current thread. If the scheduler is set, non-blocking fibers (created by .new with blocking: false, or by .schedule) call that scheduler’s hook methods on potentially blocking operations, and the current thread will call scheduler’s close method on finalization (allowing the scheduler to properly manage all non-finished fibers).

.scheduler can be an object of any class corresponding to ::Fiber::Scheduler. Its implementation is up to the user.

See also the “Non-blocking fibers” section in class docs.

[ GitHub ]

# File 'cont.c', line 2481


static VALUE
rb_fiber_set_scheduler(VALUE klass, VALUE scheduler)
{
    return rb_fiber_scheduler_set(scheduler);
}

.yield(args, ...) ⇒ Object

Yields control back to the context that resumed the fiber, passing along any arguments that were passed to it. The fiber will resume processing at this point when #resume is called next. Any arguments passed to the next #resume will be the value that this yield expression evaluates to.

[ GitHub ]

# File 'cont.c', line 3197


static VALUE
rb_fiber_s_yield(int argc, VALUE *argv, VALUE klass)
{
    return rb_fiber_yield_kw(argc, argv, rb_keyword_given_p());
}

Instance Attribute Details

#alive? ⇒ `Boolean` (readonly)

Returns true if the fiber can still be resumed (or transferred to). After finishing execution of the fiber block this method will always return false.

[ GitHub ]

# File 'cont.c', line 2975


VALUE
rb_fiber_alive_p(VALUE fiber_value)
{
    return RBOOL(!FIBER_TERMINATED_P(fiber_ptr(fiber_value)));
}

#blocking? ⇒ `Boolean` (readonly)

Returns true if fiber is blocking and false otherwise. Fiber is non-blocking if it was created via passing blocking: false to .new, or via .schedule.

Note that, even if the method returns false, the fiber behaves differently only if .scheduler is set in the current thread.

See the “Non-blocking fibers” section in class docs for details.

[ GitHub ]

# File 'cont.c', line 2788


VALUE
rb_fiber_blocking_p(VALUE fiber)
{
    return RBOOL(fiber_ptr(fiber)->blocking);
}

#storage ⇒ `hash` (`dup`) (rw)

Returns a copy of the storage hash for the fiber. The method can only be called on the .current.

[ GitHub ]

# File 'cont.c', line 2097


static VALUE
rb_fiber_storage_get(VALUE self)
{
    storage_access_must_be_from_same_fiber(self);

    VALUE storage = fiber_storage_get(fiber_ptr(self), FALSE);

    if (storage == Qnil) {
        return Qnil;
    }
    else {
        return rb_obj_dup(storage);
    }
}

#storage=(hash) (rw)

Sets the storage hash for the fiber. This feature is experimental and may change in the future. The method can only be called on the .current.

You should be careful about using this method as you may inadvertently clear important fiber-storage state. You should mostly prefer to assign specific keys in the storage using .[]=.

You can also use Fiber.new(storage: nil) to create a fiber with an empty storage.

Example:

while request = request_queue.pop
  # Reset the per-request state:
  Fiber.current.storage = nil
  handle_request(request)
end

[ GitHub ]

# File 'cont.c', line 2159


static VALUE
rb_fiber_storage_set(VALUE self, VALUE value)
{
    if (rb_warning_category_enabled_p(RB_WARN_CATEGORY_EXPERIMENTAL)) {
        rb_category_warn(RB_WARN_CATEGORY_EXPERIMENTAL,
          "Fiber#storage= is experimental and may be removed in the future!");
    }

    storage_access_must_be_from_same_fiber(self);
    fiber_storage_validate(value);

    fiber_ptr(self)->cont.saved_ec.storage = rb_obj_dup(value);
    return value;
}

Instance Method Details

#backtrace ⇒ Array #backtrace(start) ⇒ Array #backtrace(start, count) ⇒ Array #backtrace(start..end) ⇒ Array

Returns the current execution stack of the fiber. start, count and end allow to select only parts of the backtrace.

def level3
  Fiber.yield
end

def level2
  level3
end

def level1
  level2
end

f = Fiber.new { level1 }

# It is empty before the fiber started
f.backtrace
#=> []

f.resume

f.backtrace
#=> ["test.rb:2:in `yield'", "test.rb:2:in `level3'", "test.rb:6:in `level2'", "test.rb:10:in `level1'", "test.rb:13:in `block in <main>'"]
p f.backtrace(1) # start from the item 1
#=> ["test.rb:2:in `level3'", "test.rb:6:in `level2'", "test.rb:10:in `level1'", "test.rb:13:in `block in <main>'"]
p f.backtrace(2, 2) # start from item 2, take 2
#=> ["test.rb:6:in `level2'", "test.rb:10:in `level1'"]
p f.backtrace(1..3) # take items from 1 to 3
#=> ["test.rb:2:in `level3'", "test.rb:6:in `level2'", "test.rb:10:in `level1'"]

f.resume

# It is nil after the fiber is finished
f.backtrace
#=> nil

[ GitHub ]

# File 'cont.c', line 3048


static VALUE
rb_fiber_backtrace(int argc, VALUE *argv, VALUE fiber)
{
    return rb_vm_backtrace(argc, argv, &fiber_ptr(fiber)->cont.saved_ec);
}

#backtrace_locations ⇒ Array #backtrace_locations(start) ⇒ Array #backtrace_locations(start, count) ⇒ Array #backtrace_locations(start..end) ⇒ Array

Like #backtrace, but returns each line of the execution stack as a ::Thread::Backtrace::Location. Accepts the same arguments as #backtrace.

f = Fiber.new { Fiber.yield }
f.resume
loc = f.backtrace_locations.first
loc.label  #=> "yield"
loc.path   #=> "test.rb"
loc.lineno #=> 1

[ GitHub ]

# File 'cont.c', line 3073


static VALUE
rb_fiber_backtrace_locations(int argc, VALUE *argv, VALUE fiber)
{
    return rb_vm_backtrace_locations(argc, argv, &fiber_ptr(fiber)->cont.saved_ec);
}

#inspect

Alias for #to_s.

#kill ⇒ `nil`

Terminates the fiber by raising an uncatchable exception. It only terminates the given fiber and no other fiber, returning nil to another fiber if that fiber was calling #resume or #transfer.

kill only interrupts another fiber when it is in .yield. If called on the current fiber then it raises that exception at the kill call site.

If the fiber has not been started, transition directly to the terminated state.

If the fiber is already terminated, does nothing.

Raises FiberError if called on a fiber belonging to another thread.

[ GitHub ]

# File 'cont.c', line 3276


static VALUE
rb_fiber_m_kill(VALUE self)
{
    rb_fiber_t *fiber = fiber_ptr(self);

    if (fiber->killed) return Qfalse;
    fiber->killed = 1;

    if (fiber->status == FIBER_CREATED) {
        fiber->status = FIBER_TERMINATED;
    }
    else if (fiber->status != FIBER_TERMINATED) {
        if (fiber_current() == fiber) {
            fiber_check_killed(fiber);
        }
        else {
            fiber_raise(fiber_ptr(self), Qnil);
        }
    }

    return self;
}

#raise ⇒ Object #raise(string) ⇒ Object #raise(exception [, string [, array]]) ⇒ Object

Raises an exception in the fiber at the point at which the last .yield was called. If the fiber has not been started or has already run to completion, raises ::FiberError. If the fiber is yielding, it is resumed. If it is transferring, it is transferred into. But if it is resuming, raises ::FiberError.

With no arguments, raises a ::RuntimeError. With a single ::String argument, raises a ::RuntimeError with the string as a message. Otherwise, the first parameter should be the name of an ::Exception class (or an object that returns an ::Exception object when sent an exception message). The optional second parameter sets the message associated with the exception, and the third parameter is an array of callback information. Exceptions are caught by the rescue clause of begin...end blocks.

Raises ::FiberError if called on a Fiber belonging to another ::Thread.

See Kernel.raise for more information.

[ GitHub ]

# File 'cont.c', line 3253


static VALUE
rb_fiber_m_raise(int argc, VALUE *argv, VALUE self)
{
    return rb_fiber_raise(self, argc, argv);
}

#resume(args, ...) ⇒ Object

Resumes the fiber from the point at which the last .yield was called, or starts running it if it is the first call to #resume. Arguments passed to resume will be the value of the .yield expression or will be passed as block parameters to the fiber’s block if this is the first #resume.

Alternatively, when resume is called it evaluates to the arguments passed to the next .yield statement inside the fiber’s block or to the block value if it runs to completion without any .yield

[ GitHub ]

# File 'cont.c', line 2996


static VALUE
rb_fiber_m_resume(int argc, VALUE *argv, VALUE fiber)
{
    return rb_fiber_resume_kw(fiber, argc, argv, rb_keyword_given_p());
}

#to_s Also known as: #inspect

[ GitHub ]

# File 'cont.c', line 3312


static VALUE
fiber_to_s(VALUE fiber_value)
{
    const rb_fiber_t *fiber = fiber_ptr(fiber_value);
    const rb_proc_t *proc;
    char status_info[0x20];

    if (fiber->resuming_fiber) {
        snprintf(status_info, 0x20, " (%s by resuming)", fiber_status_name(fiber->status));
    }
    else {
        snprintf(status_info, 0x20, " (%s)", fiber_status_name(fiber->status));
    }

    if (!rb_obj_is_proc(fiber->first_proc)) {
        VALUE str = rb_any_to_s(fiber_value);
        strlcat(status_info, ">", sizeof(status_info));
        rb_str_set_len(str, RSTRING_LEN(str)-1);
        rb_str_cat_cstr(str, status_info);
        return str;
    }
    GetProcPtr(fiber->first_proc, proc);
    return rb_block_to_s(fiber_value, &proc->block, status_info);
}

#transfer(args, ...) ⇒ Object

Transfer control to another fiber, resuming it from where it last stopped or starting it if it was not resumed before. The calling fiber will be suspended much like in a call to .yield.

The fiber which receives the transfer call treats it much like a resume call. Arguments passed to transfer are treated like those passed to resume.

The two style of control passing to and from fiber (one is #resume and .yield, another is #transfer to and from fiber) can’t be freely mixed.

If the Fiber’s lifecycle had started with transfer, it will never be able to yield or be resumed control passing, only finish or transfer back. (It still can resume other fibers that are allowed to be resumed.)
If the Fiber’s lifecycle had started with resume, it can yield or transfer to another Fiber, but can receive control back only the way compatible with the way it was given away: if it had transferred, it only can be transferred back, and if it had yielded, it only can be resumed back. After that, it again can transfer or yield.

If those rules are broken ::FiberError is raised.

For an individual Fiber design, yield/resume is easier to use (the Fiber just gives away control, it doesn’t need to think about who the control is given to), while transfer is more flexible for complex cases, allowing to build arbitrary graphs of Fibers dependent on each other.

Example:

manager = nil # For local var to be visible inside worker block

# This fiber would be started with transfer
# It can't yield, and can't be resumed
worker = Fiber.new { |work|
  puts "Worker: starts"
  puts "Worker: Performed #{work.inspect}, transferring back"
  # Fiber.yield     # this would raise FiberError: attempt to yield on a not resumed fiber
  # manager.resume  # this would raise FiberError: attempt to resume a resumed fiber (double resume)
  manager.transfer(work.capitalize)
}

# This fiber would be started with resume
# It can yield or transfer, and can be transferred
# back or resumed
manager = Fiber.new {
  puts "Manager: starts"
  puts "Manager: transferring 'something' to worker"
  result = worker.transfer('something')
  puts "Manager: worker returned #{result.inspect}"
  # worker.resume    # this would raise FiberError: attempt to resume a transferring fiber
  Fiber.yield        # this is OK, the fiber transferred from and to, now it can yield
  puts "Manager: finished"
}

puts "Starting the manager"
manager.resume
puts "Resuming the manager"
# manager.transfer  # this would raise FiberError: attempt to transfer to a yielding fiber
manager.resume

produces

Starting the manager
Manager: starts
Manager: transferring 'something' to worker
Worker: starts
Worker: Performed "something", transferring back
Manager: worker returned "Something"
Resuming the manager
Manager: finished

[ GitHub ]

# File 'cont.c', line 3161


static VALUE
rb_fiber_m_transfer(int argc, VALUE *argv, VALUE self)
{
    return rb_fiber_transfer_kw(self, argc, argv, rb_keyword_given_p());
}

Class: Fiber

Overview

Non-blocking Fibers

Class Attribute Summary

Class Method Summary

Instance Attribute Summary

Instance Method Summary

Constructor Details

.new(blocking: false, storage: true) {|*args| ... } ⇒ Fiber

Class Attribute Details

.blocking {|fiber| ... } ⇒ result (readonly)

.blocking? ⇒ Boolean (readonly)

Class Method Details

.[](key) ⇒ value

.[]=(key, value)

.current ⇒ Fiber

.current_scheduler ⇒ Object?

.schedule {|*args| ... } ⇒ Fiber

.scheduler ⇒ Object?

.set_scheduler(scheduler) ⇒ scheduler

.yield(args, ...) ⇒ Object

Instance Attribute Details

#alive? ⇒ Boolean (readonly)

#blocking? ⇒ Boolean (readonly)

#storage ⇒ hash (dup) (rw)

#storage=(hash) (rw)

Instance Method Details

#backtrace ⇒ Array #backtrace(start) ⇒ Array #backtrace(start, count) ⇒ Array #backtrace(start..end) ⇒ Array

#backtrace_locations ⇒ Array #backtrace_locations(start) ⇒ Array #backtrace_locations(start, count) ⇒ Array #backtrace_locations(start..end) ⇒ Array

#inspect

#kill ⇒ nil

#raise ⇒ Object #raise(string) ⇒ Object #raise(exception [, string [, array]]) ⇒ Object

#resume(args, ...) ⇒ Object

#to_s Also known as: #inspect

#transfer(args, ...) ⇒ Object

.new(blocking: false, storage: true) {|*args| ... } ⇒ `Fiber`

.blocking {|fiber| ... } ⇒ `result` (readonly)

.blocking? ⇒ `Boolean` (readonly)

.[](key) ⇒ `value`

.current ⇒ `Fiber`

.current_scheduler ⇒ Object^?

.schedule {|*args| ... } ⇒ `Fiber`

.scheduler ⇒ Object^?

#alive? ⇒ `Boolean` (readonly)

#blocking? ⇒ `Boolean` (readonly)

#storage ⇒ `hash` (`dup`) (rw)

#kill ⇒ `nil`