CrackedRuby - Higher-Order Functions

Overview

Higher-order functions treat functions as first-class citizens, accepting functions as arguments, returning functions as results, or both. This concept originates from lambda calculus and functional programming, where functions operate on other functions to create new behavior without modifying existing code.

The significance extends beyond functional programming. Higher-order functions enable code reuse through composition, separate data transformation logic from iteration mechanics, and express complex operations declaratively. Ruby collections use higher-order functions extensively through methods like map, select, and reduce, which accept blocks or procs that define transformation logic.

# Higher-order function accepting a function (block)
numbers = [1, 2, 3, 4, 5]
doubled = numbers.map { |n| n * 2 }
# => [2, 4, 6, 8, 10]

# Higher-order function returning a function
def multiplier(factor)
  ->(x) { x * factor }
end

times_three = multiplier(3)
times_three.call(5)
# => 15

Higher-order functions appear throughout programming: event handlers accept callback functions, middleware chains pass request handlers, decorators wrap existing functions with additional behavior, and dependency injection frameworks pass function references to control flow.

Key Principles

Function Composition forms the foundation of higher-order functions. Composition combines simple functions into complex operations by passing the output of one function as input to another. This principle enables building sophisticated behavior from small, tested components without creating monolithic procedures.

# Function composition through chaining
result = [1, 2, 3, 4, 5]
  .map { |n| n * 2 }        # Transform each element
  .select { |n| n > 5 }     # Filter results
  .reduce(0) { |sum, n| sum + n }  # Aggregate
# => 18

Abstraction Over Iteration separates the mechanics of iteration from the logic applied to each element. Higher-order functions handle loop control, index management, and collection traversal, while the passed function defines only the transformation or filtering logic. This separation reduces bugs from manual iteration and makes code intent explicit.

Closure Capture allows returned functions to access variables from their creation context even after the outer function completes. The returned function "closes over" these variables, maintaining references to the original scope. This principle enables factory functions, configuration builders, and stateful function generators.

# Closure capturing outer scope
def counter(initial)
  count = initial
  -> { count += 1 }
end

increment = counter(10)
increment.call  # => 11
increment.call  # => 12
increment.call  # => 13

Referential Transparency in pure higher-order functions means calling the same function with the same arguments always produces identical results. Pure higher-order functions avoid side effects, making them easier to test, reason about, and parallelize. Ruby higher-order functions can be pure or impure depending on the passed blocks.

Lazy Evaluation defers computation until results are needed. Higher-order functions operating on lazy sequences avoid processing entire collections when only partial results are required. Ruby's Enumerator provides lazy evaluation through methods like lazy, preventing unnecessary computation.

# Lazy evaluation prevents processing all elements
(1..Float::INFINITY)
  .lazy
  .map { |n| n * n }
  .select { |n| n % 3 == 0 }
  .first(5)
# => [9, 36, 81, 144, 225]

Ruby Implementation

Ruby implements higher-order functions through blocks, procs, and lambdas. Blocks represent anonymous code chunks passed to methods, procs create reusable function objects, and lambdas behave like anonymous functions with strict argument checking. These mechanisms provide different trade-offs for different use cases.

Blocks are the most common higher-order function mechanism in Ruby. Methods accept blocks using the yield keyword or by capturing them with &block parameters. Blocks cannot be stored in variables directly but convert to procs when captured.

# Method accepting a block
def repeat(times)
  times.times { yield }
end

repeat(3) { puts "Hello" }
# Output:
# Hello
# Hello
# Hello

# Capturing block as proc
def apply_twice(&block)
  result = block.call(5)
  block.call(result)
end

apply_twice { |n| n * 2 }
# => 20

Procs are objects representing blocks, created with Proc.new or the proc method. Procs have lenient argument handling, ignoring extra arguments and treating missing arguments as nil. Return statements inside procs return from the enclosing method, not just the proc.

# Creating and using procs
square = Proc.new { |x| x * x }
square.call(5)  # => 25

# Procs as higher-order function arguments
def transform(array, operation)
  array.map(&operation)
end

transform([1, 2, 3], square)
# => [1, 4, 9]

# Proc return behavior
def proc_return_test
  my_proc = Proc.new { return "returned from proc" }
  my_proc.call
  "never reached"
end

proc_return_test
# => "returned from proc"

Lambdas are procs with strict semantics. Lambdas enforce argument count matching and return only from the lambda itself, not the enclosing method. The -> syntax provides concise lambda creation. Use lambdas when functions need predictable behavior across different contexts.

# Lambda creation and usage
multiply = ->(x, y) { x * y }
multiply.call(3, 4)  # => 12
multiply.call(3)     # ArgumentError: wrong number of arguments

# Lambda return behavior
def lambda_return_test
  my_lambda = -> { return "returned from lambda" }
  my_lambda.call
  "method continues"
end

lambda_return_test
# => "method continues"

# Multi-line lambda syntax
process = lambda do |data|
  transformed = data.map(&:upcase)
  transformed.select { |s| s.length > 3 }
end

process.call(["hi", "hello", "hey"])
# => ["HELLO"]

Symbol to Proc conversion provides shorthand for simple operations. The &:method_name syntax converts symbols to procs calling that method on each element. This pattern works only for single-method invocations without arguments.

# Symbol to proc conversion
["hello", "world"].map(&:upcase)
# => ["HELLO", "WORLD"]

# Equivalent explicit form
["hello", "world"].map { |s| s.upcase }
# => ["HELLO", "WORLD"]

# Chaining with symbol to proc
[1, 2, 3, 4, 5]
  .select(&:odd?)
  .map(&:to_s)
# => ["1", "3", "5"]

Method Objects convert named methods to callable objects using method or Method#to_proc. This approach enables passing existing methods as higher-order function arguments without wrapping them in blocks.

# Converting methods to procs
def double(x)
  x * 2
end

doubler = method(:double)
[1, 2, 3].map(&doubler)
# => [2, 4, 6]

# Instance methods as procs
class Calculator
  def add_ten(x)
    x + 10
  end
end

calc = Calculator.new
add_ten_proc = calc.method(:add_ten)
[5, 15, 25].map(&add_ten_proc)
# => [15, 25, 35]

Custom Higher-Order Methods accept blocks or procs to implement domain-specific iteration patterns. These methods encapsulate repetitive logic while allowing callers to customize behavior through passed functions.

# Custom higher-order function
class Array
  def map_with_index_tracking
    result = []
    each_with_index do |element, index|
      result << yield(element, index)
    end
    result
  end
end

["a", "b", "c"].map_with_index_tracking do |char, idx|
  "#{idx}: #{char}"
end
# => ["0: a", "1: b", "2: c"]

# Higher-order function returning a function
def predicate_combiner(*predicates)
  ->(value) { predicates.all? { |pred| pred.call(value) } }
end

positive = ->(x) { x > 0 }
even = ->(x) { x.even? }
small = ->(x) { x < 100 }

combined = predicate_combiner(positive, even, small)
combined.call(42)   # => true
combined.call(-2)   # => false
combined.call(200)  # => false

Common Patterns

Map Pattern transforms each element in a collection by applying a function, producing a new collection of the same size. Map operations preserve collection structure while changing element values. Use map when every element needs transformation and the collection size remains constant.

# Basic mapping
prices = [10.00, 25.50, 8.75]
with_tax = prices.map { |price| price * 1.08 }
# => [10.8, 27.54, 9.45]

# Mapping with multiple operations
users = [
  { name: "alice", age: 30 },
  { name: "bob", age: 25 }
]

formatted = users.map do |user|
  {
    display_name: user[:name].capitalize,
    birth_year: Time.now.year - user[:age]
  }
end
# => [{:display_name=>"Alice", :birth_year=>1995}, ...]

# Flat map for nested collections
nested = [[1, 2], [3, 4], [5, 6]]
nested.flat_map { |pair| pair.map { |n| n * 2 } }
# => [2, 4, 6, 8, 10, 12]

Filter Pattern selects elements matching a predicate function, producing a collection containing only matching elements. Ruby provides select for keeping matches and reject for removing matches. Use filters when collection size may decrease based on conditions.

# Filtering with select
numbers = (1..20).to_a
evens = numbers.select { |n| n.even? }
# => [2, 4, 6, 8, 10, 12, 14, 16, 18, 20]

# Filtering with reject
words = ["hello", "world", "hi", "there"]
long_words = words.reject { |w| w.length < 4 }
# => ["hello", "world", "there"]

# Chaining filters
products = [
  { name: "Widget", price: 10, in_stock: true },
  { name: "Gadget", price: 25, in_stock: false },
  { name: "Tool", price: 15, in_stock: true }
]

available_affordable = products
  .select { |p| p[:in_stock] }
  .select { |p| p[:price] < 20 }
  .map { |p| p[:name] }
# => ["Widget", "Tool"]

Reduce Pattern (fold) aggregates collection elements into a single value by repeatedly applying a binary function. The function receives an accumulator and current element, returning the new accumulator value. Reduce handles summation, finding extrema, building data structures, and custom aggregations.

# Basic reduction
numbers = [1, 2, 3, 4, 5]
sum = numbers.reduce(0) { |acc, n| acc + n }
# => 15

# Reduce without initial value uses first element
product = numbers.reduce { |acc, n| acc * n }
# => 120

# Building data structures with reduce
words = ["hello", "world", "hello", "ruby"]
word_counts = words.reduce(Hash.new(0)) do |counts, word|
  counts[word] += 1
  counts
end
# => {"hello"=>2, "world"=>1, "ruby"=>1}

# Complex reduction
transactions = [
  { type: :deposit, amount: 100 },
  { type: :withdrawal, amount: 50 },
  { type: :deposit, amount: 75 }
]

balance = transactions.reduce(0) do |total, tx|
  tx[:type] == :deposit ? total + tx[:amount] : total - tx[:amount]
end
# => 125

Partition Pattern splits a collection into two groups based on a predicate, returning both matching and non-matching elements. This pattern avoids multiple filter passes when both groups are needed.

# Partitioning into two groups
numbers = (1..10).to_a
evens, odds = numbers.partition(&:even?)
# evens => [2, 4, 6, 8, 10]
# odds => [1, 3, 5, 7, 9]

# Partitioning complex objects
orders = [
  { id: 1, status: :fulfilled },
  { id: 2, status: :pending },
  { id: 3, status: :fulfilled },
  { id: 4, status: :pending }
]

fulfilled, pending = orders.partition { |o| o[:status] == :fulfilled }

Composition Pattern combines multiple functions where the output of one becomes the input of the next. Function composition creates pipelines that transform data through stages. Ruby's method chaining provides implicit composition, while explicit composition uses lambdas.

# Implicit composition through chaining
result = "  hello world  "
  .strip
  .upcase
  .split
  .map { |w| w.reverse }
  .join("-")
# => "OLLEH-DLROW"

# Explicit function composition
add_tax = ->(price) { price * 1.08 }
round_cents = ->(price) { (price * 100).round / 100.0 }
format_currency = ->(price) { "$#{sprintf('%.2f', price)}" }

compose = ->(*fns) do
  ->(value) { fns.reduce(value) { |v, fn| fn.call(v) } }
end

price_formatter = compose.call(add_tax, round_cents, format_currency)
price_formatter.call(19.99)
# => "$21.59"

Partial Application Pattern fixes some arguments of a function, creating a new function with fewer parameters. Ruby implements partial application through lambda currying or explicit closures. Use partial application to create specialized functions from general ones.

# Manual partial application
def multiply(x, y)
  x * y
end

double = ->(y) { multiply(2, y) }
triple = ->(y) { multiply(3, y) }

double.call(5)  # => 10
triple.call(5)  # => 15

# Curry for automatic partial application
multiply_lambda = ->(x, y) { x * y }.curry
double_curry = multiply_lambda.call(2)
double_curry.call(5)  # => 10

# Partial application with multiple arguments
greet = ->(greeting, title, name) do
  "#{greeting}, #{title} #{name}!"
end.curry

hello = greet.call("Hello")
hello_doctor = hello.call("Dr.")
hello_doctor.call("Smith")
# => "Hello, Dr. Smith!"

Practical Examples

Data Pipeline Processing demonstrates higher-order functions transforming raw data through multiple stages. Each stage applies a specific transformation, with the pipeline composing these operations for end-to-end processing.

# Log entry processing pipeline
log_entries = [
  "2024-01-15 ERROR Failed to connect to database",
  "2024-01-15 INFO User logged in",
  "2024-01-15 ERROR Timeout on request",
  "2024-01-15 DEBUG Query executed in 45ms",
  "2024-01-16 ERROR Authentication failed"
]

# Pipeline: parse -> filter -> transform -> group
parse_entry = ->(line) do
  date, level, message = line.split(" ", 3)
  { date: date, level: level, message: message }
end

is_error = ->(entry) { entry[:level] == "ERROR" }

add_severity = ->(entry) do
  severity = case entry[:message]
    when /database|connection/ then :critical
    when /timeout/ then :high
    when /auth/ then :medium
    else :low
  end
  entry.merge(severity: severity)
end

errors_by_date = log_entries
  .map(&parse_entry)
  .select(&is_error)
  .map(&add_severity)
  .group_by { |e| e[:date] }
  .transform_values { |errors| errors.count }

# => {"2024-01-15"=>2, "2024-01-16"=>1}

Event Handler System uses higher-order functions to implement observer pattern functionality. Handlers register callbacks that execute when events occur, with the event system managing callback invocation.

# Event system with higher-order callbacks
class EventEmitter
  def initialize
    @listeners = Hash.new { |h, k| h[k] = [] }
  end

  def on(event, &handler)
    @listeners[event] << handler
  end

  def emit(event, data = nil)
    @listeners[event].each { |handler| handler.call(data) }
  end

  def once(event, &handler)
    wrapper = nil
    wrapper = lambda do |data|
      handler.call(data)
      @listeners[event].delete(wrapper)
    end
    @listeners[event] << wrapper
  end
end

# Using the event system
emitter = EventEmitter.new

emitter.on(:user_created) do |user|
  puts "Welcome email sent to #{user[:email]}"
end

emitter.on(:user_created) do |user|
  puts "User #{user[:name]} added to analytics"
end

emitter.once(:user_created) do |user|
  puts "First user bonus: #{user[:name]}"
end

emitter.emit(:user_created, { name: "Alice", email: "alice@example.com" })
# Output:
# Welcome email sent to alice@example.com
# User Alice added to analytics
# First user bonus: Alice

emitter.emit(:user_created, { name: "Bob", email: "bob@example.com" })
# Output:
# Welcome email sent to bob@example.com
# User Bob added to analytics

Retry Logic Builder creates configurable retry functions using higher-order function composition. The builder accepts a base operation and returns an enhanced function with retry behavior.

# Retry logic as higher-order function
def with_retry(max_attempts: 3, backoff: 1, &operation)
  ->(* args) do
    attempts = 0
    begin
      attempts += 1
      operation.call(*args)
    rescue StandardError => e
      if attempts < max_attempts
        sleep(backoff * attempts)
        retry
      else
        raise e
      end
    end
  end
end

# Simulated unreliable operation
@call_count = 0
unreliable_fetch = lambda do |url|
  @call_count += 1
  raise "Network error" if @call_count < 3
  "Data from #{url}"
end

# Wrap with retry logic
reliable_fetch = with_retry(max_attempts: 5, backoff: 0.5, &unreliable_fetch)
result = reliable_fetch.call("https://api.example.com")
# => "Data from https://api.example.com"
# (succeeds on 3rd attempt)

Validation Pipeline builds complex validation by composing simple validation functions. Each validator returns a lambda that checks one condition, and validators combine to form comprehensive validation logic.

# Composable validation functions
module Validators
  def self.present
    ->(value) do
      if value.nil? || value.to_s.strip.empty?
        ["must be present"]
      else
        []
      end
    end
  end

  def self.min_length(length)
    ->(value) do
      if value.to_s.length < length
        ["must be at least #{length} characters"]
      else
        []
      end
    end
  end

  def self.matches(pattern)
    ->(value) do
      if value.to_s.match?(pattern)
        []
      else
        ["format is invalid"]
      end
    end
  end

  def self.all(*validators)
    ->(value) do
      validators.flat_map { |validator| validator.call(value) }
    end
  end
end

# Building validators from composition
email_validator = Validators.all(
  Validators.present,
  Validators.matches(/@/)
)

password_validator = Validators.all(
  Validators.present,
  Validators.min_length(8),
  Validators.matches(/[A-Z]/),
  Validators.matches(/[0-9]/)
)

# Applying validators
email_validator.call("user@example.com")
# => []

email_validator.call("")
# => ["must be present"]

password_validator.call("weak")
# => ["must be at least 8 characters", "format is invalid", "format is invalid"]

Design Considerations

When to Use Higher-Order Functions depends on operation complexity and reusability requirements. Higher-order functions excel when the same iteration pattern applies to multiple operations, when operations need composition into pipelines, or when callbacks and event handlers drive control flow. Avoid higher-order functions for simple one-off operations where inline code remains clearer.

Use higher-order functions for collection transformations with consistent patterns across the codebase. A codebase performing similar transformations on different collections benefits from extracting the transformation logic into functions passed to generic higher-order methods. Direct iteration proves simpler when operations are unique and unlikely to recur.

# Higher-order function appropriate - reusable transformation
apply_discount = ->(rate) { ->(price) { price * (1 - rate) } }
apply_tax = ->(rate) { ->(price) { price * (1 + rate) } }

member_pricing = apply_discount.call(0.15)
final_price = ->(price) { apply_tax.call(0.08).call(member_pricing.call(price)) }

# Direct calculation simpler for one-off operations
single_price = base_price * 0.85 * 1.08

Trade-offs Between Blocks, Procs, and Lambdas affect error handling, argument validation, and performance. Blocks provide the simplest syntax for single-use functions passed directly to methods. Procs enable storing and passing functions as objects but have lenient argument handling. Lambdas enforce strict argument checking and predictable return behavior at the cost of slightly more verbose syntax.

Choose blocks for one-time operations with clear context at the call site. Choose procs when functions need storage, passing to multiple methods, or lenient argument handling helps. Choose lambdas when argument count validation matters, when functions return from complex nesting, or when the function object passes between different execution contexts.

Performance differences remain minimal for most use cases. Blocks compile directly into method calls, making them microscopically faster than proc or lambda invocations. This difference only matters in tight loops processing millions of elements. Prefer the approach that makes code intent clearest.

Balancing Functional and Imperative Styles requires considering code familiarity, debugging complexity, and performance needs. Pure functional chains using higher-order functions create immutable transformations easy to test and reason about. Imperative loops with mutation provide fine-grained control and avoid intermediate object allocation.

Functional style works best for stateless transformations on immutable data. Chaining map, select, and reduce operations expresses intent clearly when each step produces a new collection. Imperative style proves more efficient when building large collections incrementally, when early exit conditions terminate processing, or when complex state management spans iterations.

# Functional style - clear intent, creates intermediate arrays
result = data
  .map { |x| transform(x) }
  .select { |x| valid?(x) }
  .reduce({}) { |acc, x| acc.merge(process(x)) }

# Imperative style - efficient for large data
result = {}
data.each do |x|
  transformed = transform(x)
  next unless valid?(transformed)
  result.merge!(process(transformed))
end

Error Handling in Function Chains requires planning since errors in the middle of a chain can leave data in unexpected states. Ruby's exception handling interrupts functional chains, potentially leaving partial results. Design functions to either handle errors internally or propagate them consistently.

Options include wrapping operations in error-handling higher-order functions, using Result or Either types to represent success and failure explicitly, or breaking chains into steps with explicit error checks between stages. The monadic approach with Result types provides the most robust error handling but adds complexity.

# Error-handling wrapper
def safe_call(default_value = nil, &block)
  ->(* args) do
    begin
      block.call(*args)
    rescue StandardError => e
      default_value
    end
  end
end

# Using safe_call in pipeline
results = data
  .map(&safe_call(nil) { |x| risky_transform(x) })
  .compact
  .map { |x| further_processing(x) }

Testing Higher-Order Functions presents unique challenges since functions accept and return other functions. Test higher-order functions by verifying they correctly invoke passed functions, handle edge cases in function arguments, and compose functions properly. Test returned functions independently to verify closure capture and state management.

Mock passed functions to verify invocation count, argument passing, and return value handling. Test the higher-order function with various function arguments including edge cases like functions that raise errors, return nil, or have side effects. For functions returning functions, test both the factory function and the generated functions.

Reference

Core Higher-Order Function Methods

Method	Purpose	Returns	Example Use Case
map	Transform each element	New array of same size	Converting data types, applying calculations
select	Filter matching elements	New array of matches	Finding items meeting criteria
reject	Filter non-matching elements	New array excluding matches	Removing invalid items
reduce	Aggregate to single value	Single accumulated value	Summing, building hashes, finding extrema
flat_map	Map and flatten one level	Flattened array	Processing nested structures
partition	Split into two groups	Two arrays (matches, non-matches)	Separating valid and invalid items
group_by	Group by function result	Hash of groups	Categorizing items
find	Return first match	Single element or nil	Locating specific item
all?	Test if all match	Boolean	Validation checks
any?	Test if any match	Boolean	Existence checks
none?	Test if none match	Boolean	Absence verification

Lambda vs Proc vs Block Comparison

Feature	Block	Proc	Lambda
Syntax	do...end or {...}	Proc.new {...}	->(...) {...}
Storage	Cannot store directly	Stores in variables	Stores in variables
Passing	Pass to methods with yield	Pass as arguments	Pass as arguments
Argument checking	N/A (handled by method)	Lenient (ignores extras)	Strict (must match)
Return behavior	N/A	Returns from enclosing method	Returns from lambda only
Use when	One-time use at call site	Need lenient arguments	Need strict semantics

Common Patterns Quick Reference

Pattern	Operation	Ruby Method	Input -> Output
Transform	Apply function to each	map	[1,2,3] -> [2,4,6]
Filter	Select matching	select/reject	[1,2,3,4] -> [2,4]
Aggregate	Combine into one value	reduce	[1,2,3,4] -> 10
Flatten	Remove nesting	flat_map	[[1,2],[3,4]] -> [1,2,3,4]
Group	Organize by criteria	group_by	[1,2,3,4] -> {odd:[1,3], even:[2,4]}
Compose	Chain functions	method chaining	f(g(x))
Partial	Fix some arguments	curry	f(a,b) -> f(a)

Function Creation Syntaxes

Type	Single-line Syntax	Multi-line Syntax	Notes
Block	{...}	do...end	Cannot store in variable
Proc	proc {...}	Proc.new do...end	Lenient argument handling
Lambda	->(...) {...}	lambda do...\|...\|...end	Strict argument checking
Method	method(:name)	define with def	Convert with method()
Symbol	&:method_name	N/A	Works only for simple methods

Arity and Argument Handling

Aspect	Block	Proc	Lambda
Extra arguments	Ignored by method	Ignored	ArgumentError
Missing arguments	Assigned nil by method	Assigned nil	ArgumentError
Check arity	N/A	proc.arity (may be negative)	lambda.arity
Variable arguments	Handled by method	*args accepted	*args accepted
Keyword arguments	Handled by method	Supported	Supported

Common Enumerator Methods

Method	Description	Lazy Support	Common Usage
each	Iterate elements	Yes	Basic iteration
each_with_index	Iterate with index	Yes	Index-dependent operations
each_with_object	Iterate with accumulator	Yes	Building collections
lazy	Enable lazy evaluation	N/A	Infinite sequences
take	Take first N elements	Works with lazy	Limiting results
drop	Skip first N elements	Works with lazy	Skipping elements
zip	Combine arrays	Yes	Parallel iteration

Closure and Binding Behavior

Concept	Behavior	Example Impact
Variable capture	Functions close over surrounding scope	Changes to outer variables affect closure
Binding object	Tracks execution context	Access to self and local variables
Return value	Lambdas return from themselves only	Safe to use in nested contexts
Proc return	Returns from enclosing method	May cause unexpected early returns
Block return	N/A (controlled by method)	Cannot use explicit return

Performance Characteristics

Operation	Memory	Speed	When to Avoid
Map chain	Creates intermediate arrays	Fast for small collections	Large datasets needing efficiency
Lazy evaluation	Minimal until consumed	Slower per element	When all elements needed
Reduce	Single accumulator	Fastest for aggregation	When intermediate results needed
Block invocation	No allocation	Fastest	When storing functions needed
Proc/Lambda call	Object allocation	Slightly slower	Tight loops, millions of calls

Currying and Partial Application

Technique	Method	Arguments	Result
Manual curry	Create wrapper lambda	Fixed + variable	New lambda with fewer params
Automatic curry	.curry method	All then partial	Returns lambda waiting for remaining args
Argument order	Place fixed args first	N/A	Enables partial application
Full application	Call with all args	All required	Returns final result

Higher-Order Functions