Handling Actor events & persisting state
During its runtime, the Actor receives Actor events sent by the Apify platform or generated by the Apify SDK itself.
Event types
Event | Data | Description |
---|---|---|
SYSTEM_INFO | { "createdAt": datetime, "cpuCurrentUsage": float, "memCurrentBytes": int, "isCpuOverloaded": bool } | This event is emitted regularly and it indicates the current resource usage of the Actor. TheisCpuOverloaded argument indicates whether the current CPU usage is higher than Config.max_used_cpu_ratio |
MIGRATING | None | Emitted when the Actor running on the Apify platform is going to be migrated to another worker server soon. You can use it to persist the state of the Actor so that once it is executed again on the new server, it doesn't have to start over from the beginning. |
ABORTING | None | When a user aborts an Actor run on the Apify platform,
they can choose to abort gracefully to allow the Actor some time before getting killed.
This graceful abort emits the |
PERSIST_STATE | { "isMigrating": bool } | Emitted in regular intervals (by default 60 seconds) to notify the Actor that it should persist its state, in order to avoid repeating all work when the Actor restarts. This event is also emitted automatically when the PERSIST_STATE event is provided merely for user convenience,
you can achieve the same effect by persisting the state regularly in an interval and listening for the migrating event. |
Adding handlers to events
To add handlers to these events, you use the Actor.on()
method,
and to remove them, you use the Actor.off()
method.
import asyncio
from apify import Actor, Event
async def main():
async with Actor:
total_items = 1000
# Load the state if it's saved from some previous execution
processed_items = 0
actor_state = await Actor.get_value('STATE')
if actor_state is not None:
processed_items = actor_state
# Save the state when the `PERSIST_STATE` event happens
async def save_state(event_data):
nonlocal processed_items
Actor.log.info('Saving Actor state', extra=event_data)
await Actor.set_value('STATE', processed_items)
Actor.on(Event.PERSIST_STATE, save_state)
# Do some fake work
for i in range(processed_items, total_items):
Actor.log.info(f'Processing item {i}...')
processed_items = i
await asyncio.sleep(0.1)
# Suppose we can stop saving the state now
Actor.off(Event.PERSIST_STATE, save_state)
# Do some more fake work, this time something that can't be restarted,
# so no point persisting the state
for j in range(0, 10):
Actor.log.info(f'Processing item {j} of another kind...')
await asyncio.sleep(1)