Re: reseting an iterator

norseman Thu, 21 May 2009 16:01:44 -0700

Terry Reedy wrote:

I will clarify by starting over with current definitions.
Ob is an iterator iff next(ob) either returns an object or raisesStopIteration and continues to raise StopIteration on subsequent calls.
Ob is an iterable iff iter(ob) raturns an iterator.
It is intentional that the protocol definitions be minimal, so that theycan used as widely as possible.
As a convenience, the definition of iterators is given a slightcomplication. They are defined as a subcategory of iterables, with therequirement that iter(iterator) be that same iterator. This means thatiterators need the following boilerplate:
  def __iter__(self): return self
The extra burden is slight since most iterators are based on builtins orgenerator functions or expressions, which add the boilerplateautomatically. The convenience is that one may write
def f(iterable_or_iterator):
  it = iter(iterable_or_iterator)
  ...

instead of

def f(iterable_or_iterator):
  if is_iterable(iterable_or_iterator):
    it = iter(iterable_or_iterator)
  else:
    it = iterable_or_iterator
In particular, the internal function that implements for loops can dothe former.
In other words, a small bit of boilerplate added to iterators, mostlyautomatically, saves boilerplate in the use of iterators and iterables.
When the protocols were defined, there was discussion about whether ornot to require 'continue to raise StopIteration'. For instance, aniterator that returns objects derived from external input might not haveany new external input now but expect to get some in the future. It wasdecided the such iterators should either wait and block the thread orreturn a 'Not now' indicator such as None. StopIteration shouldconsistently mean 'Done, over and out' so for loops, for instance, wouldknow to exit.
The OP proposes that StopIteraton should instead mean 'Done until


Done unless you put the data pointer back to offset zero

reset', without defining 'reset'.  Some comments:
* This would complicate the protocol.
* There are real use cases, and reiterability is a real issue.  But ...
* Depending on the meaning, resetting may or may not be possible.

* When it is possible, it can potentially be done today with a .send()method.

* Many use cases are easier with a new iterator.  For instance

for i in iterable: block1()
for i in iterable: block2()

is easier to write than

it = iter(iterable)
for i in it: block1()
it.reset()
for i in it: block2()

with little relative time saving in the second case, for practicalproblems, to compensate for the extra boilerplate.



while testing:
  for i in it:
    code
  it.reset()

Terry Jan Reedy


--
http://mail.python.org/mailman/listinfo/python-list

Re: reseting an iterator

Reply via email to