Re: [Haskell-cafe] Control.Concurrent.forkIO versus Control.Parallel.par

Mario Blazevic Mon, 28 Jul 2008 07:30:20 -0700

Sterling Clover wrote:

I think a better way to look at it is that Haskell has two separatemechanisms for different *notions* of concurrency -- forkIO for actualconcurrent computation which needs explicit threads and communication(and within that, either semaphore-based communication with MVars ortransactional control with TVars and STM), and par for parallelism whichis to express computations that are innately parallel. See, e.g. the GHCusers manual which defines them as such:
...

Yes, I do understand the distinction. My problem is that I'm working ona new concurrency mechanism, in the form of a monad transformer. Itshould allow user to specify that particular monadic computation shouldbe run in parallel. It appears that will be possible only if theunderlying monad is IO, because I can't get par to work.

In any case, I suspect that your second parallelize function doesn'twork right because \x -> x >>= return is an effective no-op, modulostrictness characteristics of >>=. And in any case, it can't beevaluated until it is called in a particular monadic "environment" whichis provided, sequencing and all, via liftM2. One can't parallelize in anarbitrary monad in any case, at least without making a number ofdecisions. E.g., what's the resultant state after two parallelcomputations are run in a state monad?

I see the problem now, thanks. I wonder if it would make sense to add anew defaulted method to Monad class, perhaps a variant of the existingsequence


parallelSequence :: [m a] -> m [a]
parallelSequence = sequence

Then monads that have a way of forking and recombining parallelcomputations could override the method.

So if you're using concurrency with a monad transformer, you probablymight want to start by stripping back the layers of the concurrent partof your algorithm to the minimum possible, and then explicitly managingpassing state into the various forked computations, which can then bewrapped in as many runReaderT or such calls as necessary.

I don't have any state to pass, the question is simply whether twomonadic values can be run in parallel and then recombined. I can see whythat's impossible for State, Cont, and probably some other monads.

On another, general, note, unless you're very careful, mixing IO intoyour algorithm will probably result in very underperformant parallelcode, since it will be IO rather than processor bound.

I know, the idea was to let the user control which concurrentcomputations should be run in parallel, if resources allow.

On Jul 27, 2008, at 10:49 PM, Mario Blažević wrote:
Hello. I have a question about parallel computation in Haskell.After browsing the GHC library documentation, I was left withimpression that there are two separate mechanisms for expressingconcurrency: Control.Parallel.par for pure computations andControl.Concurrent.forkIO for computations in IO monad.
This dichotomy becomes a problem when one tries to use concurrencyfrom a monad transformer, though I'm sure that's not the only suchsituation. One cannot assume that the base monad is IO so forkIOcannot be used, while Control.Parallel.par won't run monads. My firstsolution was to replace the base monad class for the monad transformerby the following ParallelizableMonad class:
----------------------------------------------------------------------------
class Monad m => ParallelizableMonad m where
   parallelize :: m a -> m b -> m (a, b)
   parallelize ma mb = do a <- ma
                          b <- mb
                          return (a, b)

instance ParallelizableMonad Identity where
parallelize (Identity a) (Identity b) = Identity (a `par` (b `pseq`(a, b)))
instance ParallelizableMonad IO where
   parallelize ma mb = do va <- newEmptyMVar
                          vb <- newEmptyMVar
                          forkIO (ma >>= putMVar va)
                          forkIO (mb >>= putMVar vb)
                          a <- takeMVar va
                          b <- takeMVar vb
                          return (a, b)
----------------------------------------------------------------------------
I tested this solution, and it worked for IO computations in the sensethat they used both CPUs. The test also ran slower on two CPUs that onone, but that's beside the point.
Then I realized that par can, in fact, be used on any monad, it justneeds a little nudge:
----------------------------------------------------------------------------
parallelize :: m a -> m b -> m (a, b)
parallelize ma mb = let a = ma >>= return
                        b = mb >>= return
                    in a `par` (b `pseq` liftM2 (,) a b)
----------------------------------------------------------------------------
However, in this version the IO monadic computations still appear touse only one CPU. I cannot get par to parallelize monadiccomputations. I've used the same command-line options in bothexamples: -O -threaded and +RTS -N2. What am I missing?
_______________________________________________
Haskell-Cafe mailing list
[email protected]
http://www.haskell.org/mailman/listinfo/haskell-cafe



--
Mario Blazevic
[EMAIL PROTECTED]
Stilo Corporation

This message, including any attachments, is for the sole use of the
intended recipient(s) and may contain confidential and privileged
information. Any unauthorized review, use, disclosure, copying, or
distribution is strictly prohibited. If you are not the intended
recipient(s) please contact the sender by reply email and destroy
all copies of the original message and any attachments.
_______________________________________________
Haskell-Cafe mailing list
[email protected]
http://www.haskell.org/mailman/listinfo/haskell-cafe

Re: [Haskell-cafe] Control.Concurrent.forkIO versus Control.Parallel.par

Reply via email to