This is what an ideal and simple versioning spec should look like to
me.  (Not the form, but the concept).  I'm posting this here so it
could be used as an added reference to anyone that would consider
revising the current specification.

Note: Assigning default values can be bypassed depending on the implementation.

Comments are welcome.
----------------------------------------
1 Version Specification

A version is composed of four component sets: the base part, the
stage, the patch and the revision.

Each component is composed mainly of version nodes to represent their
level values.

When a component is not specified, it gets a default node value of {0}.

1.1 Version Nodes

Nodes start basically with a number and then is optionally followed by
a set of letters.

Numbers and letters can coexist alternatingly to represent a single
node.  The number of consecutive letters is also not limited to 1.
For example: "1a4xy" is allowed (can be restricted).

Each set of digits represents a single decimal number.  Leading zeros
hold no meaning.

The numerical equivalent of a set of letters is calculated in base of
27 (0 exists along with it but is not included as a symbol).

Version nodes after processing are basically just an array of signed
integers where each set of digits or letters are converted to their
numerical value.

1.2 Version Parts

1.2.1 The Base Part

The base version is simply a set of version nodes that are separated
by dots.  Examples: "4.1.2", "4.1.2aa" "4a", "4a.1", "4a.1.2".

After processing, the base version is an array of version nodes.

It is required.

1.2.2 The Stage Part

The stage part starts with _alpha, _beta, _pre or _rc.  Their
numerical values are -4, -3, -2 or -1 respectively.  Each of them can
optionally be followed by a version node string.  For example:
"_alpha01".

The resulting value for the stage after processing is a single version
node where the first value in it is the numerical values of _alpha,
_beta, _pre or _rc, and the other values are based on the added node
string.

A version without a stage has a default stage value of {0}.

The stage part is optional and can be specified only once after the
base part.  It can't be specified as a modifier for the patch, for the
revision, or for another stage.

1.2.3 The Patch Part

The patch part begins with _p and is followed by a version node
string.  For example: "_p20150105a".

The patch part is optional and can be specified after the base part or
after the stage part, but not after the revision.  It can only be
specified once.

It is processed as a single version node based on its version node string.

A version without a patch has a default patch value of {0}.

1.2.4 The Revision

The revision starts with -r and is followed by a number.

It is processed as a single version node based on its version node string.

A version without a revision has a default revision value of {0}.

1.3 Comparing Versions

Versions are compared as version nodes and the algorithm is simple:
each component and subcomponent is compared from left to right.

Anything that gives a difference decides which version is greater or lesser.

Any non-existing version-node has a default value of {0}.

Any non-existing element of a version node has a default value of 0.

If no difference is found during the process, it would mean that both
versions are equal.

1.3.1 Concept Code

  #!/usr/bin/ruby

  class ::Array
    def adaptive_transpose
      h_size = self.max_by{ |a| a.size }.size
      v_size = self.size

      result = Array.new(h_size)
      with_index = self.each_with_index.to_a.freeze

      0.upto(h_size - 1) do |i|
        result[i] = Array.new(v_size)
        with_index.each{ |a, j| result[i][j] = a[i] }
      end

      result
    end
  end

  module Portage
    class PackageVersion
      class Node < ::Array
        def initialize(*values)
          self.concat(values)
        end

        def compare_with(another)
          [self, another].adaptive_transpose.each do |a, b|
            a ||= 0
            b ||= 0
            return -1 if a < b
            return 1 if a > b
          end

          return 0
        end

        def self.parse(*args)
          result = new

          args.each do |a|
            case a
            when Integer
              result << a
            when /^[[:digit:]][[:alnum:]]*$/
              a.scan(/[[:digit:]]+|[[:alpha:]]+/).each_with_index.map do |b, i|
                if i.even?
                  str = b.gsub(/^0+/, '')
                  result << (str.empty? ? 0 : Integer(str))
                else
                  value = 0

                  b.downcase.bytes.reverse.each_with_index do |c, i|
                    value += 27 ** i * (c - 96)  ## a == 1, z == 26,
and 0 exists but is not used
                  end

                  result << value
                end
              end
            else
              raise ArgumentError.new("Invalid node string: #{a.inspect}")
            end
          end

          result
        end

        def self.zero
          @zero ||= new(0)
        end

        private_class_method :new, :allocate
      end

      attr_accessor :base, :stage, :patch, :revision

      def initialize(base, stage, patch, revision)
        @base, @stage, @patch, @revision = base, stage, patch, revision
      end

      def compare_with(another)
        [self.base, another.base].adaptive_transpose.each do |a, b|
          a ||= Node.zero
          b ||= Node.zero
          r = a.compare_with(b)
          return r unless r == 0
        end

        r = self.stage.compare_with(another.stage)
        return r unless r == 0

        r = self.patch.compare_with(another.patch)
        return r unless r == 0

        r = self.revision.compare_with(another.revision)
        return r unless r == 0

        return 0
      end

      STAGES = { 'alpha' => -4, 'beta' => -3, 'pre' => -2, 'rc' => -1 }
      REGEX = 
/^([[:digit:]][[:alnum:]]*(?:[.][[:alnum:]]+)*)?(?:_(alpha|beta|pre|rc)([[:digit:]][[:alnum:]]*)?)?(?:_p([[:digit:]][[:alnum:]]*))?(?:-r([[:digit:]]+))?(.+)?$/m

      def self.parse(version_string)
        __, base, stage, stage_ver, patch, revision, extra =
version_string.match(REGEX).to_a
        raise_invalid_version_string(version_string) if extra

        begin
          base = base.split('.').map{ |e| Node.parse(e) }
          stage = stage ? stage_ver ? Node.parse(STAGES[stage],
stage_ver) : Node.parse(STAGES[stage]) : Node.zero
          patch = patch ? Node.parse(patch) : Node.zero
          revision = revision ? Node.parse(revision) : Node.zero
        rescue ArgumentError => e
          raise_invalid_version_string("#{version_string}: #{e}")
        end

        new(base, stage, patch, revision)
      end

      def self.raise_invalid_version_string(version_string)
        raise ArgumentError.new("Invalid version string: #{version_string}")
      end

      private_class_method :new, :allocate, :raise_invalid_version_string
    end
  end

  samples = [
    ["0", "0.01"],
    ["0.01", "0.010"],
    ["0.09", "0.090"],
    ["0.10", "0.100"],
    ["0.99", "0.990"],
    ["0.100", "0.1000"],
    ["0.100", "0.100"],
    ["0.1", "0.1.1"],
    ["0.1.1", "0.1a"],
    ["0.1a", "0.2"],
    ["0.2", "1"],
    ["1", "1.0"],
    ["1.0", "1.0_alpha"],
    ["1.0_alpha", "1.0_alpha01"],
    ["1.0_alpha01", "1.0_alpha01-r1"],
    ["1.0_alpha01-r1", "1.0_alpha01_p20150105"],
    ["1.0_alpha01_p20150105", "1.0_alpha01_p20150105-r1"],
    ["1.0_alpha01", "1.0_beta"],
    ["1.0_beta", "1.0_beta01"],
    ["1.0_beta01", "1.0_pre01"],
    ["1.0_pre01", "1.0_rc01"],
    ["1.0_rc01", "1.0"],
    ["1.0", "1.0-r1"],
    ["1.0-r1", "1.0_p20150105"],
    ["1.0_p20150105", "1.0_p20150105-r1"]
  ]

  samples.each do |a, b|
    x = Portage::PackageVersion.parse(a)
    y = Portage::PackageVersion.parse(b)
    r = x.compare_with(y)
    r = r < 0 ? '<' : r > 0 ? '>' : '=='
    puts "#{a}    #{r}    #{b}"
  end

1.3.2 Concept Code Output

  0    <    0.01
  0.01    <    0.010
  0.09    <    0.090
  0.10    <    0.100
  0.99    <    0.990
  0.100    <    0.1000
  0.100    ==    0.100
  0.1    <    0.1.1
  0.1.1    <    0.1a
  0.1a    <    0.2
  0.2    <    1
  1    ==    1.0
  1.0    >    1.0_alpha
  1.0_alpha    <    1.0_alpha01
  1.0_alpha01    <    1.0_alpha01-r1
  1.0_alpha01-r1    <    1.0_alpha01_p20150105
  1.0_alpha01_p20150105    <    1.0_alpha01_p20150105-r1
  1.0_alpha01    <    1.0_beta
  1.0_beta    <    1.0_beta01
  1.0_beta01    <    1.0_pre01
  1.0_pre01    <    1.0_rc01
  1.0_rc01    <    1.0
  1.0    <    1.0-r1
  1.0-r1    <    1.0_p20150105
  1.0_p20150105    <    1.0_p20150105-r1

Reply via email to