mw-parser 0.1.0-SNAPSHOT
Parser for production rules for MicroWorld engine
Parse multiple rules from a stream, possibly a file, although the real objective is to parse rules out of a block of text from a textarea.
(ns mw-parser.bulk
  (:use mw-parser.core
        mw-engine.utils
        clojure.java.io
        [clojure.string :only [split trim]])
  (:import (java.io BufferedReader StringReader)))
Is this line a comment?
(defn comment?
  [line]
  (or (empty? (trim line)) (member? (first line) '(nil \# \;))))
Parse rules from successive lines in this string.
(defn parse-string
  [string]
  ;; TODO: tried to do this using with-open, but couldn't make it work.
  (map parse-rule (remove comment? (split string #"\n"))))
Parse rules from successive lines in the file loaded from this filename.
(defn parse-file
  [filename]
  (parse-string (slurp filename)))
Compile each non-comment line of this string.
(defn compile-string
  [string]
  (map #(compile-rule % true) (remove comment? (split string #"\n"))))
Compile each non-comment line of the file indicated by this filename.
(defn compile-file
  [filename]
  (compile-string (slurp filename)))
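As a rough illustration of how a block of text is reduced to rule lines before parsing, here is a minimal, dependency-free sketch. The name comment-line? and the sample text are hypothetical; a set lookup stands in for member? from mw-engine.utils, and blank? stands in for the trim-and-empty? test.

```clojure
(require '[clojure.string :as string])

;; Hypothetical stand-in for comment?: a line is skipped when it is
;; blank or when its first character is # or ;
(defn comment-line?
  [line]
  (or (string/blank? line)
      (contains? #{\# \;} (first line))))

;; Sample text, as might arrive from a textarea (an assumption).
(def rules-text
  "# a comment\n\nif state is forest then state should be climax\n; another comment")

;; Split on newlines and keep only the lines that should be parsed.
(def rule-lines
  (remove comment-line? (string/split rules-text #"\n")))

(println rule-lines)
```

Only the one rule line survives; each surviving line would then be handed to parse-rule or compile-rule.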
A very simple parser which parses production rules of the form 'if [conditions] then [actions]' and generates rules in the form expected by the MicroWorld engine.
It is, as I say, very simple; it generates a complete rule, or it fails completely, returning nil. Very occasionally it generates a wrong rule, one which is not a correct translation of the rule semantics, but that is buggy behaviour, which I'll try to fix over the next few weeks, not a design fault. More significantly, it does not generate useful error messages on failure. This is, I think, a much more complex issue which I don't yet know how to address.
(ns mw-parser.core
  (:use mw-engine.utils
        [clojure.string :only [split trim triml]]))
(declare parse-conditions)
(declare parse-not-condition)
(declare parse-simple-condition)
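A regular expression which matches the string representation of non-negative numbers.
;; At least one digit is required, so the empty string and a bare "." do not match.
(def re-number #"^[0-9]+(?:\.[0-9]*)?$")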
Error thrown when an attempt is made to set a reserved property.
(def reserved-properties-error
  "The properties 'x' and 'y' of a cell are reserved and should not be set in rule actions")
Error thrown when a rule cannot be parsed.
(def bad-parse-error "I did not understand '%s'")
If this token appears to represent an explicit number, return that number; otherwise, make a keyword of it and return that.
(defn- keyword-or-numeric
  [token]
  (cond
    ;; test for a keyword first: re-matches cannot be applied to a keyword
    (keyword? token) token
    (re-matches re-number token) (read-string token)
    true (keyword token)))
Generally all functions in this file with names beginning 'parse-' take a sequence of tokens (and in some cases other optional arguments) and return a vector comprising the code fragment parsed from the front of the sequence, followed by the remaining tokens which were not consumed in constructing that fragment.
In every case, if the function cannot parse the desired construct from the front of the sequence of tokens, it returns nil.
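The convention described above can be seen in isolation with the simplest parser. This is a minimal sketch: digits-only is a simplified, hypothetical stand-in for re-number, and the token sequences are invented for illustration.

```clojure
;; Simplified, hypothetical stand-in for re-number: whole numbers only.
(def digits-only #"[0-9]+")

;; Same shape as parse-numeric-value: consume one token from the front
;; and return [fragment remaining-tokens], or nil if it is not a number.
(defn parse-numeric-value
  [[value & remainder]]
  (if (re-matches digits-only value)
    [(read-string value) remainder]))

;; Success: the fragment is the number 6, the rest is left for the caller.
(def success (parse-numeric-value ["6" "neighbours" "are" "forest"]))

;; Failure: "six" is not numeric, so the result is nil.
(def failure (parse-numeric-value ["six" "neighbours"]))

(println success failure)
```

Because failure is plain nil, callers can chain alternative parsers with or and take the first one that succeeds.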
Parse a number.
(defn parse-numeric-value
  [[value & remainder]]
  (if (re-matches re-number value)
    [(read-string value) remainder]))
Parse a token assumed to be the name of a property of the current cell, whose value is assumed to be an integer.
(defn parse-property-int
  [[value & remainder]]
  (if value
    [(list 'get-int 'cell (keyword value)) remainder]))
Parse a token assumed to be the name of a property of the current cell.
(defn parse-property-value
  [[value & remainder]]
  (if value
    [(list (keyword value) 'cell) remainder]))
Parse a token assumed to be a simple token value.
(defn parse-token-value
  [[value & remainder]]
  (if value
    [(keyword value) remainder]))
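Parse a value from the first of these tokens. If expect-int is true, a non-numeric token is treated as the name of an integer-valued property of the current cell; otherwise it is treated as a keyword.
(defn parse-simple-value
  ([tokens expect-int]
   (or
     (parse-numeric-value tokens)
     (cond expect-int
       (parse-property-int tokens)
       true (parse-token-value tokens))))
  ([tokens]
   (parse-simple-value tokens false)))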
Parse a single value from this single token and return just the generated code, not a pair.
(defn gen-token-value
  [token expect-int]
  (first (parse-simple-value (list token) expect-int)))
Parse a list of values from among these tokens.
(defn parse-disjunct-value
  [[OR token & tokens] expect-int]
  (cond (member? OR '("or" "in"))
    (let [value (first (parse-simple-value (list token) expect-int))
          seek-others (= (first tokens) "or")]
      (cond seek-others
        (let [[others remainder] (parse-disjunct-value tokens expect-int)]
          [(cons value others) remainder])
        true
        [(list value) tokens]))))
Parse a value from among these tokens.
(defn parse-value
  ([tokens expect-int]
   (or
     (parse-disjunct-value tokens expect-int)
     (parse-simple-value tokens expect-int)))
  ([tokens]
   (parse-value tokens false)))
Parse a condition of the form '[property] is in [value] or [value]...'.
(defn parse-member-condition
  [[property IS IN & rest]]
  (if (and (member? IS '("is" "are")) (= IN "in"))
    (let [[l remainder] (parse-disjunct-value (cons "in" rest) false)]
      [(list 'member? (list (keyword property) 'cell) (list 'quote l)) remainder])))
Parse '[property] is less than [value]'.
(defn- parse-less-condition
  [[property IS LESS THAN & rest]]
  (cond (and (member? IS '("is" "are")) (member? LESS '("less" "fewer")) (= THAN "than"))
    (let [[value remainder] (parse-value rest true)]
      [(list '< (list 'get-int 'cell (keyword property)) value) remainder])))
Parse '[property] is more than [value]'.
(defn- parse-more-condition
  [[property IS MORE THAN & rest]]
  (cond (and (member? IS '("is" "are")) (member? MORE '("more" "greater")) (= THAN "than"))
    (let [[value remainder] (parse-value rest true)]
      [(list '> (list 'get-int 'cell (keyword property)) value) remainder])))
Parse '[property] is between [value] and [value]'; the two values may be given in either order.
(defn- parse-between-condition
  [[p IS BETWEEN v1 AND v2 & rest]]
  (cond (and (member? IS '("is" "are")) (= BETWEEN "between") (= AND "and") (not (nil? v2)))
    (let [property (first (parse-simple-value (list p) true))
          value1 (first (parse-simple-value (list v1) true))
          value2 (first (parse-simple-value (list v2) true))]
      [(list 'or
             (list '< value1 property value2)
             (list '> value1 property value2)) rest])))
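The generated 'between' test is an or of two chained comparisons, which is what lets the bounds appear in either order. A minimal hand-written equivalent, where the function name between? is purely illustrative:

```clojure
;; Hand-written equivalent of the form built for 'between': true when p
;; lies strictly between v1 and v2, whichever of the two is larger.
(defn between?
  [v1 p v2]
  (or (< v1 p v2)
      (> v1 p v2)))

(println (between? 10 15 20)   ; bounds ascending
         (between? 20 15 10)   ; bounds descending
         (between? 10 10 20))  ; equal to a bound: not between
```

Note that both comparisons are strict, so a value equal to either bound does not satisfy the condition.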
Parse clauses of the form 'x is y', 'x is in y or z...', 'x is between y and z', 'x is more than y' or 'x is less than y'. It is necessary to disambiguate whether the value is numeric or a keyword.
(defn- parse-is-condition
  [[property IS value & rest]]
  (cond
    (member? IS '("is" "are"))
    (cond
      ;; guard against a missing value before applying the regular expression
      (and value (re-matches re-number value)) [(list '= (list 'get-int 'cell (keyword property)) (read-string value)) rest]
      value [(list '= (list (keyword property) 'cell) (keyword value)) rest])))
Parse the negation of a simple condition.
(defn- parse-not-condition
  [[property IS NOT & rest]]
  (cond (and (member? IS '("is" "are")) (= NOT "not"))
    (let [partial (parse-simple-condition (cons property (cons "is" rest)))]
      (cond partial
        (let [[condition remainder] partial]
          [(list 'not condition) remainder])))))
(defn- gen-neighbours-condition
  ([comp1 quantity property value remainder comp2 distance]
   [(list comp1
          (list 'count
                (list 'get-neighbours-with-property-value 'world
                      '(cell :x) '(cell :y) distance
                      (keyword property) (keyword-or-numeric value) comp2))
          quantity)
    remainder])
  ([comp1 quantity property value remainder comp2]
   (gen-neighbours-condition comp1 quantity property value remainder comp2 1)))
Parse conditions of the form '...more than 6 neighbours are [condition]'.
(defn parse-comparator-neighbours-condition
  [[MORE THAN n NEIGHBOURS WITHIN distance have-or-are & rest]]
  (let [quantity (first (parse-numeric-value (list n)))
        comparator (cond (= MORE "more") '>
                         (member? MORE '("fewer" "less")) '<)]
    (cond
      (not (= WITHIN "within"))
      (parse-comparator-neighbours-condition
        (flatten
          ;; two tokens were mis-parsed as 'within distance' that weren't
          ;; actually 'within' and a distance. Splice in 'within 1' and try
          ;; again.
          (list MORE THAN n NEIGHBOURS "within" "1" WITHIN distance have-or-are rest)))
      (and quantity
           comparator
           (= THAN "than")
           (= NEIGHBOURS "neighbours"))
      (cond
        (= have-or-are "are")
        (let [[value & remainder] rest
              dist (gen-token-value distance true)]
          (gen-neighbours-condition comparator quantity :state value remainder = dist))
        (= have-or-are "have")
        (let [[property comp1 comp2 value & remainder] rest
              dist (gen-token-value distance true)]
          (cond (and (= comp1 "equal") (= comp2 "to"))
            (gen-neighbours-condition comparator quantity property
                                      value remainder = dist)
            (and (= comp1 "more") (= comp2 "than"))
            (gen-neighbours-condition comparator quantity property
                                      value remainder > dist)
            (and (= comp1 "less") (= comp2 "than"))
            (gen-neighbours-condition comparator quantity property
                                      value remainder < dist)))))))
(defn parse-some-neighbours-condition
  [[SOME NEIGHBOURS & rest]]
  (cond
    (and (= SOME "some") (= NEIGHBOURS "neighbours"))
    (parse-comparator-neighbours-condition (concat '("more" "than" "0" "neighbours") rest))))
Parse conditions of the form '...6 neighbours are [condition]'.
(defn parse-simple-neighbours-condition
  [[n NEIGHBOURS WITHIN distance have-or-are & rest]]
  (let [quantity (first (parse-numeric-value (list n)))]
    (cond
      (and quantity (= NEIGHBOURS "neighbours"))
      (cond
        (not (= WITHIN "within"))
        (parse-simple-neighbours-condition
          (flatten
            ;; two tokens were mis-parsed as 'within distance' that weren't
            ;; actually 'within' and a distance. Splice in 'within 1' and try
            ;; again.
            (list n NEIGHBOURS "within" "1" WITHIN distance have-or-are rest)))
        (= have-or-are "are")
        (let [[value & remainder] rest
              dist (gen-token-value distance true)]
          (gen-neighbours-condition '= quantity :state value remainder = dist))
        (= have-or-are "have")
        (let [[property comp1 comp2 value & remainder] rest
              dist (gen-token-value distance true)]
          (cond (and (= comp1 "equal") (= comp2 "to"))
            (gen-neighbours-condition '= quantity property value remainder =
                                      dist)
            (and (= comp1 "more") (= comp2 "than"))
            (gen-neighbours-condition '= quantity property value remainder >
                                      dist)
            (and (= comp1 "less") (= comp2 "than"))
            (gen-neighbours-condition '= quantity property value remainder <
                                      dist)))))))
Parse conditions referring to neighbours.
(defn parse-neighbours-condition
  [tokens]
  (or
    (parse-simple-neighbours-condition tokens)
    (parse-comparator-neighbours-condition tokens)
    (parse-some-neighbours-condition tokens)))
Parse conditions of the form '[property] [comparison] [value]'.
(defn parse-simple-condition
  [tokens]
  (or
    (parse-neighbours-condition tokens)
    (parse-member-condition tokens)
    (parse-not-condition tokens)
    (parse-less-condition tokens)
    (parse-more-condition tokens)
    (parse-between-condition tokens)
    (parse-is-condition tokens)))
Parse '... or [condition]' from tokens, where left is the already parsed first disjunct.
(defn- parse-disjunction-condition
  [left tokens]
  (let [partial (parse-conditions tokens)]
    (if partial
      (let [[right remainder] partial]
        [(list 'or left right) remainder]))))
Parse '... and [condition]' from tokens, where left is the already parsed first conjunct.
(defn- parse-conjunction-condition
  [left tokens]
  (let [partial (parse-conditions tokens)]
    (if partial
      (let [[right remainder] partial]
        [(list 'and left right) remainder]))))
Parse conditions from tokens, where conditions may be linked by either 'and' or 'or'.
(defn- parse-conditions
  [tokens]
  (let [partial (parse-simple-condition tokens)]
    (if partial
      (let [[left [next & remainder]] partial]
        (cond
          (= next "and") (parse-conjunction-condition left remainder)
          (= next "or") (parse-disjunction-condition left remainder)
          true partial)))))
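One consequence of this mutual recursion is that chained conditions nest to the right, with no precedence between 'and' and 'or'. The toy sketch below reproduces only that control flow; parse-atom, parse-joined and parse-conds are hypothetical names, and each 'condition' is reduced to a single token.

```clojure
;; Toy stand-in for parse-simple-condition: consume exactly one token
;; and treat it as an already-parsed condition.
(defn parse-atom
  [[token & remainder]]
  (if token [(symbol token) remainder]))

(declare parse-conds)

;; Parse whatever follows a connective and join it to left with op.
(defn parse-joined
  [op left tokens]
  (if-let [[right remainder] (parse-conds tokens)]
    [(list op left right) remainder]))

;; Same control flow as parse-conditions: parse one condition, then look
;; at the next token to decide whether to recurse.
(defn parse-conds
  [tokens]
  (if-let [[left [nxt & remainder] :as partial] (parse-atom tokens)]
    (cond
      (= nxt "and") (parse-joined 'and left remainder)
      (= nxt "or") (parse-joined 'or left remainder)
      :else partial)))

(println (parse-conds ["a" "and" "b" "or" "c" "then" "x"]))
```

Under these assumptions 'a and b or c' comes out as (and a (or b c)), with the unconsumed 'then x' returned as the remainder.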
Parse the left hand side ('if...') of a production rule.
(defn- parse-left-hand-side
  [[IF & tokens]]
  (if (= IF "if")
    (parse-conditions tokens)))
Parse actions of the form '[property] should be [property] [arithmetic-operator] [value]', e.g. 'fertility should be fertility + 1', or 'deer should be deer - wolves'.
(defn- parse-arithmetic-action
  [previous [prop1 SHOULD BE prop2 operator value & rest]]
  (cond
    ;; prop1 is the property being set, so it is the one that must not be reserved
    (member? prop1 '("x" "y"))
    (throw
      (Exception. reserved-properties-error))
    (and (= SHOULD "should")
         (= BE "be")
         (member? operator '("+" "-" "*" "/")))
    [(list 'merge (or previous 'cell)
           {(keyword prop1) (list 'int
                                  (list (symbol operator) (list 'get-int 'cell (keyword prop2))
                                        (cond
                                          (re-matches re-number value) (read-string value)
                                          true (list 'get-int 'cell (keyword value)))))}) rest]))
Parse actions of the form '[property] should be [value].'
(defn- parse-set-action
  [previous [property SHOULD BE value & rest]]
  (cond
    (member? property '("x" "y"))
    (throw
      (Exception. reserved-properties-error))
    (and (= SHOULD "should") (= BE "be"))
    [(list 'merge (or previous 'cell)
           {(keyword property) (cond (re-matches re-number value) (read-string value) true (keyword value))}) rest]))
(defn- parse-simple-action
  [previous tokens]
  (or (parse-arithmetic-action previous tokens)
      (parse-set-action previous tokens)))
Parse actions from tokens.
(defn- parse-actions
  [previous tokens]
  (let [[left remainder] (parse-simple-action previous tokens)]
    (cond left
      (cond (= (first remainder) "and")
        (parse-actions left (rest remainder))
        true (list left)))))
Parse a probability of an action, of the form '[n] chance in [m] [actions]', from this collection of tokens.
(defn- parse-probability
  [previous [n CHANCE IN m & tokens]]
  (cond
    (and (= CHANCE "chance") (= IN "in"))
    (let [[action remainder] (parse-actions previous tokens)]
      (cond action
        [(list 'cond
               (list '<
                     (list 'rand
                           (first (parse-simple-value (list m) true)))
                     (first (parse-simple-value (list n) true)))
               action) remainder]))))
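The generated test compares (rand m) with n. Since rand returns a value drawn uniformly from zero (inclusive) up to m (exclusive), the action should fire with probability n/m. A minimal sketch of that test on its own; the names chance?, trials and hits are illustrative only.

```clojure
;; Hand-written equivalent of the test generated for 'n chance in m'.
(defn chance?
  [n m]
  (< (rand m) n))

;; Estimate how often '1 chance in 5' fires over many trials; the
;; printed frequency should be somewhere near 0.2.
(def trials 10000)
(def hits (count (filter true? (repeatedly trials #(chance? 1 5)))))

(println (double (/ hits trials)))
```

When the test fails, the generated cond has no other clause, so the rule function returns nil.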
Parse the right hand side ('then...') of a production rule.
(defn- parse-right-hand-side
  [[THEN & tokens]]
  (if (= THEN "then")
    (or
      (parse-probability nil tokens)
      (parse-actions nil tokens))))
Parse a complete rule from this line, expected to be either a string or a sequence of string tokens. Throws an exception if parsing fails.
(defn parse-rule
  [line]
  (cond
    (string? line)
    (let [rule (parse-rule (split (triml line) #"\s+"))]
      (cond rule rule
        true (throw (Exception. (format bad-parse-error line)))))
    true
    (let [[left remainder] (parse-left-hand-side line)
          [right junk] (parse-right-hand-side remainder)]
      (cond
        ;; there should be a valid left hand side and a valid right hand side
        ;; there shouldn't be anything left over (junk should be empty)
        (and left right (empty? junk))
        (list 'fn ['cell 'world] (list 'if left right))))))
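To make the output shape concrete, the sketch below writes out by hand the form that, reading the code above, parse-rule would appear to build for 'if state is forest then state should be climax', and evaluates it. The form is an inference from the code rather than captured output, and the sample cells are invented.

```clojure
;; Hand-written copy of the kind of form the parser builds: a function
;; of the cell and the world which returns a modified cell, or nil.
(def rule-form
  '(fn [cell world]
     (if (= (:state cell) :forest)
       (merge cell {:state :climax}))))

;; compile-rule does essentially this step: eval the form into a function.
(def rule (eval rule-form))

;; The rule fires for a forest cell and returns nil for any other cell.
(println (rule {:state :forest :x 0 :y 0} nil))
(println (rule {:state :heath :x 0 :y 0} nil))
```

A rule which does not apply to a cell returns nil, since the generated if has no else branch.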
Parse this rule-text, a string, and compile it into an anonymous function. If return-tuple? is true, return a list of that function and the trimmed rule text. Throws an exception if parsing fails.
(defn compile-rule
  ([rule-text return-tuple?]
   (do
     (use 'mw-engine.utils)
     (let [afn (eval (parse-rule rule-text))]
       (cond
         (and afn return-tuple?) (list afn (trim rule-text))
         true afn))))
  ([rule-text]
   (compile-rule rule-text false)))