-
Notifications
You must be signed in to change notification settings - Fork 49
/
Copy path6-02-computations-ml.Rmd
187 lines (140 loc) Β· 5.65 KB
/
6-02-computations-ml.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
# Machine Learning {#v8-ml}
In this chapter, we build a package that, via V8, wraps [ml.js](https://github.com/mljs/ml), a library that brings machine learning to JavaScript. It covers quite a few models; we only include one: the linear regression. This is an interesting example because it reveals some proceedings that one is likely to run into when creating packages using V8.
```js
const x = [0.5, 1, 1.5, 2, 2.5];
const y = [0, 1, 2, 3, 4];
const regression = new ml.SimpleLinearRegression(x, y);
```
## Dependency {#v8-ml-dependency}
We start by creating a package and add the V8 package as dependency\index{dependency}.
```r
usethis::create_package("ml")
usethis::use_package("V8")
```
Then we create the `inst` directory in which we place the ml.js file downloaded from the CDN.
```r
dir.create("inst")
download.file(
"https://www.lactame.com/lib/ml/4.0.0/ml.min.js",
"inst/ml.min.js"
)
```
With the dependency\index{dependency} downloaded, one can start working on the R code. First, a new V8 context needs to be created and the ml.js file needs to be imported into it.
```r
# zzz.R
ml <- NULL
.onLoad <- function(libname, pkgname){
ml <<- V8::v8()
mljs <- system.file("ml.min.js", package = "ml")
ml$source(mljs)
}
```
## Simple Regression {#v8-ml-regression}
The ["simple linear regression"](https://github.com/mljs/regression-simple-linear) consists of a simple function that takes two arrays. We can thus create a function that takes two vectors, `x`, and `y`, and runs the regression.
```r
#' @export
ml_simple_lm <- function(y, x){
# assign x and y
ml$assign("x", x)
ml$assign("y", y)
# run regression
ml$eval(
"var regression = new ML.SimpleLinearRegression(x, y);"
)
# return results
ml$get("regression")
}
```
Then we can document and load the model to the function.
```r
ml_simple_lm(cars$speed, cars$dist)
## $name
## [1] "simpleLinearRegression"
##
## $slope
## [1] 0.1655676
##
## $intercept
## [1] 8.283906
```
This works but has a few issues, namely running two or more regression internally will override the variable `regression` in the V8 context. Let us demonstrate by implementing a function to predict.
```r
#' @export
ml_predict <- function(x){
ml$call("regression.predict", x)
}
```
We then document and load the functions to run two regressions in a row then observe the issue. Unlike R, the model we created only exists in JavaScript, unlike the `lm`, the function `ml_simple_lm` does not return the model. Therefore, `ml_simple_lm` does not distinguish between models, unlike `predict`, which takes the model as the first argument and runs the prediction on it.
This implementation of ml.js is indeed very dangerous.
```r
# first model
ml_simple_lm(cars$speed, cars$dist)
ml_predict(15)
## 25.18405
# overrides model
ml_simple_lm(1:10, runif(10))
# produces different predictions
ml_predict(15)
## 10.76742
```
The package ml currently under construction should emulate R in that respect; the `ml_simple_lm` should return the model, which should be usable with the `predict` function. In order to do so, we are going to need to track regressions internally in V8 so the `ml_simple_lm` returns a proxy of the model that `predict` can use to predict on the intended model.
In order to track and store regressions internally, we are going to declare an empty array when the package is loaded.
```r
# zzz.R
ml <- NULL
.onLoad <- function(libname, pkgname){
ml <<- V8::v8()
mljs <- system.file("ml.min.js", package = "ml")
ml$source(mljs)
ml$eval("var regressions = [];")
}
```
Then, one can track regressions by creating an R object, which is incremented every time `ml_simple_lm` runs; this can be used as a variable name in the JavaScript `regressions` array declare in `.onLoad`. This variable name must be stored in the object we intend to return so the `predict` method we will create later on can access the model and run predictions. Finally, in order to declare a new method on the `predict` function, we need to return an object bearing a unique class. Below we use `mlSimpleRegression`.
```r
counter <- new.env(parent = emptyenv())
counter$regressions <- 0
#' @export
ml_simple_lm <- function(y, x){
counter$regressions <- counter$regressions + 1
# assign variables
ml$assign("x", x)
ml$assign("y", y)
# address
address <- paste0("regressions['", counter$regressions, "']")
# create regression
code <- paste0(
address, " = new ML.SimpleLinearRegression(x, y);"
)
ml$eval(code)
# retrieve and append address
regression <- ml$get(address)
regression$address <- address
# create object of new class
structure(
regression,
class = c("mlSimpleRegression", class(regression))
)
}
```
Then one can implement the `predict` method for `mlSimpleRegression`. The function uses the `address` of the model to run the JavaScript `predict` method on that object.
```r
#' @export
predict.mlSimpleRegression <- function(object, newdata, ...){
code <- paste0(object$address, ".predict")
ml$call(code, newdata)
}
```
We can then build and load the package to it in action.
```{r}
library(ml)
# first model
model_cars <- ml_simple_lm(cars$speed, cars$dist)
# second model
model_random <- ml_simple_lm(1:10, runif(10))
predict(model_random, 15)
predict(model_cars, 15)
```
## Exercises {#v8-ml-exercises}
There are too many great JavaScript libraries that would be great additions to the R ecosystem but perhaps try and integrate one of these below. They are simple yet exciting and thus provide ideal first forays into what this part of the book explained.
- [chance.js](https://github.com/chancejs/chancejs) - random generator
- [currency.js](https://github.com/scurker/currency.js) - helpers to work with currencies