Databricks Runtime 4.3

Databricks released this image in August 2018.

The following release notes provide information about Databricks Runtime 4.3, powered by Apache Spark.

New Features

  • Databricks Delta
    • TRUNCATE TABLE command: Delete all rows from a table. Unlike its counterpart for Spark tables, Databricks Delta tables do not support deleting specific partitions. Also see TRUNCATE TABLE.
    • ALTER TABLE REPLACE COLUMNS command: Replace columns in a Databricks Delta table. It supports changing the comment of a column and reordering multiple columns. Also see ALTER TABLE.
    • FSCK REPAIR TABLE command: Remove the file entries from the transaction log of a Databricks Delta table that can no longer be found in the underlying file system. This can happen when these files have been manually deleted. Also see FSCK REPAIR TABLE.
    • Support for queries on stale Databricks Delta tables to improve the interactive query experience: Queries on Databricks Delta tables can now run on a stale version of the table when up to date results are not necessary. This feature reduces the latency of queries especially when underlying Databricks Delta tables are updated continuously through streams. Also see Improving Performance For Interactive Queries.
  • Structured Streaming
    • Scalable streaming write support for Azure SQL Data Warehouse Connector.
    • Support for foreachBatch() in Python (already available in Scala). See foreach and foreachBatch documentation for more details.
    • Support for choosing either the min or max watermark when there are multiple input streams in a query. Previously the minimum timestamp was always used. See the multiple watermark policy for more details.
    • Support for the LIMIT operator for streams in Append and Complete output modes. To minimize OOM errors on the driver, LIMIT is automatically applied when you use display() on unbounded streams.

Improvements

  • Databricks Delta
    • Private preview of new scalable implementation of MERGE INTO command that does not have the 10000 row insert limit. Contact support if you’d like to try this out.
    • Better performance and scalability of the OPTIMIZE command, especially on larger clusters.
    • The OPTIMIZE command now commits to the table incrementally, meaning that if the command fails, a retry will not need to process the entire data set.
    • Reduced the number of file system RPCs required to discover new data when using Databricks Delta as a streaming source.
    • Added support for df.writeStream.table(table-name) in Python to create a Databricks Delta table from a stream.
  • Improved performance for queries with multiple joins, aggregations, or windows.
  • Improved efficiency for partition-level pruning in queries with broadcast hash joins.
  • Improvements to whole stage code generation to detect duplicate expressions, reduce the amount of code generated, and improve performance for certain expression types.
  • High concurrency clusters now support running %fs in notebooks.
  • Upgraded Py4J used by PySpark to 0.10.7.
  • Improved performance of the Databricks IO Cache on Azure Ls series instances. The cache is now enabled by default on these instances, accelerating workloads that repeatedly read Parquet files.

Deprecation

  • Data Skipping outside of Databricks Delta is deprecated. An enhanced version of data skipping will continue to be available as part of Databricks Delta. We recommend that you switch to using Databricks Delta to continue to take advantage of this feature. See Databricks Delta Data Skipping for details.

Bug Fixes

  • Fixed incorrect predicate pushdown MERGE INTO statement for Delta when the ON condition had predicates that referenced only the target table.
  • Fixed bug in mapGroupsWithState and flatMapGroupsWithState that prevented setting timeouts when state has been removed (SPARK-22187).
  • Fixed bug the prevented watermarking to work correctly with Trigger.Once (SPARK-24699).
  • Update command now validates the columns in the SET clause to make sure all columns actually exist and no column is set more than once.
  • Fixed a potential race condition that could cause deadlocks for directory commit.
  • Fixed a bug causing a deprecated version of the DBFS client to be used when refreshing mounts.

Known Issues

  • Databricks Delta configuration options for a table take effect only in the first notebook that loads the table.

Apache Spark

Databricks Runtime 4.3 includes Apache Spark 2.3.1. This release includes all fixes and improvements included in Databricks Runtime 4.2, as well as the following additional bug fixes and improvements made to Spark:

  • [SPARK-24934][SQL] Explicitly whitelist supported types in upper/lower bounds for in-memory partition pruning
    • When complex data types are used in query filters against cached data, Spark always returns an empty result set. The in-memory stats-based pruning generates incorrect results, because null is set for upper/lower bounds for complex types. The fix is to not use in-memory stats-based pruning for complex types.
  • [SPARK-24957][SQL] Average with decimal followed by aggregation returns wrong result
    • The incorrect results of AVERAGE might be returned. The CAST added in the Average operator will be bypassed if the result of Divide is the same type which it is casted to.
  • [SPARK-24867][SQL] Add AnalysisBarrier to DataFrameWriter
    • SQL cache is not being used when using DataFrameWriter to write a DataFrame with UDF. This is a regression caused by the changes we made in AnalysisBarrier, since not all the Analyzer rules are idempotent.
  • [SPARK-24790][SQL] Allow complex aggregate expressions in Pivot
    • Relax the check to allow complex aggregate expressions, like ceil(sum(col1)) or sum(col1) + 1, which roughly means any aggregate expression that could appear in an Aggregate plan except pandas UDF.
  • [SPARK-24870][SQL] Cache can’t work normally if there are case letters in SQL
    • Fixes a plan canonicalization issue.
  • [SPARK-24852]Have spark.ml training use updated Instrumentation APIs.
  • [SPARK-24891][SQL] Fix HandleNullInputsForUDF rule
    • Make HandleNullInputsForUDF rule idempotent, to avoid plan mismatch in cache manager when a plan is analyzed more than once.
  • [SPARK-24878][SQL] Fix reverse function for array type of primitive type containing null.
  • [SPARK-24871][SQL] Refactor Concat and MapConcat to avoid creating concatenator object for each row.
  • [SPARK-24802][SQL] Add a new config for Optimization Rule Exclusion
    • Provides a config to users to exclude some optimizer rules.
  • [SPARK-24879][SQL] Fix NPE in Hive partition pruning filter pushdown
    • When the partition predicate is something like col IN (1, null), an NPE will be thrown. This patch fixes it.
  • [SPARK-23731][SQL] Make FileSourceScanExec canonicalizable after being (de)serialized
  • [SPARK-24755][CORE] Executor loss can cause task to not be resubmitted
    • Fixes a bug that Spark may not resubmit tasks failed by executor loss. This bug was introduced in Spark 2.3.
  • [SPARK-24677][CORE] Avoid NoSuchElementException from MedianHeap
    • Fixes a speculative tasks related bug when collecting task duration metrics.
  • [SPARK-24868][PYTHON] add sequence function in Python
  • [SPARK-21811][SPARK-24012][SPARK-24737][SPARK-24165][SPARK-24734][SPARK-24840][SQL] Fix type coercions and nullabilities.
  • [SPARK-24699][SS] Make watermarks work with Trigger.Once by saving updated watermark to commit log
  • [SPARK-24537][R] Add array_remove / array_zip / map_from_arrays / array_distinct
  • [SPARK-22187][SS] Update unsaferow format for saved state in flatMapGroupsWithState to allow timeouts with deleted state (4.x)
  • [SPARK-24681][SQL] Verify nested column names in Hive metastore
    • Make sure nested column names do not include ‘,’, ‘:’, and ‘;’ in Hive metastore
  • [SPARK-23486]cache the function name from the external catalog for lookupFunctions
    • To speed up function lookups.
  • [SPARK-24781][SQL] Using a reference from Dataset in Filter/Sort might not work
  • [SPARK-24208][SQL] Fix attribute deduplication for FlatMapGroupsInPandas
    • Fix self-join failure on a dataset which contains a FlatMapGroupsInPandas because of duplicate attributes
  • [SPARK-24530][PYTHON] Add a control to force Python version in Sphinx via environment variable, SPHINXPYTHON
  • [SPARK-24250]support accessing SQLConf inside tasks
    • Save all the SQL configs to job properties when an SQL execution is triggered. At executor side we rebuild the SQLConf from job properties.
  • [SPARK-23936][SQL] Implement map_concat
  • [SPARK-23914][SQL] Add array_union function
  • [SPARK-24732][SQL] Type coercion between MapTypes.
    • Adds support for type coercion between MapTypes where both the key types and the value types are compatible. For example, types MapType(IntegerType, FloatType) and MapType(LongType, DoubleType) can be coerced to type MapType(LongType, DoubleType)
  • [SPARK-24662][SQL][SS] Support limit in structured streaming
  • [SPARK-24730][SS] Add policy to choose max as global watermark when streaming query has multiple watermarks (branch-4.x)
  • [SPARK-24596][SQL] Non-cascading Cache Invalidation
    • When uncache or drop temp view, it’s unnecessary to cascadingly uncache all the plans that depend on the view, as the underlying data are not changed.
  • [SPARK-23927][SQL] Add “sequence” expression
  • [SPARK-24636][SQL] Type coercion of arrays for array_join function
  • [SPARK-22384][SQL] Refine partition pruning when attribute is wrapped in Cast
    • Improve partition pruning, able to push down partition predicates with safe type cast(int to long, not long to int).
  • [SPARK-24385][SQL] Resolve self-join condition ambiguity for EqualNullSafe
    • Implements EqualNullSafe for self-join condition ambiguity resolving.
  • [SPARK-24696][SQL] ColumnPruning rule fails to remove extra Project
    • Fixes a bug in the ColumnPruning rule that caused an infinite loop error in the Optimizer.
  • [SPARK-24603][SQL] Fix findTightestCommonType reference in comments
  • [SPARK-24613][SQL] Cache with UDF could not be matched with subsequent dependent caches
    • Wraps the logical plan with a AnalysisBarrier for execution plan compilation in CacheManager, in order to avoid the plan being analyzed again. This is also a regression of Spark 2.3.
  • [SPARK-24017][SQL] Refactor ExternalCatalog to be an interface
  • [SPARK-24324][PYTHON] Pandas Grouped Map UDF should assign result columns by name
    • Assigns result columns by schema name if user labeled with strings, otherwise using position.
  • [SPARK-23778][CORE] Avoid unneeded shuffle when union gets an empty RDD
    • Ignores incoming empty RDDs in the union method to avoid an unneeded extra-shuffle when all the other RDDs have the same partitioning.
  • [SPARK-24552][CORE][SQL] Use unique id instead of attempt number for writes.
    • Passes the unique task attempt id instead of attempt number to v2 data sources because attempt number is reused when stages are retried. This affects the data source V1 and V2 APIs, but the file format APIs will not be affected because DBR using different commit protocol.
  • [SPARK-24588][SS] streaming join should require HashClusteredPartitioning from children.
  • [SPARK-24589][CORE] Correctly identify tasks in output commit coordinator.
    • Adds more information to the stage state tracked by the coordinator, so that only one task is allowed to commit the output. This fix also removes the useless code changes introduced by SPARK-18113.
  • [SPARK-23933][SQL] Add map_from_arrays function
  • [SPARK-24583][SQL] Wrong schema type in InsertIntoDataSourceCommand
    • When creating a Databricks Delta table with NOT NULL constraints, we could drop nullability and insert the NULL values without checking the violation.
  • [SPARK-24542][SQL] UDF series UDFXPathXXXX allow users to pass carefully crafted XML to access arbitrary files
    • This is a security patch reported from the community. UDF series UDFXPathXXXX allow users to pass carefully crafted XML to access arbitrary files. When users use the external access control library, users might bypass them and access the file contents.
  • [SPARK-23934][SQL] Adding map_from_entries function
  • [SPARK-23912][SQL] Add array_distinct
  • [SPARK-24574][SQL] array_contains, array_position, array_remove and element_at functions deal with Column type

Maintenance Updates

Maintenance updates made to Databricks Runtime 4.3 since its initial release include:

  • Sep 11, 2018
    • [SPARK-25214][SS] Fix the issue that Kafka v2 source may return duplicated records when failOnDataLoss=false.
    • [SPARK-24987][SS] Fix Kafka consumer leak when no new offsets for TopicPartition.
    • Filter reduction should handle null value correctly.
    • Improved stability of execution engine.
  • Aug 28, 2018
    • Fixed a bug in Databricks Delta Delete command that would incorrectly delete the rows where the condition evaluates to null.
    • [SPARK-25142]Add error messages when Python worker could not open socket in _load_from_socket.
  • Aug 23, 2018
    • [SPARK-23935]mapEntry throws org.codehaus.commons.compiler.CompileException.
    • Fixed nullable map issue in Parquet reader.
    • [SPARK-25051][SQL] FixNullability should not stop on AnalysisBarrier.
    • [SPARK-25081]Fixed a bug where ShuffleExternalSorter may access a released memory page when spilling fails to allocate memory.
    • Fixed an interaction between Databricks Delta and Pyspark which could cause transient read failures.
    • [SPARK-25084]“distribute by” on multiple columns (wrap in brackets) may lead to codegen issue.
    • [SPARK-25096]Loosen nullability if the cast is force-nullable.
    • Lowered the default number of threads used by the Databricks Delta Optimize command, reducing memory overhead and committing data faster.
    • [SPARK-25114]Fix RecordBinaryComparator when subtraction between two words is divisible by Integer.MAX_VALUE.
    • Fixed secret manager redaction when command partially succeed.

System Environment

  • Operating System: Ubuntu 16.04.4 LTS
  • Java: 1.8.0_162
  • Scala: 2.11.8
  • Python: 2.7.12 for Python 2 clusters and 3.5.2 for Python 3 clusters. For details, see Python Clusters.
  • R: R version 3.4.4 (2018-03-15)
  • For GPU clusters, the following NVIDIA GPU libraries are installed:
    • CUDA 9.0
    • cuDNN 7.0

Installed Python Libraries

Library Version Library Version Library Version
ansi2html 1.1.1 argparse 1.2.1 backports-abc 0.5
boto 2.42.0 boto3 1.4.1 botocore 1.4.70
brewer2mpl 1.4.1 certifi 2016.2.28 cffi 1.7.0
chardet 2.3.0 colorama 0.3.7 configobj 5.0.6
cryptography 1.5 cycler 0.10.0 Cython 0.24.1
decorator 4.0.10 docutils 0.14 enum34 1.1.6
et-xmlfile 1.0.1 freetype-py 1.0.2 funcsigs 1.0.2
fusepy 2.0.4 futures 3.2.0 ggplot 0.6.8
html5lib 0.999 idna 2.1 ipaddress 1.0.16
ipython 2.2.0 ipython-genutils 0.1.0 jdcal 1.2
Jinja2 2.8 jmespath 0.9.0 llvmlite 0.13.0
lxml 3.6.4 MarkupSafe 0.23 matplotlib 1.5.3
mpld3 0.2 msgpack-python 0.4.7 ndg-httpsclient 0.3.3
numba 0.28.1 numpy 1.11.1 openpyxl 2.3.2
pandas 0.19.2 pathlib2 2.1.0 patsy 0.4.1
pexpect 4.0.1 pickleshare 0.7.4 Pillow 3.3.1
pip 10.0.1 ply 3.9 prompt-toolkit 1.0.7
psycopg2 2.6.2 ptyprocess 0.5.1 py4j 0.10.3
pyarrow 0.8.0 pyasn1 0.1.9 pycparser 2.14
Pygments 2.1.3 PyGObject 3.20.0 pyOpenSSL 16.0.0
pyparsing 2.2.0 pypng 0.0.18 Python 2.7.12
python-dateutil 2.5.3 python-geohash 0.8.5 pytz 2016.6.1
requests 2.11.1 s3transfer 0.1.9 scikit-learn 0.18.1
scipy 0.18.1 scour 0.32 seaborn 0.7.1
setuptools 39.2.0 simplejson 3.8.2 simples3 1.0
singledispatch 3.4.0.3 six 1.10.0 statsmodels 0.6.1
tornado 5.0.2 traitlets 4.3.0 urllib3 1.19.1
virtualenv 15.0.1 wcwidth 0.1.7 wheel 0.31.1
wsgiref 0.1.2        

Installed R Libraries

Library Version Library Version Library Version
abind 1.4-5 assertthat 0.2.0 backports 1.1.2
base 3.4.4 BH 1.66.0-1 bindr 0.1.1
bindrcpp 0.2.2 bit 1.1-12 bit64 0.9-7
bitops 1.0-6 blob 1.1.1 boot 1.3-20
brew 1.0-6 broom 0.4.4 car 3.0-0
carData 3.0-1 caret 6.0-79 cellranger 1.1.0
chron 2.3-52 class 7.3-14 cli 1.0.0
cluster 2.0.7-1 codetools 0.2-15 colorspace 1.3-2
commonmark 1.4 compiler 3.4.4 crayon 1.3.4
curl 3.2 CVST 0.2-1 data.table 1.10.4-3
datasets 3.4.4 DBI 0.8 ddalpha 1.3.1.1
DEoptimR 1.0-8 desc 1.1.1 devtools 1.13.5
dichromat 2.0-0 digest 0.6.15 dimRed 0.1.0
doMC 1.3.5 dplyr 0.7.4 DRR 0.0.3
forcats 0.3.0 foreach 1.4.4 foreign 0.8-70
gbm 2.1.3 ggplot2 2.2.1 git2r 0.21.0
glmnet 2.0-16 glue 1.2.0 gower 0.1.2
graphics 3.4.4 grDevices 3.4.4 grid 3.4.4
gsubfn 0.7 gtable 0.2.0 h2o 3.16.0.2
haven 1.1.1 hms 0.4.2 httr 1.3.1
hwriter 1.3.2 hwriterPlus 1.0-3 ipred 0.9-6
iterators 1.0.9 jsonlite 1.5 kernlab 0.9-25
KernSmooth 2.23-15 labeling 0.3 lattice 0.20-35
lava 1.6.1 lazyeval 0.2.1 littler 0.3.3
lme4 1.1-17 lubridate 1.7.3 magrittr 1.5
mapproj 1.2.6 maps 3.3.0 maptools 0.9-2
MASS 7.3-50 Matrix 1.2-14 MatrixModels 0.4-1
memoise 1.1.0 methods 3.4.4 mgcv 1.8-24
mime 0.5 minqa 1.2.4 mnormt 1.5-5
ModelMetrics 1.1.0 munsell 0.4.3 mvtnorm 1.0-7
nlme 3.1-137 nloptr 1.0.4 nnet 7.3-12
numDeriv 2016.8-1 openssl 1.0.1 openxlsx 4.0.17
parallel 3.4.4 pbkrtest 0.4-7 pillar 1.2.1
pkgconfig 2.0.1 pkgKitten 0.1.4 plogr 0.2.0
plyr 1.8.4 praise 1.0.0 prettyunits 1.0.2
pROC 1.11.0 prodlim 1.6.1 proto 1.0.0
psych 1.8.3.3 purrr 0.2.4 quantreg 5.35
R.methodsS3 1.7.1 R.oo 1.21.0 R.utils 2.6.0
R6 2.2.2 randomForest 4.6-14 RColorBrewer 1.1-2
Rcpp 0.12.16 RcppEigen 0.3.3.4.0 RcppRoll 0.2.2
RCurl 1.95-4.10 readr 1.1.1 readxl 1.0.0
recipes 0.1.2 rematch 1.0.1 reshape2 1.4.3
rio 0.5.10 rlang 0.2.0 robustbase 0.92-8
RODBC 1.3-15 roxygen2 6.0.1 rpart 4.1-13
rprojroot 1.3-2 Rserve 1.7-3 RSQLite 2.1.0
rstudioapi 0.7 scales 0.5.0 sfsmisc 1.1-2
sp 1.2-7 SparkR 2.3.1 SparseM 1.77
spatial 7.3-11 splines 3.4.4 sqldf 0.4-11
SQUAREM 2017.10-1 statmod 1.4.30 stats 3.4.4
stats4 3.4.4 stringi 1.1.7 stringr 1.3.0
survival 2.42-3 tcltk 3.4.4 TeachingDemos 2.10
testthat 2.0.0 tibble 1.4.2 tidyr 0.8.0
tidyselect 0.2.4 timeDate 3043.102 tools 3.4.4
utf8 1.1.3 utils 3.4.4 viridisLite 0.3.0
whisker 0.3-2 withr 2.1.2 xml2 1.2.0

Installed Java and Scala libraries (Scala 2.11 cluster version)

Group ID Artifact ID Version
antlr antlr 2.7.7
com.amazonaws amazon-kinesis-client 1.7.3
com.amazonaws aws-java-sdk-autoscaling 1.11.313
com.amazonaws aws-java-sdk-cloudformation 1.11.313
com.amazonaws aws-java-sdk-cloudfront 1.11.313
com.amazonaws aws-java-sdk-cloudhsm 1.11.313
com.amazonaws aws-java-sdk-cloudsearch 1.11.313
com.amazonaws aws-java-sdk-cloudtrail 1.11.313
com.amazonaws aws-java-sdk-cloudwatch 1.11.313
com.amazonaws aws-java-sdk-cloudwatchmetrics 1.11.313
com.amazonaws aws-java-sdk-codedeploy 1.11.313
com.amazonaws aws-java-sdk-cognitoidentity 1.11.313
com.amazonaws aws-java-sdk-cognitosync 1.11.313
com.amazonaws aws-java-sdk-config 1.11.313
com.amazonaws aws-java-sdk-core 1.11.313
com.amazonaws aws-java-sdk-datapipeline 1.11.313
com.amazonaws aws-java-sdk-directconnect 1.11.313
com.amazonaws aws-java-sdk-directory 1.11.313
com.amazonaws aws-java-sdk-dynamodb 1.11.313
com.amazonaws aws-java-sdk-ec2 1.11.313
com.amazonaws aws-java-sdk-ecs 1.11.313
com.amazonaws aws-java-sdk-efs 1.11.313
com.amazonaws aws-java-sdk-elasticache 1.11.313
com.amazonaws aws-java-sdk-elasticbeanstalk 1.11.313
com.amazonaws aws-java-sdk-elasticloadbalancing 1.11.313
com.amazonaws aws-java-sdk-elastictranscoder 1.11.313
com.amazonaws aws-java-sdk-emr 1.11.313
com.amazonaws aws-java-sdk-glacier 1.11.313
com.amazonaws aws-java-sdk-iam 1.11.313
com.amazonaws aws-java-sdk-importexport 1.11.313
com.amazonaws aws-java-sdk-kinesis 1.11.313
com.amazonaws aws-java-sdk-kms 1.11.313
com.amazonaws aws-java-sdk-lambda 1.11.313
com.amazonaws aws-java-sdk-logs 1.11.313
com.amazonaws aws-java-sdk-machinelearning 1.11.313
com.amazonaws aws-java-sdk-opsworks 1.11.313
com.amazonaws aws-java-sdk-rds 1.11.313
com.amazonaws aws-java-sdk-redshift 1.11.313
com.amazonaws aws-java-sdk-route53 1.11.313
com.amazonaws aws-java-sdk-s3 1.11.313
com.amazonaws aws-java-sdk-ses 1.11.313
com.amazonaws aws-java-sdk-simpledb 1.11.313
com.amazonaws aws-java-sdk-simpleworkflow 1.11.313
com.amazonaws aws-java-sdk-sns 1.11.313
com.amazonaws aws-java-sdk-sqs 1.11.313
com.amazonaws aws-java-sdk-ssm 1.11.313
com.amazonaws aws-java-sdk-storagegateway 1.11.313
com.amazonaws aws-java-sdk-sts 1.11.313
com.amazonaws aws-java-sdk-support 1.11.313
com.amazonaws aws-java-sdk-swf-libraries 1.11.22
com.amazonaws aws-java-sdk-workspaces 1.11.313
com.amazonaws jmespath-java 1.11.313
com.carrotsearch hppc 0.7.2
com.chuusai shapeless_2.11 2.3.2
com.clearspring.analytics stream 2.7.0
com.databricks Rserve 1.8-3
com.databricks dbml-local_2.11 0.4.1-db1-spark2.3
com.databricks dbml-local_2.11-tests 0.4.1-db1-spark2.3
com.databricks jets3t 0.7.1-0
com.databricks.scalapb compilerplugin_2.11 0.4.15-9
com.databricks.scalapb scalapb-runtime_2.11 0.4.15-9
com.esotericsoftware kryo-shaded 3.0.3
com.esotericsoftware minlog 1.3.0
com.fasterxml classmate 1.0.0
com.fasterxml.jackson.core jackson-annotations 2.6.7
com.fasterxml.jackson.core jackson-core 2.6.7
com.fasterxml.jackson.core jackson-databind 2.6.7.1
com.fasterxml.jackson.dataformat jackson-dataformat-cbor 2.6.7
com.fasterxml.jackson.datatype jackson-datatype-joda 2.6.7
com.fasterxml.jackson.module jackson-module-paranamer 2.6.7
com.fasterxml.jackson.module jackson-module-scala_2.11 2.6.7.1
com.github.fommil jniloader 1.1
com.github.fommil.netlib core 1.1.2
com.github.fommil.netlib native_ref-java 1.1
com.github.fommil.netlib native_ref-java-natives 1.1
com.github.fommil.netlib native_system-java 1.1
com.github.fommil.netlib native_system-java-natives 1.1
com.github.fommil.netlib netlib-native_ref-linux-x86_64-natives 1.1
com.github.fommil.netlib netlib-native_system-linux-x86_64-natives 1.1
com.github.luben zstd-jni 1.3.2-2
com.github.rwl jtransforms 2.4.0
com.google.code.findbugs jsr305 2.0.1
com.google.code.gson gson 2.2.4
com.google.guava guava 15.0
com.google.protobuf protobuf-java 2.6.1
com.googlecode.javaewah JavaEWAH 0.3.2
com.h2database h2 1.3.174
com.jamesmurty.utils java-xmlbuilder 1.1
com.jcraft jsch 0.1.50
com.jolbox bonecp 0.8.0.RELEASE
com.mchange c3p0 0.9.5.1
com.mchange mchange-commons-java 0.2.10
com.microsoft.azure azure-data-lake-store-sdk 2.2.8
com.microsoft.sqlserver mssql-jdbc 6.2.2.jre8
com.ning compress-lzf 1.0.3
com.sun.mail javax.mail 1.5.2
com.thoughtworks.paranamer paranamer 2.8
com.trueaccord.lenses lenses_2.11 0.3
com.twitter chill-java 0.8.4
com.twitter chill_2.11 0.8.4
com.twitter parquet-hadoop-bundle 1.6.0
com.twitter util-app_2.11 6.23.0
com.twitter util-core_2.11 6.23.0
com.twitter util-jvm_2.11 6.23.0
com.typesafe config 1.2.1
com.typesafe.scala-logging scala-logging-api_2.11 2.1.2
com.typesafe.scala-logging scala-logging-slf4j_2.11 2.1.2
com.univocity univocity-parsers 2.5.9
com.vlkan flatbuffers 1.2.0-3f79e055
com.zaxxer HikariCP 3.1.0
commons-beanutils commons-beanutils 1.7.0
commons-beanutils commons-beanutils-core 1.8.0
commons-cli commons-cli 1.2
commons-codec commons-codec 1.10
commons-collections commons-collections 3.2.2
commons-configuration commons-configuration 1.6
commons-dbcp commons-dbcp 1.4
commons-digester commons-digester 1.8
commons-httpclient commons-httpclient 3.1
commons-io commons-io 2.4
commons-lang commons-lang 2.6
commons-logging commons-logging 1.1.3
commons-net commons-net 2.2
commons-pool commons-pool 1.5.4
info.ganglia.gmetric4j gmetric4j 1.0.7
io.airlift aircompressor 0.8
io.dropwizard.metrics metrics-core 3.1.5
io.dropwizard.metrics metrics-ganglia 3.1.5
io.dropwizard.metrics metrics-graphite 3.1.5
io.dropwizard.metrics metrics-healthchecks 3.1.5
io.dropwizard.metrics metrics-jetty9 3.1.5
io.dropwizard.metrics metrics-json 3.1.5
io.dropwizard.metrics metrics-jvm 3.1.5
io.dropwizard.metrics metrics-log4j 3.1.5
io.dropwizard.metrics metrics-servlets 3.1.5
io.netty netty 3.9.9.Final
io.netty netty-all 4.1.17.Final
io.prometheus simpleclient 0.0.16
io.prometheus simpleclient_common 0.0.16
io.prometheus simpleclient_dropwizard 0.0.16
io.prometheus simpleclient_servlet 0.0.16
io.prometheus.jmx collector 0.7
javax.activation activation 1.1.1
javax.annotation javax.annotation-api 1.2
javax.el javax.el-api 2.2.4
javax.jdo jdo-api 3.0.1
javax.servlet javax.servlet-api 3.1.0
javax.servlet.jsp jsp-api 2.1
javax.transaction jta 1.1
javax.validation validation-api 1.1.0.Final
javax.ws.rs javax.ws.rs-api 2.0.1
javax.xml.bind jaxb-api 2.2.2
javax.xml.stream stax-api 1.0-2
javolution javolution 5.5.1
jline jline 2.11
joda-time joda-time 2.9.3
log4j apache-log4j-extras 1.2.17
log4j log4j 1.2.17
net.hydromatic eigenbase-properties 1.1.5
net.iharder base64 2.3.8
net.java.dev.jets3t jets3t 0.9.4
net.razorvine pyrolite 4.13
net.sf.jpam jpam 1.1
net.sf.opencsv opencsv 2.3
net.sf.supercsv super-csv 2.2.0
net.snowflake snowflake-jdbc 3.6.3
net.snowflake spark-snowflake_2.11 2.4.1
net.sourceforge.f2j arpack_combined_all 0.1
org.acplt oncrpc 1.0.7
org.antlr ST4 4.0.4
org.antlr antlr-runtime 3.4
org.antlr antlr4-runtime 4.7
org.antlr stringtemplate 3.2.1
org.apache.ant ant 1.9.2
org.apache.ant ant-jsch 1.9.2
org.apache.ant ant-launcher 1.9.2
org.apache.arrow arrow-format 0.8.0
org.apache.arrow arrow-memory 0.8.0
org.apache.arrow arrow-vector 0.8.0
org.apache.avro avro 1.7.7
org.apache.avro avro-ipc 1.7.7
org.apache.avro avro-ipc-tests 1.7.7
org.apache.avro avro-mapred-hadoop2 1.7.7
org.apache.calcite calcite-avatica 1.2.0-incubating
org.apache.calcite calcite-core 1.2.0-incubating
org.apache.calcite calcite-linq4j 1.2.0-incubating
org.apache.commons commons-compress 1.4.1
org.apache.commons commons-crypto 1.0.0
org.apache.commons commons-lang3 3.5
org.apache.commons commons-math3 3.4.1
org.apache.curator curator-client 2.7.1
org.apache.curator curator-framework 2.7.1
org.apache.curator curator-recipes 2.7.1
org.apache.derby derby 10.12.1.1
org.apache.directory.api api-asn1-api 1.0.0-M20
org.apache.directory.api api-util 1.0.0-M20
org.apache.directory.server apacheds-i18n 2.0.0-M15
org.apache.directory.server apacheds-kerberos-codec 2.0.0-M15
org.apache.hadoop hadoop-annotations 2.7.3
org.apache.hadoop hadoop-auth 2.7.3
org.apache.hadoop hadoop-client 2.7.3
org.apache.hadoop hadoop-common 2.7.3
org.apache.hadoop hadoop-hdfs 2.7.3
org.apache.hadoop hadoop-mapreduce-client-app 2.7.3
org.apache.hadoop hadoop-mapreduce-client-common 2.7.3
org.apache.hadoop hadoop-mapreduce-client-core 2.7.3
org.apache.hadoop hadoop-mapreduce-client-jobclient 2.7.3
org.apache.hadoop hadoop-mapreduce-client-shuffle 2.7.3
org.apache.hadoop hadoop-yarn-api 2.7.3
org.apache.hadoop hadoop-yarn-client 2.7.3
org.apache.hadoop hadoop-yarn-common 2.7.3
org.apache.hadoop hadoop-yarn-server-common 2.7.3
org.apache.htrace htrace-core 3.1.0-incubating
org.apache.httpcomponents httpclient 4.5.4
org.apache.httpcomponents httpcore 4.4.8
org.apache.ivy ivy 2.4.0
org.apache.orc orc-core-nohive 1.4.3
org.apache.orc orc-mapreduce-nohive 1.4.3
org.apache.parquet parquet-column 1.8.3-databricks2
org.apache.parquet parquet-common 1.8.3-databricks2
org.apache.parquet parquet-encoding 1.8.3-databricks2
org.apache.parquet parquet-format 2.3.1
org.apache.parquet parquet-hadoop 1.8.3-databricks2
org.apache.parquet parquet-jackson 1.8.3-databricks2
org.apache.thrift libfb303 0.9.3
org.apache.thrift libthrift 0.9.3
org.apache.xbean xbean-asm5-shaded 4.4
org.apache.zookeeper zookeeper 3.4.6
org.bouncycastle bcprov-jdk15on 1.58
org.codehaus.jackson jackson-core-asl 1.9.13
org.codehaus.jackson jackson-jaxrs 1.9.13
org.codehaus.jackson jackson-mapper-asl 1.9.13
org.codehaus.jackson jackson-xc 1.9.13
org.codehaus.janino commons-compiler 3.0.8
org.codehaus.janino janino 3.0.8
org.datanucleus datanucleus-api-jdo 3.2.6
org.datanucleus datanucleus-core 3.2.10
org.datanucleus datanucleus-rdbms 3.2.9
org.eclipse.jetty jetty-client 9.3.20.v20170531
org.eclipse.jetty jetty-continuation 9.3.20.v20170531
org.eclipse.jetty jetty-http 9.3.20.v20170531
org.eclipse.jetty jetty-io 9.3.20.v20170531
org.eclipse.jetty jetty-jndi 9.3.20.v20170531
org.eclipse.jetty jetty-plus 9.3.20.v20170531
org.eclipse.jetty jetty-proxy 9.3.20.v20170531
org.eclipse.jetty jetty-security 9.3.20.v20170531
org.eclipse.jetty jetty-server 9.3.20.v20170531
org.eclipse.jetty jetty-servlet 9.3.20.v20170531
org.eclipse.jetty jetty-servlets 9.3.20.v20170531
org.eclipse.jetty jetty-util 9.3.20.v20170531
org.eclipse.jetty jetty-webapp 9.3.20.v20170531
org.eclipse.jetty jetty-xml 9.3.20.v20170531
org.fusesource.leveldbjni leveldbjni-all 1.8
org.glassfish.hk2 hk2-api 2.4.0-b34
org.glassfish.hk2 hk2-locator 2.4.0-b34
org.glassfish.hk2 hk2-utils 2.4.0-b34
org.glassfish.hk2 osgi-resource-locator 1.0.1
org.glassfish.hk2.external aopalliance-repackaged 2.4.0-b34
org.glassfish.hk2.external javax.inject 2.4.0-b34
org.glassfish.jersey.bundles.repackaged jersey-guava 2.22.2
org.glassfish.jersey.containers jersey-container-servlet 2.22.2
org.glassfish.jersey.containers jersey-container-servlet-core 2.22.2
org.glassfish.jersey.core jersey-client 2.22.2
org.glassfish.jersey.core jersey-common 2.22.2
org.glassfish.jersey.core jersey-server 2.22.2
org.glassfish.jersey.media jersey-media-jaxb 2.22.2
org.hibernate hibernate-validator 5.1.1.Final
org.iq80.snappy snappy 0.2
org.javassist javassist 3.18.1-GA
org.jboss.logging jboss-logging 3.1.3.GA
org.jdbi jdbi 2.63.1
org.joda joda-convert 1.7
org.jodd jodd-core 3.5.2
org.json4s json4s-ast_2.11 3.2.11
org.json4s json4s-core_2.11 3.2.11
org.json4s json4s-jackson_2.11 3.2.11
org.lz4 lz4-java 1.4.0
org.mariadb.jdbc mariadb-java-client 2.1.2
org.mockito mockito-all 1.9.5
org.objenesis objenesis 2.1
org.postgresql postgresql 42.1.4
org.roaringbitmap RoaringBitmap 0.5.11
org.rocksdb rocksdbjni 5.2.1
org.rosuda.REngine REngine 2.1.0
org.scala-lang scala-compiler_2.11 2.11.8
org.scala-lang scala-library_2.11 2.11.8
org.scala-lang scala-reflect_2.11 2.11.8
org.scala-lang scalap_2.11 2.11.8
org.scala-lang.modules scala-parser-combinators_2.11 1.0.2
org.scala-lang.modules scala-xml_2.11 1.0.5
org.scala-sbt test-interface 1.0
org.scalacheck scalacheck_2.11 1.12.5
org.scalanlp breeze-macros_2.11 0.13.2
org.scalanlp breeze_2.11 0.13.2
org.scalatest scalatest_2.11 2.2.6
org.slf4j jcl-over-slf4j 1.7.16
org.slf4j jul-to-slf4j 1.7.16
org.slf4j slf4j-api 1.7.16
org.slf4j slf4j-log4j12 1.7.16
org.spark-project.hive hive-beeline 1.2.1.spark2
org.spark-project.hive hive-cli 1.2.1.spark2
org.spark-project.hive hive-exec 1.2.1.spark2
org.spark-project.hive hive-jdbc 1.2.1.spark2
org.spark-project.hive hive-metastore 1.2.1.spark2
org.spark-project.spark unused 1.0.0
org.spire-math spire-macros_2.11 0.13.0
org.spire-math spire_2.11 0.13.0
org.springframework spring-core 4.1.4.RELEASE
org.springframework spring-test 4.1.4.RELEASE
org.tukaani xz 1.0
org.typelevel machinist_2.11 0.6.1
org.typelevel macro-compat_2.11 1.1.1
org.xerial sqlite-jdbc 3.8.11.2
org.xerial.snappy snappy-java 1.1.2.6
org.yaml snakeyaml 1.16
oro oro 2.0.8
software.amazon.ion ion-java 1.0.2
stax stax-api 1.0.1
xmlenc xmlenc 0.52