Skip to content

node_put_fsm_active maxing out [JIRA: RIAK-2268] #789

@cread

Description

@cread

We've been building riak from source for a while, but I've had trouble getting the 2.1.x releases built reliably and so would like to revert back to using the .deb package. The problem I have is that in our test environment we always manage to max out node_put_fsm_active under sustained write loads, and they never drop, even when disk activity drops to zero.

When running riak 2.0.4 on R16B02-basho5 (our current prod version) everything is working as expected.

Using the .deb package of 2.0.4 pushes us to R16B02_basho6, which is where we see the problem arrive of node_puts_fsm_active going up and never dropping back own again, even after the write load stops.

Further testing with the riak 2.0.6 and 2.1.1 .deb packages (both contain R16B02_basho8) show the same problem.

As the one that works is using hipe as the VM, and the one that locks up does not, could it be a VM difference?

Some technical details of the hardware:

  • Tests run on exactly the same machines
  • Ubuntu 14.04
  • Kernel 3.13.0-53-generic
  • Filesystem is XFS
  • Backend is bitcask
  • 5 node cluster
  • Ring size 128

Select riak-admin status output

System that works:

sys_system_architecture : <<"x86_64-unknown-linux-gnu">>
sys_system_version : <<"Erlang R16B02-basho5 (erts-5.10.3) [source] [64-bit] [smp:24:24] [async-threads:64] [hipe] [kernel-poll:true]">>
riak_auth_mods_version : <<"2.0.0-1-g9ae39fe">>
erlydtl_version : <<"0.7.0">>
riak_control_version : <<"2.0.0-3-ge6f9b10">>
cluster_info_version : <<"2.0.0-0-g1fae829">>
yokozuna_version : <<"2.0.0-34-g122659d">>
ibrowse_version : <<"4.0.1">>
riak_search_version : <<"2.0.0-7-g86bfb89">>
merge_index_version : <<"2.0.0-0-gb701dde">>
riak_kv_version : <<"2.0.2-42-g757111c">>
riak_api_version : <<"2.0.1-6-g618cab1">>
riak_pb_version : <<"2.0.0.16-0-gfc18a9b">>
protobuffs_version : <<"0.8.1p4-0-g5257dfe">>
riak_dt_version : <<"2.0.0-22-g5dd5307">>
sidejob_version : <<"2.0.0-0-gc5aabba">>
riak_pipe_version : <<"2.0.0-10-gc8fc8da">>
riak_core_version : <<"2.0.1-100-gb2103c2">>
exometer_core_version : <<"1.0.0-basho1-0-ga46aa8b">>
poolboy_version : <<"0.8.1p2-0-g84d836a">>
pbkdf2_version : <<"2.0.0-0-g7076584">>
eleveldb_version : <<"2.0.0-3-g10fcb78">>
clique_version : <<"0.2.5-0-g3af4db8">>
bitcask_version : <<"1.7.0">>
basho_stats_version : <<"1.0.3">>
webmachine_version : <<"1.10.5-0-ge5f8233">>
mochiweb_version : <<"1.5.1p6">>
inets_version : <<"5.9.6">>
erlang_js_version : <<"1.3.0-0-g07467d8">>
runtime_tools_version : <<"1.8.12">>
os_mon_version : <<"2.2.13">>
riak_sysmon_version : <<"2.0.0">>
ssl_version : <<"5.3.1">>
public_key_version : <<"0.20">>
crypto_version : <<"3.1">>
asn1_version : <<"2.0.3">>
sasl_version : <<"2.3.3">>
lager_version : <<"2.0.3">>
syslog_version : <<"1.0.2">>
goldrush_version : <<"0.1.6">>
compiler_version : <<"4.9.3">>
syntax_tools_version : <<"1.6.11">>
stdlib_version : <<"1.19.3">>
kernel_version : <<"2.16.3">>

System that locks up:

sys_system_architecture : <<"x86_64-unknown-linux-gnu">>
sys_system_version : <<"Erlang R16B02_basho6 (erts-5.10.3) [source-bcd8abb] [64-bit] [smp:24:24] [async-threads:64] [kernel-poll:true] [frame-pointer]">>
riak_auth_mods_version : <<"2.0.0-1-g9ae39fe">>
erlydtl_version : <<"0.7.0">>
riak_control_version : <<"2.0.0-1-gadc50dc">>
cluster_info_version : <<"2.0.0-0-g1fae829">>
yokozuna_version : <<"2.0.0-24-g1e4dba8">>
ibrowse_version : <<"4.0.1">>
riak_search_version : <<"2.0.0-7-g86bfb89">>
merge_index_version : <<"2.0.0-0-gb701dde">>
riak_kv_version : <<"2.0.2-30-g23de41a">>
riak_api_version : <<"2.0.1-6-g618cab1">>
riak_pb_version : <<"2.0.0.16-0-gfc18a9b">>
protobuffs_version : <<"0.8.1p4-0-g5257dfe">>
riak_dt_version : <<"2.0.0-11-g38797a0">>
sidejob_version : <<"2.0.0-0-gc5aabba">>
riak_pipe_version : <<"2.0.0-8-g4662392">>
riak_core_version : <<"2.0.1-93-g0562e0b">>
exometer_core_version : <<"1.0.0-basho1-0-ga46aa8b">>
poolboy_version : <<"0.8.1p2-0-g84d836a">>
pbkdf2_version : <<"2.0.0-0-g7076584">>
eleveldb_version : <<"2.0.0-3-g10fcb78">>
clique_version : <<"0.2.4-0-g14a0966">>
bitcask_version : <<"1.7.0">>
basho_stats_version : <<"1.0.3">>
webmachine_version : <<"1.10.5-0-ge5f8233">>
mochiweb_version : <<"1.5.1p6">>
inets_version : <<"5.9.6">>
erlang_js_version : <<"1.3.0-0-g07467d8">>
runtime_tools_version : <<"1.8.12">>
os_mon_version : <<"2.2.13">>
riak_sysmon_version : <<"2.0.0">>
ssl_version : <<"5.3.1">>
public_key_version : <<"0.20">>
crypto_version : <<"3.1">>
asn1_version : <<"2.0.3">>
sasl_version : <<"2.3.3">>
lager_version : <<"2.0.3">>
syslog_version : <<"1.0.2">>
goldrush_version : <<"0.1.6">>
compiler_version : <<"4.9.3">>
syntax_tools_version : <<"1.6.11">>
stdlib_version : <<"1.19.3">>
kernel_version : <<"2.16.3">>

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions