I'm running RabbitMQ 3.0.1 on a Windows Server 2008 x64 machine with Erlang
R15B01, but from time to time RabbitMQ crashes and corrupts its database.


When starting, I get the following error in my log file.


Error description:
{could_not_start,rabbit,
{bad_return,
{{rabbit,start,[normal,[]]},
{'EXIT',
{rabbit,failure_during_boot,
{badmatch,
{error,
{{{function_clause,
[{rabbit_queue_index,journal_minus_segment1,
[{no_pub,no_del,ack},undefined],
[]},
{rabbit_queue_index,'-journal_minus_segment/2-fun-0-',4,[]},
{array,sparse_foldl_3,7,[{file,"array.erl"},{line,1675}]},
{array,sparse_foldl_2,9,[{file,"array.erl"},{line,1669}]},
{rabbit_queue_index,'-recover_journal/1-fun-0-',1,[]},
{lists,map,2,[{file,"lists.erl"},{line,1173}]},
{rabbit_queue_index,segment_map,2,[]},
{rabbit_queue_index,recover_journal,1,[]}]},
{gen_server2,call,[<0.211.0>,out,infinity]}},
{child,undefined,msg_store_persistent,
{rabbit_msg_store,start_link,
[msg_store_persistent,
"c:/Users/RabbitUser/AppData/Roaming/RabbitMQ/db/rabbit@RH-PORTAL-mnesia",
[],
{#Fun<rabbit_queue_index.2.89252852>,
{start,
[{resource,<<"rexholmmcb">>,queue,<<"DynamicsNav_control">>},
{resource,<<"rexholmmcb">>,queue,
<<"RexholmService_control">>},
{resource,<<"rexholmmcb">>,queue,<<"RexholmService">>},
{resource,<<"rexholmmcb">>,queue,<<"DynamicsNav">>},
{resource,<<"rexholmmcb">>,queue,<<"McbService">>},
{resource,<<"rexholmmcb">>,queue,<<"McbService_error">>},
{resource,<<"rexholmmcb">>,queue,
<<"McbService_control">>}]}}]},
transient,4294967295,worker,
[rabbit_msg_store]}}}}}}}}}


Log files (may contain more information):
C:/Users/RabbitUser/AppData/Roaming/RabbitMQ/log/rabbit@RH-PORTAL.log
C:/Users/RabbitUser/AppData/Roaming/RabbitMQ/log/rabbit@RH-PORTAL-sasl.log


Just before that badmatch error, I have these entries in the log:
=ERROR REPORT==== 7-Jan-2013::13:50:35 ===
** Generic server <0.165.0> terminating
** Last message in was {'$gen_cast',
{submit_async,
#Fun<rabbit_queue_index.26.17658518>}}
** When Server state == 1
** Reason for termination ==
** {function_clause,
[{rabbit_queue_index,journal_minus_segment1,
[{no_pub,no_del,ack},undefined],
[]},
{rabbit_queue_index,'-journal_minus_segment/2-fun-0-',4,[]},
{array,sparse_foldl_3,7,[{file,"array.erl"},{line,1675}]},
{array,sparse_foldl_2,9,[{file,"array.erl"},{line,1669}]},
{rabbit_queue_index,'-recover_journal/1-fun-0-',1,[]},
{lists,map,2,[{file,"lists.erl"},{line,1173}]},
{rabbit_queue_index,segment_map,2,[]},
{rabbit_queue_index,recover_journal,1,[]}]}


=ERROR REPORT==== 7-Jan-2013::13:50:35 ===
** Generic server <0.216.0> terminating
** Last message in was {'$gen_cast',
{submit_async,
#Fun<rabbit_queue_index.26.17658518>}}
** When Server state == 1
** Reason for termination ==
** {noproc,[{erlang,link,[<0.215.0>],[]},
{rabbit_queue_index,'-queue_index_walker/1-fun-0-',2,[]},
{worker_pool_worker,handle_cast,2,[]},
{gen_server2,handle_msg,2,[]},
{proc_lib,init_p_do_apply,3,[{file,"proc_lib.erl"},{line,227}]}]}


=ERROR REPORT==== 7-Jan-2013::13:50:35 ===
** Generic server <0.217.0> terminating
** Last message in was {'$gen_cast',
{submit_async,
#Fun<rabbit_queue_index.26.17658518>}}
** When Server state == 2
** Reason for termination ==
** {noproc,[{erlang,link,[<0.215.0>],[]},
{rabbit_queue_index,'-queue_index_walker/1-fun-0-',2,[]},
{worker_pool_worker,handle_cast,2,[]},
{gen_server2,handle_msg,2,[]},
{proc_lib,init_p_do_apply,3,[{file,"proc_lib.erl"},{line,227}]}]}


Is this a known problem, and what could I do to resolve it?
I don't know how to provoke it. It happens sporadically, with several weeks
between occurrences.


Best Regards
Kenneth

  • Kenneth Fuglsang at Jan 8, 2013 at 7:45 am
    By the way, I was able to get RabbitMQ running again by deleting the queues
    and msg_store_transient directories.
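A minimal sketch of that workaround, with one deliberate change: it renames the two directories instead of deleting them, so the files stay available for later inspection. The directory names come from the post; the function name `set_aside`, the `.bak-` suffix, and the assumption that both directories sit directly under the node's mnesia directory are illustrative only. The broker has to be stopped first, and messages held in the affected queues should be assumed lost either way.

```python
import shutil
import time
from pathlib import Path

# Directory names taken from the post; everything else is an assumption.
STALE_DIRS = ("queues", "msg_store_transient")


def set_aside(mnesia_dir, names=STALE_DIRS, suffix=None):
    """Rename the named subdirectories of mnesia_dir out of the way.

    Returns a list of (old_path, new_path) pairs for the directories that
    were actually moved. Run this only while the broker is stopped.
    """
    base = Path(mnesia_dir)
    # Hypothetical naming scheme: <name>.bak-<timestamp>
    suffix = suffix or time.strftime("%Y%m%d-%H%M%S")
    moved = []
    for name in names:
        src = base / name
        if not src.is_dir():
            continue  # nothing to set aside for this name
        dst = base / f"{name}.bak-{suffix}"
        shutil.move(str(src), str(dst))
        moved.append((src, dst))
    return moved
```

Calling `set_aside(...)` with the mnesia directory path that appears in the error report would leave the renamed copies next to where the originals were, which also keeps them available if someone asks for the corrupted files.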


  • Emile Joubert at Jan 8, 2013 at 11:22 am
    Hi Kenneth,

    On 08/01/13 07:41, Kenneth Fuglsang wrote:
    > I'm running RabbitMQ 3.0.1 on a Windows Server 2008 x64 machine with
    > Erlang R15B01, but from time to time RabbitMQ crashes and corrupts its
    > database.
    > {{{function_clause,
    > [{rabbit_queue_index,journal_minus_segment1,
    > [{no_pub,no_del,ack},undefined],
    > []},
    > {rabbit_queue_index,'-journal_minus_segment/2-fun-0-',4,[]},
    > Is this a known problem, and what could I do to resolve it?
    > I don't know how to provoke it. It happens sporadically with several
    > weeks between.

    This is not a known problem. In order to investigate further it will be
    useful to get a copy of the entire mnesia database directory as well as
    the output of "rabbitmqctl report" and complete logfiles. Please feel
    free to contact me off-list to arrange.
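One way to gather the three items requested above into a single archive is sketched below. It is only an illustration: the function name `bundle_diagnostics`, the archive layout, and the default command tuple are assumptions. On Windows the command will probably need to be the full path to `rabbitmqctl.bat`, and if the broker cannot start, the report file will simply contain whatever error `rabbitmqctl` printed.

```python
import subprocess
import zipfile
from pathlib import Path


def bundle_diagnostics(mnesia_dir, log_dir, out_zip,
                       report_cmd=("rabbitmqctl", "report")):
    """Zip the mnesia directory, the log directory and the report output.

    report_cmd is an assumption about how rabbitmqctl is invoked on the
    machine; pass None to skip running it.
    """
    out_zip = Path(out_zip)
    with zipfile.ZipFile(out_zip, "w", zipfile.ZIP_DEFLATED) as zf:
        for label, root in (("mnesia", Path(mnesia_dir)),
                            ("log", Path(log_dir))):
            for path in sorted(root.rglob("*")):
                if path.is_file():
                    rel = path.relative_to(root).as_posix()
                    zf.write(path, f"{label}/{rel}")
        if report_cmd:
            try:
                result = subprocess.run(list(report_cmd),
                                        capture_output=True, text=True)
                report = result.stdout + result.stderr
            except OSError as exc:
                # Command missing or not executable: record that instead.
                report = f"could not run {report_cmd!r}: {exc}\n"
            zf.writestr("rabbitmqctl-report.txt", report)
    return out_zip
```

The output archive should be written somewhere outside the two directories being collected, and ideally while the broker is stopped so the files do not change mid-copy.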


    Can you rule out any other process tampering with the files in the
    mnesia database directory? Errors like the one you report are possible
    if any other process makes any changes to the files in that directory.
    If there is any filesystem corruption or issues with the storage
    hardware then problems are also possible.

    -Emile
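To help rule out the tampering mentioned above, a rough check is to hash every file in the mnesia directory while the broker is stopped, wait, and hash again: with the broker down, nothing should change. This is a sketch under that assumption, not a diagnostic shipped with RabbitMQ, and the function names `snapshot` and `diff_snapshots` are made up for illustration.

```python
import hashlib
from pathlib import Path


def snapshot(root):
    """Map each file under root (relative path) to the SHA-256 of its bytes."""
    root = Path(root)
    result = {}
    for path in sorted(root.rglob("*")):
        if path.is_file():
            digest = hashlib.sha256(path.read_bytes()).hexdigest()
            result[path.relative_to(root).as_posix()] = digest
    return result


def diff_snapshots(before, after):
    """Return (added, removed, changed) as sorted lists of relative paths."""
    added = sorted(set(after) - set(before))
    removed = sorted(set(before) - set(after))
    changed = sorted(p for p in set(before) & set(after)
                     if before[p] != after[p])
    return added, removed, changed
```

Taking one snapshot right after stopping the broker and another just before restarting it, then passing both to `diff_snapshots`, gives three lists that should all be empty if no other program touched the directory. The check says nothing about filesystem or hardware faults, and it is meaningless while the broker is running, since the broker rewrites these files itself.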
