CASSANDRA-5424

1. Symptom

Nodetool repair -pr command won’t repair the full range when a keyspace isn’t in all datacentre for multi-datacentre setup.

Background: this is an unsupported operation in the current version of cassadra.

 

Category (in the spreadsheet):

wrong computation

 

1.1 Severity

critical

1.2 Was there exception thrown? (Exception column in the spreadsheet)

no exception

 

1.2.1 Were there multiple exceptions?

no

1.3 Was there a long propagation of the failure?

no

1.4 Scope of the failure (e.g., single client, all clients, single file, entire fs, etc.)

single node

 

Catastrophic? (spreadsheet column)

no 

2. How to reproduce this failure

2.0 Version

1.2.5

2.1 Configuration

standard configuration with multiple datacentres. Make sure one of the data center does not have full range of a keyspace.

 

# of Nodes?

2 (1 in each data centre)

2.2 Reproduction procedure

1) Run nodetool repair -pr command (feature start)

 

Num triggering events

1

 

2.2.1 Timing order (Order important column)

NA

2.2.2 Events order externally controllable? (Order externally controllable? column)

yes

2.3 Can the logs tell how to reproduce the failure?

yes

2.4 How many machines needed?

2

3. Diagnosis procedure

Error msg?

yes

3.1 Detailed Symptom (where you start)

Nodetool repair -pr command on all nodes won’t repair the full range when a keyspace isn’t in all of the data centers.

3.2 Backward inference

Initially, the developer just suggested a workaround because implementing this feature correctly would not be a good idea. The developer suggested to fix in cassandra 2.0. Then the developer decides to implement this new feature anyways.

 

4. Root cause

trying to use an unsupported feature.

4.1 Category:

semantic

4.2 Are there multiple fault?

no

4.2 Can we automatically test it?

yes

5. Fix

5.1 How?

The fix implemented the feature the reported wanted.