FAQ
hbase scanner api returns null row names, or skips row names if different column families do not have entries for some rows
---------------------------------------------------------------------------------------------------------------------------

Key: HADOOP-2350
URL: https://issues.apache.org/jira/browse/HADOOP-2350
Project: Hadoop
Issue Type: Bug
Components: contrib/hbase
Reporter: Michael Bieniosek
Fix For: 0.16.0
Attachments: TestScannerAPI.java

I'm attaching a test case that fails.

I noticed that if I create a table with two column families, and start a scanner on a row which only has an entry for one column family, the scanner will skip ahead to the row name for which the other column family has an entry.

eg., if I insert rows so my table will look like this:
{code}
row - a:a - b:b
aaa a:1 nil
bbb a:2 b:2
ccc a:3 b:3
{code}

The scanner will tell me my table looks something like this:
{code}
row - a:a - b:b
bbb a:1 b:2
bbb a:2 b:3
{code}


--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Search Discussions

  • Michael Bieniosek (JIRA) at Dec 5, 2007 at 1:58 am
    [ https://issues.apache.org/jira/browse/HADOOP-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Michael Bieniosek updated HADOOP-2350:
    --------------------------------------

    Attachment: TestScannerAPI.java

    Here's a test case which illustrates the problem
    hbase scanner api returns null row names, or skips row names if different column families do not have entries for some rows
    ---------------------------------------------------------------------------------------------------------------------------

    Key: HADOOP-2350
    URL: https://issues.apache.org/jira/browse/HADOOP-2350
    Project: Hadoop
    Issue Type: Bug
    Components: contrib/hbase
    Reporter: Michael Bieniosek
    Fix For: 0.16.0

    Attachments: TestScannerAPI.java


    I'm attaching a test case that fails.
    I noticed that if I create a table with two column families, and start a scanner on a row which only has an entry for one column family, the scanner will skip ahead to the row name for which the other column family has an entry.
    eg., if I insert rows so my table will look like this:
    {code}
    row - a:a - b:b
    aaa a:1 nil
    bbb a:2 b:2
    ccc a:3 b:3
    {code}
    The scanner will tell me my table looks something like this:
    {code}
    row - a:a - b:b
    bbb a:1 b:2
    bbb a:2 b:3
    {code}
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Michael Bieniosek (JIRA) at Dec 5, 2007 at 2:00 am
    [ https://issues.apache.org/jira/browse/HADOOP-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12548506 ]

    Michael Bieniosek commented on HADOOP-2350:
    -------------------------------------------

    A secondary problem is that the HScannerInterface.iterator().next() sometimes returns a Map.Entry with a null value for both the key and value. I think this may have something to do with the health of my cluster.
    hbase scanner api returns null row names, or skips row names if different column families do not have entries for some rows
    ---------------------------------------------------------------------------------------------------------------------------

    Key: HADOOP-2350
    URL: https://issues.apache.org/jira/browse/HADOOP-2350
    Project: Hadoop
    Issue Type: Bug
    Components: contrib/hbase
    Reporter: Michael Bieniosek
    Fix For: 0.16.0

    Attachments: TestScannerAPI.java


    I'm attaching a test case that fails.
    I noticed that if I create a table with two column families, and start a scanner on a row which only has an entry for one column family, the scanner will skip ahead to the row name for which the other column family has an entry.
    eg., if I insert rows so my table will look like this:
    {code}
    row - a:a - b:b
    aaa a:1 nil
    bbb a:2 b:2
    ccc a:3 b:3
    {code}
    The scanner will tell me my table looks something like this:
    {code}
    row - a:a - b:b
    bbb a:1 b:2
    bbb a:2 b:3
    {code}
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Michael Bieniosek (JIRA) at Dec 5, 2007 at 2:02 am
    [ https://issues.apache.org/jira/browse/HADOOP-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Michael Bieniosek updated HADOOP-2350:
    --------------------------------------

    Attachment: TestScannerAPI.java

    Oops, re-upload without hardcoded hostname.
    hbase scanner api returns null row names, or skips row names if different column families do not have entries for some rows
    ---------------------------------------------------------------------------------------------------------------------------

    Key: HADOOP-2350
    URL: https://issues.apache.org/jira/browse/HADOOP-2350
    Project: Hadoop
    Issue Type: Bug
    Components: contrib/hbase
    Reporter: Michael Bieniosek
    Fix For: 0.16.0

    Attachments: TestScannerAPI.java, TestScannerAPI.java


    I'm attaching a test case that fails.
    I noticed that if I create a table with two column families, and start a scanner on a row which only has an entry for one column family, the scanner will skip ahead to the row name for which the other column family has an entry.
    eg., if I insert rows so my table will look like this:
    {code}
    row - a:a - b:b
    aaa a:1 nil
    bbb a:2 b:2
    ccc a:3 b:3
    {code}
    The scanner will tell me my table looks something like this:
    {code}
    row - a:a - b:b
    bbb a:1 b:2
    bbb a:2 b:3
    {code}
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Michael Bieniosek (JIRA) at Dec 5, 2007 at 2:02 am
    [ https://issues.apache.org/jira/browse/HADOOP-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12548507 ]

    Michael Bieniosek commented on HADOOP-2350:
    -------------------------------------------

    This was not a problem in release 0.15; it has only occurred since we moved to trunk.
    hbase scanner api returns null row names, or skips row names if different column families do not have entries for some rows
    ---------------------------------------------------------------------------------------------------------------------------

    Key: HADOOP-2350
    URL: https://issues.apache.org/jira/browse/HADOOP-2350
    Project: Hadoop
    Issue Type: Bug
    Components: contrib/hbase
    Reporter: Michael Bieniosek
    Fix For: 0.16.0

    Attachments: TestScannerAPI.java, TestScannerAPI.java


    I'm attaching a test case that fails.
    I noticed that if I create a table with two column families, and start a scanner on a row which only has an entry for one column family, the scanner will skip ahead to the row name for which the other column family has an entry.
    eg., if I insert rows so my table will look like this:
    {code}
    row - a:a - b:b
    aaa a:1 nil
    bbb a:2 b:2
    ccc a:3 b:3
    {code}
    The scanner will tell me my table looks something like this:
    {code}
    row - a:a - b:b
    bbb a:1 b:2
    bbb a:2 b:3
    {code}
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Michael Bieniosek (JIRA) at Dec 5, 2007 at 2:04 am
    [ https://issues.apache.org/jira/browse/HADOOP-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Michael Bieniosek updated HADOOP-2350:
    --------------------------------------

    Attachment: (was: TestScannerAPI.java)
    hbase scanner api returns null row names, or skips row names if different column families do not have entries for some rows
    ---------------------------------------------------------------------------------------------------------------------------

    Key: HADOOP-2350
    URL: https://issues.apache.org/jira/browse/HADOOP-2350
    Project: Hadoop
    Issue Type: Bug
    Components: contrib/hbase
    Reporter: Michael Bieniosek
    Fix For: 0.16.0

    Attachments: TestScannerAPI.java


    I'm attaching a test case that fails.
    I noticed that if I create a table with two column families, and start a scanner on a row which only has an entry for one column family, the scanner will skip ahead to the row name for which the other column family has an entry.
    eg., if I insert rows so my table will look like this:
    {code}
    row - a:a - b:b
    aaa a:1 nil
    bbb a:2 b:2
    ccc a:3 b:3
    {code}
    The scanner will tell me my table looks something like this:
    {code}
    row - a:a - b:b
    bbb a:1 b:2
    bbb a:2 b:3
    {code}
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Edward Yoon (JIRA) at Dec 5, 2007 at 2:14 am
    [ https://issues.apache.org/jira/browse/HADOOP-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12548511 ]

    Edward Yoon commented on HADOOP-2350:
    -------------------------------------

    Oh.. Looks like a critical thing.
    hbase scanner api returns null row names, or skips row names if different column families do not have entries for some rows
    ---------------------------------------------------------------------------------------------------------------------------

    Key: HADOOP-2350
    URL: https://issues.apache.org/jira/browse/HADOOP-2350
    Project: Hadoop
    Issue Type: Bug
    Components: contrib/hbase
    Reporter: Michael Bieniosek
    Fix For: 0.16.0

    Attachments: TestScannerAPI.java


    I'm attaching a test case that fails.
    I noticed that if I create a table with two column families, and start a scanner on a row which only has an entry for one column family, the scanner will skip ahead to the row name for which the other column family has an entry.
    eg., if I insert rows so my table will look like this:
    {code}
    row - a:a - b:b
    aaa a:1 nil
    bbb a:2 b:2
    ccc a:3 b:3
    {code}
    The scanner will tell me my table looks something like this:
    {code}
    row - a:a - b:b
    bbb a:1 b:2
    bbb a:2 b:3
    {code}
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • stack (JIRA) at Dec 5, 2007 at 7:27 am
    [ https://issues.apache.org/jira/browse/HADOOP-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    stack reassigned HADOOP-2350:
    -----------------------------

    Assignee: stack
    hbase scanner api returns null row names, or skips row names if different column families do not have entries for some rows
    ---------------------------------------------------------------------------------------------------------------------------

    Key: HADOOP-2350
    URL: https://issues.apache.org/jira/browse/HADOOP-2350
    Project: Hadoop
    Issue Type: Bug
    Components: contrib/hbase
    Reporter: Michael Bieniosek
    Assignee: stack
    Fix For: 0.16.0

    Attachments: TestScannerAPI.java


    I'm attaching a test case that fails.
    I noticed that if I create a table with two column families, and start a scanner on a row which only has an entry for one column family, the scanner will skip ahead to the row name for which the other column family has an entry.
    eg., if I insert rows so my table will look like this:
    {code}
    row - a:a - b:b
    aaa a:1 nil
    bbb a:2 b:2
    ccc a:3 b:3
    {code}
    The scanner will tell me my table looks something like this:
    {code}
    row - a:a - b:b
    bbb a:1 b:2
    bbb a:2 b:3
    {code}
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Michael Bieniosek (JIRA) at Dec 6, 2007 at 12:06 am
    [ https://issues.apache.org/jira/browse/HADOOP-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Michael Bieniosek updated HADOOP-2350:
    --------------------------------------

    Priority: Critical (was: Major)

    Bump priority because it is a correctness issue
    hbase scanner api returns null row names, or skips row names if different column families do not have entries for some rows
    ---------------------------------------------------------------------------------------------------------------------------

    Key: HADOOP-2350
    URL: https://issues.apache.org/jira/browse/HADOOP-2350
    Project: Hadoop
    Issue Type: Bug
    Components: contrib/hbase
    Reporter: Michael Bieniosek
    Assignee: stack
    Priority: Critical
    Fix For: 0.16.0

    Attachments: TestScannerAPI.java


    I'm attaching a test case that fails.
    I noticed that if I create a table with two column families, and start a scanner on a row which only has an entry for one column family, the scanner will skip ahead to the row name for which the other column family has an entry.
    eg., if I insert rows so my table will look like this:
    {code}
    row - a:a - b:b
    aaa a:1 nil
    bbb a:2 b:2
    ccc a:3 b:3
    {code}
    The scanner will tell me my table looks something like this:
    {code}
    row - a:a - b:b
    bbb a:1 b:2
    bbb a:2 b:3
    {code}
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Jim Kellerman (JIRA) at Dec 7, 2007 at 9:35 pm
    [ https://issues.apache.org/jira/browse/HADOOP-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Jim Kellerman reassigned HADOOP-2350:
    -------------------------------------

    Assignee: Jim Kellerman (was: stack)
    hbase scanner api returns null row names, or skips row names if different column families do not have entries for some rows
    ---------------------------------------------------------------------------------------------------------------------------

    Key: HADOOP-2350
    URL: https://issues.apache.org/jira/browse/HADOOP-2350
    Project: Hadoop
    Issue Type: Bug
    Components: contrib/hbase
    Reporter: Michael Bieniosek
    Assignee: Jim Kellerman
    Priority: Critical
    Fix For: 0.16.0

    Attachments: TestScannerAPI.java


    I'm attaching a test case that fails.
    I noticed that if I create a table with two column families, and start a scanner on a row which only has an entry for one column family, the scanner will skip ahead to the row name for which the other column family has an entry.
    eg., if I insert rows so my table will look like this:
    {code}
    row - a:a - b:b
    aaa a:1 nil
    bbb a:2 b:2
    ccc a:3 b:3
    {code}
    The scanner will tell me my table looks something like this:
    {code}
    row - a:a - b:b
    bbb a:1 b:2
    bbb a:2 b:3
    {code}
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Jim Kellerman (JIRA) at Dec 7, 2007 at 9:35 pm
    [ https://issues.apache.org/jira/browse/HADOOP-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Work on HADOOP-2350 started by Jim Kellerman.
    hbase scanner api returns null row names, or skips row names if different column families do not have entries for some rows
    ---------------------------------------------------------------------------------------------------------------------------

    Key: HADOOP-2350
    URL: https://issues.apache.org/jira/browse/HADOOP-2350
    Project: Hadoop
    Issue Type: Bug
    Components: contrib/hbase
    Reporter: Michael Bieniosek
    Assignee: Jim Kellerman
    Priority: Critical
    Fix For: 0.16.0

    Attachments: TestScannerAPI.java


    I'm attaching a test case that fails.
    I noticed that if I create a table with two column families, and start a scanner on a row which only has an entry for one column family, the scanner will skip ahead to the row name for which the other column family has an entry.
    eg., if I insert rows so my table will look like this:
    {code}
    row - a:a - b:b
    aaa a:1 nil
    bbb a:2 b:2
    ccc a:3 b:3
    {code}
    The scanner will tell me my table looks something like this:
    {code}
    row - a:a - b:b
    bbb a:1 b:2
    bbb a:2 b:3
    {code}
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Jim Kellerman (JIRA) at Dec 8, 2007 at 5:44 am
    [ https://issues.apache.org/jira/browse/HADOOP-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Jim Kellerman updated HADOOP-2350:
    ----------------------------------

    Attachment: patch.txt
    hbase scanner api returns null row names, or skips row names if different column families do not have entries for some rows
    ---------------------------------------------------------------------------------------------------------------------------

    Key: HADOOP-2350
    URL: https://issues.apache.org/jira/browse/HADOOP-2350
    Project: Hadoop
    Issue Type: Bug
    Components: contrib/hbase
    Reporter: Michael Bieniosek
    Assignee: Jim Kellerman
    Priority: Critical
    Fix For: 0.16.0

    Attachments: patch.txt, TestScannerAPI.java


    I'm attaching a test case that fails.
    I noticed that if I create a table with two column families, and start a scanner on a row which only has an entry for one column family, the scanner will skip ahead to the row name for which the other column family has an entry.
    eg., if I insert rows so my table will look like this:
    {code}
    row - a:a - b:b
    aaa a:1 nil
    bbb a:2 b:2
    ccc a:3 b:3
    {code}
    The scanner will tell me my table looks something like this:
    {code}
    row - a:a - b:b
    bbb a:1 b:2
    bbb a:2 b:3
    {code}
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Jim Kellerman (JIRA) at Dec 8, 2007 at 5:46 am
    [ https://issues.apache.org/jira/browse/HADOOP-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Jim Kellerman updated HADOOP-2350:
    ----------------------------------

    Status: Patch Available (was: In Progress)

    Tests run locally. Try Hudson
    hbase scanner api returns null row names, or skips row names if different column families do not have entries for some rows
    ---------------------------------------------------------------------------------------------------------------------------

    Key: HADOOP-2350
    URL: https://issues.apache.org/jira/browse/HADOOP-2350
    Project: Hadoop
    Issue Type: Bug
    Components: contrib/hbase
    Reporter: Michael Bieniosek
    Assignee: Jim Kellerman
    Priority: Critical
    Fix For: 0.16.0

    Attachments: patch.txt, TestScannerAPI.java


    I'm attaching a test case that fails.
    I noticed that if I create a table with two column families, and start a scanner on a row which only has an entry for one column family, the scanner will skip ahead to the row name for which the other column family has an entry.
    eg., if I insert rows so my table will look like this:
    {code}
    row - a:a - b:b
    aaa a:1 nil
    bbb a:2 b:2
    ccc a:3 b:3
    {code}
    The scanner will tell me my table looks something like this:
    {code}
    row - a:a - b:b
    bbb a:1 b:2
    bbb a:2 b:3
    {code}
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Hadoop QA (JIRA) at Dec 8, 2007 at 6:47 am
    [ https://issues.apache.org/jira/browse/HADOOP-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12549664 ]

    Hadoop QA commented on HADOOP-2350:
    -----------------------------------

    +1 overall. Here are the results of testing the latest attachment
    http://issues.apache.org/jira/secure/attachment/12371280/patch.txt
    against trunk revision r602284.

    @author +1. The patch does not contain any @author tags.

    javadoc +1. The javadoc tool did not generate any warning messages.

    javac +1. The applied patch does not generate any new compiler warnings.

    findbugs +1. The patch does not introduce any new Findbugs warnings.

    core tests +1. The patch passed core unit tests.

    contrib tests +1. The patch passed contrib unit tests.

    Test results: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1301/testReport/
    Findbugs warnings: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1301/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
    Checkstyle results: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1301/artifact/trunk/build/test/checkstyle-errors.html
    Console output: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1301/console

    This message is automatically generated.
    hbase scanner api returns null row names, or skips row names if different column families do not have entries for some rows
    ---------------------------------------------------------------------------------------------------------------------------

    Key: HADOOP-2350
    URL: https://issues.apache.org/jira/browse/HADOOP-2350
    Project: Hadoop
    Issue Type: Bug
    Components: contrib/hbase
    Reporter: Michael Bieniosek
    Assignee: Jim Kellerman
    Priority: Critical
    Fix For: 0.16.0

    Attachments: patch.txt, TestScannerAPI.java


    I'm attaching a test case that fails.
    I noticed that if I create a table with two column families, and start a scanner on a row which only has an entry for one column family, the scanner will skip ahead to the row name for which the other column family has an entry.
    eg., if I insert rows so my table will look like this:
    {code}
    row - a:a - b:b
    aaa a:1 nil
    bbb a:2 b:2
    ccc a:3 b:3
    {code}
    The scanner will tell me my table looks something like this:
    {code}
    row - a:a - b:b
    bbb a:1 b:2
    bbb a:2 b:3
    {code}
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Jim Kellerman (JIRA) at Dec 8, 2007 at 6:57 am
    [ https://issues.apache.org/jira/browse/HADOOP-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Jim Kellerman updated HADOOP-2350:
    ----------------------------------

    Resolution: Fixed
    Status: Resolved (was: Patch Available)

    Tests passed (including new one derived from the one supplied with the issue). Committed.
    hbase scanner api returns null row names, or skips row names if different column families do not have entries for some rows
    ---------------------------------------------------------------------------------------------------------------------------

    Key: HADOOP-2350
    URL: https://issues.apache.org/jira/browse/HADOOP-2350
    Project: Hadoop
    Issue Type: Bug
    Components: contrib/hbase
    Reporter: Michael Bieniosek
    Assignee: Jim Kellerman
    Priority: Critical
    Fix For: 0.16.0

    Attachments: patch.txt, TestScannerAPI.java


    I'm attaching a test case that fails.
    I noticed that if I create a table with two column families, and start a scanner on a row which only has an entry for one column family, the scanner will skip ahead to the row name for which the other column family has an entry.
    eg., if I insert rows so my table will look like this:
    {code}
    row - a:a - b:b
    aaa a:1 nil
    bbb a:2 b:2
    ccc a:3 b:3
    {code}
    The scanner will tell me my table looks something like this:
    {code}
    row - a:a - b:b
    bbb a:1 b:2
    bbb a:2 b:3
    {code}
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Hudson (JIRA) at Dec 8, 2007 at 12:18 pm
    [ https://issues.apache.org/jira/browse/HADOOP-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12549702 ]

    Hudson commented on HADOOP-2350:
    --------------------------------

    Integrated in Hadoop-Nightly #326 (See [http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Nightly/326/])
    hbase scanner api returns null row names, or skips row names if different column families do not have entries for some rows
    ---------------------------------------------------------------------------------------------------------------------------

    Key: HADOOP-2350
    URL: https://issues.apache.org/jira/browse/HADOOP-2350
    Project: Hadoop
    Issue Type: Bug
    Components: contrib/hbase
    Reporter: Michael Bieniosek
    Assignee: Jim Kellerman
    Priority: Critical
    Fix For: 0.16.0

    Attachments: patch.txt, TestScannerAPI.java


    I'm attaching a test case that fails.
    I noticed that if I create a table with two column families, and start a scanner on a row which only has an entry for one column family, the scanner will skip ahead to the row name for which the other column family has an entry.
    eg., if I insert rows so my table will look like this:
    {code}
    row - a:a - b:b
    aaa a:1 nil
    bbb a:2 b:2
    ccc a:3 b:3
    {code}
    The scanner will tell me my table looks something like this:
    {code}
    row - a:a - b:b
    bbb a:1 b:2
    bbb a:2 b:3
    {code}
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.

Related Discussions

Discussion Navigation
viewthread | post
Discussion Overview
groupcommon-dev @
categorieshadoop
postedDec 5, '07 at 1:58a
activeDec 8, '07 at 12:18p
posts16
users1
websitehadoop.apache.org...
irc#hadoop

1 user in discussion

Hudson (JIRA): 16 posts

People

Translate

site design / logo © 2022 Grokbase